全文检索搜索引擎都会有这样一个功能:输入一个字符便自动提示出可选的短语:
html
要实现这种功能,能够利用solr的SuggestComponent,SuggestComponent这种方法利用Lucene的Suggester实现,并支持Lucene中可用的全部查找实现。算法
这个是本身的字段:apache
<field name="name" type="string" indexed="true" stored="true"/> <field name="username" type="string" indexed="true" stored="true"/> <field name="password" type="string" indexed="true" stored="true"/> <field name="phone" type="string" indexed="true" stored="true"/>
<field name="suggest_username" type="text_suggest" indexed="true" stored="true"/> <copyField source="username" dest="suggest_username"/>
copyField的source表示源,dest表示目标。浏览器
<fieldType name="text_suggest" class="solr.TextField" positionIncrementGap="100"> <analyzer> <tokenizer class="solr.StandardTokenizerFactory"/> <filter class="solr.LowerCaseFilterFactory"/> <filter class="solr.EnglishPossessiveFilterFactory"/> </analyzer> </fieldType>
tokenizer就是分词器,官方解释:ide
The job of a tokenizer is to break up a stream of text into tokens, where each token is (usually) a sub-sequence of the characters in the text。测试
就是指将文本流分解为标记tokens,这写tokens也是文本的子序列。优化
filter就是过滤器,官方解释:ui
The job of a filter is usually easier than that of a tokenizer since in most cases a filter looks at each token in the stream sequentially and decides whether to pass it along, replace it or discard it.搜索引擎
不一样的过滤器将输入流替换或者丢弃或者直接经过。code
solrconfig.xml文件也在新建核心core的conf文件夹下
<searchComponent name="suggest" class="solr.SuggestComponent"> <lst name="suggester"> <str name="name">AnalyzingSuggester</str> <str name="lookupImpl">AnalyzingLookupFactory</str> <str name="dictionaryImpl">DocumentDictionaryFactory</str> <str name="field">suggest_username</str> <str name="weightField">suggest_username</str> <str name="payloadField">id</str> <str name="suggestAnalyzerFieldType">text_suggest</str> <str name="buildOnStartup">false</str> </lst> </searchComponent>
在searchComponent中的suggester须要配置一些参数。
buildOnStartup;启动的时候构建建议索引。
<requestHandler name="/suggest" class="solr.SearchHandler" startup="lazy" > <lst name="defaults"> <str name="suggest">true</str> <str name="suggest.count">10</str> </lst> <arr name="components"> <!-- 上面配置的searchComponent名字suggest --> <str>suggest</str> </arr> </requestHandler>
这里
经过浏览器地址栏输入链接测试:
http://127.0.0.1:8983/solr/user/suggest?suggest=true&suggest.build=true&suggest.dictionary=AnalyzingSuggester&suggest.q=a
suggest.q=a ;suggest查询内容。
https://lucene.apache.org/solr/guide/8_1/suggester.html#suggest-request-handler-parameters
"responseHeader": { "status": 0, "QTime": 10 }, "command": "build", "suggest": { "AnalyzingSuggester": { "aoa": { "numFound": 3, "suggestions": [ { "term": "aoa lee", "weight": 0, "payload": "7859b42e-3428-40c0-9036-6d50767a5ff2" }, { "term": "aoa lee key", "weight": 0, "payload": "0bead5d5-2570-44ba-830b-030f8a888ea3" }, { "term": "aoa lee key lol bob", "weight": 0, "payload": "9cc3c4d7-7d34-422b-8164-a4c4c92caa08" } ] } } } }
返回的结果中主要有三个参数: