solr 3.5安装 及分词器配置

1、搭建说明java

环境:web

1.系统:windows7 x86_64apache

2.java 版本 :jdk1.8windows

3.tomcat 版本:tomcat 6浏览器

4.solr:solr3.5.0tomcat

5.lucene:lucene 3.5.0app

6.中文分词起mmseg4jwebapp

2、下载jsp

solr 3.5下载地址:http://www.apache.org/dist//lucene/solr/ide

3、solr 3.5的安装

1.修改tomcat http服务监听端口

<Connector port="8080" protocol="HTTP/1.1" connectionTimeout="20000" redirectPort="8443" URIEncoding="UTF-8"/>

 修改默认端口8080,能够不修改,添加字符编码  URIEncoding="UTF-8" 不然中文会乱码

    注:若是是你一个系统启动多个tomcat你要同时修改三个端口,以保证每一个tomcat都不一样,才能同时启动

    1.tomcat http端口

    2.tomcat shutdown 端口

2.在apache-solr-3.5.0\dist目录下找到apache-solr-3.5.0.war把重命名为solr.war 并上传到 tomcat-6/webapps下。

3.建立一个solr.xml文件,在tomcat-7.0.14\conf\Catalina\localhost\下建立一个solr.xml文件。内容为:      

<?xml version="1.0" encoding="UTF-8"?>

<Context docBase="F:\apache-tomcat-6.0.37\webapps\solr"   debug="0" crossContext="true">

    <Environment name="solr/home" type="java.lang.String"  value="F:\apache-tomcat-6.0.37\solr" override="true" />

</Context>

注:docBase 指向应用所在目录,主要用来为应用设置环境变量

4.把apache-solr-3.5.0\example目录下的solr文件夹上传到e:/tomcat-6目录下(即tomcat根目录下,同bin在一个目录下).索引文件会默认会放到 e:/tomcat-6\solr\data下.

5.从新tomcat 在浏览器中输入:http://localhost:8080/  会出现:

        welcome to solr  

        solr admin  说明发布已成功!

 

 

 

4、分词器配置

   1.整合 汉字分词器 IKAnalyzer 3.2.8

        1).添加IKAnalyzer3.2.8.jar

           把IKAnalyzer3.2.8.jar放到 tomcat-7.0.14\webapps\solr\WEB-INF\lib下;

        2).schema.xml 增长信息

            进入目录编辑schema.xml文件,在<Types>下添加如下内容: 

            

<fieldType name="text" class="solr.TextField" >  

               <analyzer class="org.wltea.analyzer.lucene.IKAnalyzer"/>  

               <analyzer type="index">  

                   <tokenizer class="org.wltea.analyzer.solr.IKTokenizerFactory" isMaxWordLength="false"/>  

                   <filter class="solr.StopFilterFactory"  

                           ignoreCase="true" words="stopwords.txt"/>  

                   <filter class="solr.WordDelimiterFilterFactory"  

                           generateWordParts="1"  

                           generateNumberParts="1"  

                           catenateWords="1"  

                           catenateNumbers="1"  

                           catenateAll="0"  

                           splitOnCaseChange="1"/>  

                   <filter class="solr.LowerCaseFilterFactory"/>  

                   <filter class="solr.EnglishPorterFilterFactory"  

                       protected="protwords.txt"/>  

                   <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>  

               </analyzer>  

                 <analyzer type="query">  

                   <tokenizer class="org.wltea.analyzer.solr.IKTokenizerFactory" isMaxWordLength="false"/>  

                   <filter class="solr.StopFilterFactory"  

                           ignoreCase="true" words="stopwords.txt"/>  

                   <filter class="solr.WordDelimiterFilterFactory"  

                           generateWordParts="1"  

                           generateNumberParts="1"  

                           catenateWords="1"  

                           catenateNumbers="1"  

                           catenateAll="0"  

                           splitOnCaseChange="1"/>  

                   <filter class="solr.LowerCaseFilterFactory"/>  

                   <filter class="solr.EnglishPorterFilterFactory"  

                       protected="protwords.txt"/>  

                   <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>  

               </analyzer>  

    </fieldType>

 

而后在<fields>下添加:

 

<field name="name1" type="text" indexed="true" stored="true" required="true" />

        3)、重起tomcat 在浏览器中输入:http://localhost:8080/solr/admin/analysis.jsp进去 solr admin 显示:

相关文章
相关标签/搜索