CentOS下安装Solr5.3

时间 2019-11-12

原文原文链接

https://lucene.apache.org/solr/html

前言java

Solr如今是一个独立的服务器。

从Solr5.0开始，Solr再也不发布为在任何Servlet容器中部署的“war”Web应用程序包（Web Application Archive）。Solr如今部署为一个独立的java服务器应用程序，包含在Unix和Windows平台上可使用的启动和中止脚本，以及将Solr做为服务安装到类Unix平台的/etc/init.d下的安装脚本。

本质上，Solr仍然以Servlet APIs实现，并在Jetty上运行，但只是做为一个实现。部署为“webapp”到其余的Servlet容器（或其余Jetty实例）上不被支持，可能在将来的Solr 5.x版本不会工做。而可能会带来Solr的其余改变，事实上是利用自定义网络协议栈功能。web

安装JREapache

须要Java Runtime Environment(JRE) 1.7或更高版本，先验证。json

# java -version浏览器
安装可参考《在CentOS下安装JDK8》，包含JRE。服务器

安装Solr5.3网络

去http://www.apache.org/dyn/closer.lua/lucene/solr/5.3.0下载Solr安装文件solr-5.3.0.tgz。架构
将solr-5.3.0.tgz文件放到/tmp目录下，执行以下脚本： app

# cd /tmp

# tar -zxvf solr-5.3.0.tgz // 解压压缩包
建立应用程序和数据目录

# mkdir -p /data/solr /usr/local/solr
建立运行solr的用户并赋权

# groupadd solr

# useradd -g solr solr

# chown -R solr.solr /data/solr /usr/local/solr
安装solr服务

# solr-5.3.0/bin/install_solr_service.sh solr-5.3.0.tgz -d /data/solr -i /usr/local/solr

检查服务状态

# service solr status

将会看到以下输出：

Solr process 29692 running on port 8983

{

"solr_home":"/data/solr/data/",

"version":"5.3.0 1696229 - noble - 2015-08-17 17:10:43",

"startTime":"2015-09-16T01:32:03.919Z",

"uptime":"0 days, 0 hours, 3 minutes, 6 seconds",

"memory":"89.8 MB (%18.3) of 490.7 MB"}

solr命令用法

定位到solr应用程序目录

# cd /usr/local/solr/solr
查看solr命令选项

# ./bin/solr

Usage: solr COMMAND OPTIONS where COMMAND is one of: start, stop, restart, status, healthcheck, create, create_core, create_collection, delete

Standalone server example (start Solr running in the background on port 8984):

./solr start -p 8984

SolrCloud example (start Solr running in SolrCloud mode using localhost:2181 to connect to ZooKeeper, with 1g max heap size and remote Java debug options enabled):

./solr start -c -m 1g -z localhost:2181 -a "-Xdebug -Xrunjdwp:transport=dt_socket,server=y,suspend=n,address=1044"

Pass -help after any COMMAND to see command-specific usage information, such as: ./solr start -help or ./solr stop -help

# ./bin/solr start -help

Usage: solr start [-f] [-c] [-h hostname] [-p port] [-d directory] [-z zkHost] [-m memory] [-e example] [-s solr.solr.home] [-a "additional-options"] [-V]

-f Start Solr in foreground; default starts Solr in the background and sends stdout / stderr to solr-PORT-console.log

-c or -cloud Start Solr in SolrCloud mode; if -z not supplied, an embedded ZooKeeper instance is started on Solr port+1000, such as 9983 if Solr is bound to 8983

-h <host> Specify the hostname for this Solr instance

-p <port> Specify the port to start the Solr HTTP listener on; default is 8983 The specified port (SOLR_PORT) will also be used to determine the stop port STOP_PORT=($SOLR_PORT-1000) and JMX RMI listen port RMI_PORT=(1$SOLR_PORT). For instance, if you set -p 8985, then the STOP_PORT=7985 and RMI_PORT=18985

-d <dir> Specify the Solr server directory; defaults to server

-z <zkHost> ZooKeeper connection string; only used when running in SolrCloud mode using -c To launch an embedded ZooKeeper instance, don't pass this parameter.

-m <memory> Sets the min (-Xms) and max (-Xmx) heap size for the JVM, such as: -m 4g results in: -Xms4g -Xmx4g; by default, this script sets the heap size to 512m

-s <dir> Sets the solr.solr.home system property; Solr will create core directories under this directory. This allows you to run multiple Solr instances on the same host while reusing the same server directory set using the -d parameter. If set, the specified directory should contain a solr.xml file, unless solr.xml exists in ZooKeeper. This parameter is ignored when running examples (-e), as the solr.solr.home depends on which example is run. The default value is server/solr.

-e <example> Name of the example to run; available examples: cloud: SolrCloud example techproducts: Comprehensive example illustrating many of Solr's core capabilities dih: Data Import Handler schemaless: Schema-less example

-a Additional parameters to pass to the JVM when starting Solr, such as to setup Java debug options. For example, to enable a Java debugger to attach to the Solr JVM you could pass: -a "-agentlib:jdwp=transport=dt_socket,server=y,suspend=n,address=18983" In most cases, you should wrap the additional parameters in double quotes.

-noprompt Don't prompt for input; accept all defaults when running examples that accept user input

-V Verbose messages from this script

# ./bin/solr create -help

Usage: solr create [-c name] [-d confdir] [-n configName] [-shards #] [-replicationFactor #] [-p port]

Create a core or collection depending on whether Solr is running in standalone (core) or SolrCloud mode (collection). In other words, this action detects which mode Solr is running in, and then takes the appropriate action (either create_core or create_collection). For detailed usage instructions, do:

bin/solr create_core -help

or

bin/solr create_collection –help

安装solr服务脚本用法

运行安装脚本

# /tmp/solr-5.3.0/bin/install_solr_service.sh

ERROR: Must specify the path to the Solr installation archive, such as solr-5.0.0.tgz

Usage: install_solr_service.sh path_to_solr_distribution_archive OPTIONS

The first argument to the script must be a path to a Solr distribution archive, such as solr-5.0.0.tgz (only .tgz or .zip are supported formats for the archive)

Supported OPTIONS include:

-d Directory for live / writable Solr files, such as logs, pid files, and index data; defaults to /var/solr

-i Directory to extract the Solr installation archive; defaults to /opt/ The specified path must exist prior to using this script.

-p Port Solr should bind to; default is 8983

-s Service name; defaults to solr

-u User to own the Solr files and run the Solr process as; defaults to solr This script will create the specified user account if it does not exist.

NOTE: Must be run as the root user

建立集合

在这个部分，咱们建立一个简单的Solr集合。

Solr能够有多个集合，但在这个示例，咱们只使用一个。使用以下命令，建立一个新的集合。咱们以solr用户运行以免任何权限错误。

# su - solr -c "/usr/local/solr/solr/bin/solr create -c gettingstarted -n data_driven_schema_configs"

在这个命令中，gettingstarted是集合的名字，-n指定配置集合。Solr默认提供了3个配置集合。这里咱们使用的是schemaless，意思是能够提供任意名字的任意列，类型将会被猜想。

Setup new core instance directory: /data/solr/data/gettingstarted

Creating new core 'gettingstarted' using command: http://localhost:8983/solr/admin/cores?action=CREATE&name=gettingstarted&instanceDir=gettingstarted

{ "responseHeader":{ "status":0, "QTime":3247}, "core":"gettingstarted"}

你如今已经建立了集合，并能够开始添加数据。默认的架构只须要提供一列：id。没有其余默认列，只有动态列。

添加和查询文档

在这个部分，咱们将浏览Solr Web界面，添加一些文档到集合中。

问你使用Web浏览器访问http://your_server_ip:8983/solr，Solr Web界面将会显示为：

这个Web界面包含大量的有用信息，能够被用于调试在使用中产生的任何问题。

集合被划分为核，这就是为何在Web界面中有大量的对核的参照。如今，gettingstarted只包含一个核，名为gettingstarted。在左手边，能够看到“Core Selector”下拉菜单，咱们能够选择gettingstarted看到更多信息。

在选择gettingstarted核以后，选择“Documents”。文档存储可被Solr搜索的真实数据。由于咱们使用了一个无模式的配置，咱们可使用任何列。我使用以下的JSON示例添加了一个单一文档，经过拷贝如下到“Documents(s)”列：

{

"number": 1,

"president": "George Washington",

"birth_year": 1732,

"death_year": 1799,

"took_office": "1789-04-30",

"left_office": "1797-03-04",

"party": "No Party"

}

点击“Submit document”添加文档到索引。过一会，你会看到以下信息：

添加文档后的输出：

Status: success

Response:

{

"responseHeader": {

"status": 0,

"QTime": 290

}

你可使用一个相似的或彻底不一样的结构添加更多文档，但你也能够只使用一个文档继续。

如今，选择左边的“Query”去查询咱们刚刚添加的文档。保持屏幕中的默认值，在点击“Execute Query”以后，你最多看到10个文档，依赖于你添加了多少：

查询输出

{

"responseHeader": {

"status": 0,

"QTime": 39,

"params": {

"q": "*:*",

"indent": "true",

"wt": "json",

"_": "1442371884598"

}

},

"response": {

"numFound": 1,

"start": 0,

"docs": [

{

"number": [

1

],

"president": [

"George Washington"

],

"birth_year": [

1732

],

"death_year": [

1799

],

"took_office": [

"1789-04-30T00:00:00Z"

],

"left_office": [

"1797-03-04T00:00:00Z"

],

"party": [

"No Party"

],

"id": "b9b294c1-4b68-4d96-adc2-f6fb77f60932",

"_version_": 1512437472611532800

}

]

}

参考：http://lucene.apache.org/solr/quickstart.html