之前由于在项目中使用的solr3.6,服务器总是莫名其妙的挂掉,各种解决未果,于是尝试换到solr4,想试试solrcloud和zookeeper集群看看效果怎么样,实际效果比3.6稳定很多,下面说一下项目中我们的具体现在一个测试的配置。 我们配置3台zookper服务器和3台solr服务器,由于没有多台机器测试,zookeeper的集群都在一台服务器上,tomcat部署在3台服务器上。
zookeeper是什么
ZooKeeper 顾名思义 动物园管理员,他是拿来管大象(Hadoop) 、 蜜蜂(Hive) 、 小猪(Pig) 的管理员, Apache Hbase和 Apache Solr 以及LinkedIn sensei 等项目中都采用到了 Zookeeper。ZooKeeper是一个分布式的,开放源码的分布式应用程序协调服务,ZooKeeper是以Fast Paxos算法为基础,实现同步服务,配置维护和命名服务等分布式应用。
zookeeper安装及配置
zookeeper下载
zookeeper配置
---------------
首先先把/etc/hosts下的主机名映射127.0.1.1改成127.0.0.1。
---------------
(1)解压为zookeeper
- tar -xf -C /home/myuser/zookeeper/
复制zookeeper文件夹3份,分别重名名为zookeeperA,zookeeperB,zookeeperC。
并且创建数据快照以及日志存放文件夹,命名为zooA,zooB,zooC。
(2)编辑对应的zookeeper配置文件,复制zookeeperconf下zoo_sample.cfg为zoo.cfg
- cd /home/myuser/zookeeperA/conf
- cp zoo_sample.cfg zoo.cfg
- # The number of milliseconds of each tick
- tickTime=2000
- # The number of ticks that the initial
- # synchronization phase can take
- initLimit=10
- # The number of ticks that can pass between
- # sending a request and getting an acknowledgement
- syncLimit=5
- # the directory where the snapshot is stored.
- # do not use /tmp for storage, /tmp here is just
- # example sakes.
- dataDir=/home/myuser/zooA/data
- # the port at which the clients will connect
- clientPort=2181
- # ZooKeeper server and its port no. # ZooKeeper ensemble should know about every other machine in the ensemble # specify server id by creating 'myid' file in the dataDir # use hostname instead of IP address for convenient maintenance
- server.1=127.0.0.1:2888:3888
- server.2=127.0.0.1:2988:3988
- server.3=127.0.0.1:2088:3088
- #
- # Be sure to read the maintenance section of the
- # administrator guide before turning on autopurge.
- #
- # http://zookeeper.apache.org/doc/current/zookeeperAdmin.html#sc_maintenance
- #
- # The number of snapshots to retain in dataDir
- #autopurge.snapRetainCount=3
- # Purge task interval in hours
- # Set to "0" to disable auto purge feature
- #autopurge.purgeInterval=1
- dataLogDir=/home/myuser/zooA/log
initLimit:多少个心跳时间内,允许其他server连接并初始化数据,如果ZooKeeper管理的数据较大,则应相应增大这个值
clientPort:服务的监听端口
dataDir:用于存放内存数据库快照的文件夹,同时用于集群的myid文件也存在这个文件夹里(注意:一个配置文件只能包含一个dataDir字样,即使它被注释掉了。)
dataLogDir:用于单独设置transaction log的目录,transaction log分离可以避免和普通log还有快照的竞争
syncLimit:多少个tickTime内,允许follower同步,如果follower落后太多,则会被丢弃。
(4)创建myid文件
(4)创建myid文件
- cd /home/myuser/zooA/data
- sudo sh -c 'echo "1" >> myid'
(5)启动zookeeper
- cd /home/myuser/zookeeperA/bin
- sudo sh zkServer.sh start
- [root@weibo bin]# sh zkServer.sh status
- JMX enabled by default
- Using config: /home/weibo/zookeeperA/bin/../conf/zoo.cfg
- Mode: follower
(6)客户端连接zookeeper
- [root@weibo bin]# sh zkCli.sh
- Connecting to localhost:2181
- 2013-05-10 15:00:25,363 [myid:] - INFO [main:Environment@100] - Client environment:zookeeper.version=3.4.5-1392090, built on 09/30/2012 17:52 GMT
- 2013-05-10 15:00:25,369 [myid:] - INFO [main:Environment@100] - Client environment:host.name=localhost
- 2013-05-10 15:00:25,370 [myid:] - INFO [main:Environment@100] - Client environment:java.version=1.6.0_33
- 2013-05-10 15:00:25,371 [myid:] - INFO [main:Environment@100] - Client environment:java.vendor=Sun Microsystems Inc.
- 2013-05-10 15:00:25,372 [myid:] - INFO [main:Environment@100] - Client environment:java.home=/home/weibo/jdk1.6.0_33/jre
- 2013-05-10 15:00:25,373 [myid:] - INFO [main:Environment@100] - Client environment:java.class.path=/home/weibo/zookeeperA/bin/../build/classes:/home/weibo/zookeeperA/bin/../build/lib/*.jar:/home/weibo/zookeeperA/bin/../lib/slf4j-log4j12-1.6.1.jar:/home/weibo/zookeeperA/bin/../lib/slf4j-api-1.6.1.jar:/home/weibo/zookeeperA/bin/../lib/netty-3.2.2.Final.jar:/home/weibo/zookeeperA/bin/../lib/log4j-1.2.15.jar:/home/weibo/zookeeperA/bin/../lib/jline-0.9.94.jar:/home/weibo/zookeeperA/bin/../zookeeper-3.4.5.jar:/home/weibo/zookeeperA/bin/../src/java/lib/*.jar:/home/weibo/zookeeperA/bin/../conf:/home/weibo/jdk1.6.0_33/lib:.
- 2013-05-10 15:00:25,373 [myid:] - INFO [main:Environment@100] - Client environment:java.library.path=/home/weibo/jdk1.6.0_33/jre/lib/amd64/server:/home/weibo/jdk1.6.0_33/jre/lib/amd64:/home/weibo/jdk1.6.0_33/jre/../lib/amd64:/usr/java/packages/lib/amd64:/usr/lib64:/lib64:/lib:/usr/lib
- 2013-05-10 15:00:25,374 [myid:] - INFO [main:Environment@100] - Client environment:java.io.tmpdir=/tmp
- 2013-05-10 15:00:25,375 [myid:] - INFO [main:Environment@100] - Client environment:java.compiler=<NA>
- 2013-05-10 15:00:25,375 [myid:] - INFO [main:Environment@100] - Client environment:os.name=Linux
- 2013-05-10 15:00:25,376 [myid:] - INFO [main:Environment@100] - Client environment:os.arch=amd64
- 2013-05-10 15:00:25,377 [myid:] - INFO [main:Environment@100] - Client environment:os.version=2.6.32-279.5.2.el6.x86_64
- 2013-05-10 15:00:25,377 [myid:] - INFO [main:Environment@100] - Client environment:user.name=root
- 2013-05-10 15:00:25,378 [myid:] - INFO [main:Environment@100] - Client environment:user.home=/root
- 2013-05-10 15:00:25,379 [myid:] - INFO [main:Environment@100] - Client environment:user.dir=/home/weibo/zookeeperA/bin
- 2013-05-10 15:00:25,382 [myid:] - INFO [main:ZooKeeper@438] - Initiating client connection, connectString=localhost:2181 sessionTimeout=30000 watcher=org.apache.zookeeper.ZooKeeperMain$MyWatcher@9d04653
- Welcome to ZooKeeper!
- 2013-05-10 15:00:25,604 [myid:] - INFO [main-SendThread(localhost:2181):ClientCnxn$SendThread@966] - Opening socket connection to server localhost/127.0.0.1:2181. Will not attempt to authenticate using SASL (无法定位登录配置)
- JLine support is enabled
- 2013-05-10 15:00:25,649 [myid:] - INFO [main-SendThread(localhost:2181):ClientCnxn$SendThread@849] - Socket connection established to localhost/127.0.0.1:2181, initiating session
- 2013-05-10 15:00:25,842 [myid:] - INFO [main-SendThread(localhost:2181):ClientCnxn$SendThread@1207] - Session establishment complete on server localhost/127.0.0.1:2181, sessionid = 0x13e7e067764005d, negotiated timeout = 30000
- WATCHER::
- WatchedEvent state:SyncConnected type:None path:null
- [zk: localhost:2181(CONNECTED) 0] ls /
- [configs, zookeeper, clusterstate.json, aliases.json, live_nodes, overseer, overseer_elect, collections]
- [zk: localhost:2181(CONNECTED) 1]
clusterstate.json:集群状态json
aliases:别名json
live_node:当solr服务器启动的时候,会注册到这里
overseer:保存shard信息
overseer_elect:节点选举
collections:所有的collection
Solr及Tomcat安装配置
(1)下载solr4.2,把solr-4.2.1\example\solr\collection1\conf 目录下的配置文件上传的zookeeper
(2)上传solr配置文件到zookeeper
- java -classpath .:/home/weibo/tomcat_solr_qq/webapps/solr/WEB-INF/lib/* org.apache.solr.cloud.ZkCLI -cmd upconfig -zkhost 127.0.0.1:2181,127.0.0.1:2281,127.0.0.1:2381 -confdir /home/weibo/solr-config -confname weiboconfig
- java -classpath .:/home/weibo/tomcat_solr_qq/webapps/solr/WEB-INF/lib/* org.apache.solr.cloud.ZkCLI -cmd linkconfig -collection weibocollection -confname weiboconfig -zkhost 127.0.0.1:2181,127.0.0.1:2281,127.0.0.1:2381
- <?xml version="1.0" encoding="utf-8"?>
- <Context docBase="/home/weibo/solr-qq/solr.war" debug="0" corssContext="false" >
- <Environment name="solr/home" type="java.lang.String" value="/home/weibo/solr-qq/" override="true"/>
- </Context>
(5)在你的solr home目录创建solr.xml
- <?xml version="1.0" encoding="UTF-8" ?>
- <solr persistent="true">
- <cores adminPath="/admin/cores" zkClientTimeout="20000" hostPort="8080">
- </cores>
- </solr>
(6)编辑tomcat bin/catalina.sh(windows下catalina.bat),加入
- JAVA_OPTS='-DzkHost=127.0.0.1:2181,127.0.0.1:2281,127.0.0.1:2381'
这里就是solr和zookeeper交互的地方了
(7)依次启动tomcat,创建core及shard,访问solr
----------------------------------这里由于为了测试简单,我只创建了一个主节点,没有replica节点,如果你需要,请跳过这一步,直接看下面的collection配置-----------------
- http://localhost:8080/solr/admin/collections?action=CREATE&name=weibocollection&numShards=3&replicationFactor=1
Collection,Shard,Replication,Core配置
首先,我们创建一个collection,并且指明它的replica的个数,shard的个数
- http://localhost:8080/solr/admin/collections?action=CREATE&name=weibocollection&numShards=3&replicationFactor=3&maxShardsPerNode=3
Last
由于zookeeper是过半存活,所以由于我这里使用的是3台zookeeper,所以最大可以死掉1台,还能继续工作,官方推荐使用奇数台服务器。