lxwt909

Learning Solr 5 with Yida: solrconfig.xml Configuration Explained

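The solrconfig.xml file holds most of the parameters that configure Solr itself. A sample solrconfig.xml ships with Solr and can be found under the directory where you unpacked the distribution.

Open solrconfig.xml in a text editor and you will see the following: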

<?xml version="1.0" encoding="UTF-8" ?>
<!--
 Licensed to the Apache Software Foundation (ASF) under one or more
 contributor license agreements.  See the NOTICE file distributed with
 this work for additional information regarding copyright ownership.
 The ASF licenses this file to You under the Apache License, Version 2.0
 (the "License"); you may not use this file except in compliance with
 the License.  You may obtain a copy of the License at

     http://www.apache.org/licenses/LICENSE-2.0

 Unless required by applicable law or agreed to in writing, software
 distributed under the License is distributed on an "AS IS" BASIS,
 WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
 See the License for the specific language governing permissions and
 limitations under the License.
-->

<!-- 
     For more details about configurations options that may appear in
     this file, see http://wiki.apache.org/solr/SolrConfigXml. 
-->
<config>
  <!-- In all configuration below, a prefix of "solr." for class names
       is an alias that causes solr to search appropriate packages,
       including org.apache.solr.(search|update|request|core|analysis)

       You may also specify a fully qualified Java classname if you
       have your own custom plugins.
    -->

  <!-- Controls what version of Lucene various components of Solr
       adhere to.  Generally, you want to use the latest version to
       get all bug fixes and improvements. It is highly recommended
       that you fully re-index after changing this setting as it can
       affect both how text is indexed and queried.
  -->
  <luceneMatchVersion>5.1.0</luceneMatchVersion>

  <!-- Data Directory

       Used to specify an alternate directory to hold all index data
       other than the default ./data under the Solr home.  If
       replication is in use, this should match the replication
       configuration.
    -->
  <!--
  <dataDir>${solr.data.dir:}</dataDir>
  -->
  <dataDir>C:\solr_home\core1\data</dataDir>

  <!-- The DirectoryFactory to use for indexes.
       
       solr.StandardDirectoryFactory is filesystem
       based and tries to pick the best implementation for the current
       JVM and platform.  solr.NRTCachingDirectoryFactory, the default,
       wraps solr.StandardDirectoryFactory and caches small files in memory
       for better NRT performance.

       One can force a particular implementation via solr.MMapDirectoryFactory,
       solr.NIOFSDirectoryFactory, or solr.SimpleFSDirectoryFactory.

       solr.RAMDirectoryFactory is memory based, not
       persistent, and doesn't work with replication.
    -->
  <directoryFactory name="DirectoryFactory" 
                    class="${solr.directoryFactory:solr.NRTCachingDirectoryFactory}">
  </directoryFactory> 

  <!-- The CodecFactory for defining the format of the inverted index.
       The default implementation is SchemaCodecFactory, which is the official Lucene
       index format, but hooks into the schema to provide per-field customization of
       the postings lists and per-document values in the fieldType element
       (postingsFormat/docValuesFormat). Note that most of the alternative implementations
       are experimental, so if you choose to customize the index format, it's a good
       idea to convert back to the official format e.g. via IndexWriter.addIndexes(IndexReader)
       before upgrading to a newer version to avoid unnecessary reindexing.
  -->
  <codecFactory class="solr.SchemaCodecFactory"/>

  <schemaFactory class="ClassicIndexSchemaFactory"/>

  <!-- ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
       Index Config - These settings control low-level behavior of indexing
       Most example settings here show the default value, but are commented
       out, to more easily see where customizations have been made.
       
       Note: This replaces <indexDefaults> and <mainIndex> from older versions
       ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ -->
  <indexConfig>

    <!-- LockFactory 

         This option specifies which Lucene LockFactory implementation
         to use.
      
         single = SingleInstanceLockFactory - suggested for a
                  read-only index or when there is no possibility of
                  another process trying to modify the index.
         native = NativeFSLockFactory - uses OS native file locking.
                  Do not use when multiple solr webapps in the same
                  JVM are attempting to share a single index.
         simple = SimpleFSLockFactory  - uses a plain file for locking

         Defaults: 'native' is default for Solr3.6 and later, otherwise
                   'simple' is the default

         More details on the nuances of each LockFactory...
         http://wiki.apache.org/lucene-java/AvailableLockFactories
    -->
    <lockType>${solr.lock.type:native}</lockType>

    <!-- Lucene Infostream
       
         To aid in advanced debugging, Lucene provides an "InfoStream"
         of detailed information when indexing.

         Setting the value to true will instruct the underlying Lucene
         IndexWriter to write its info stream to solr's log. By default,
         this is enabled here, and controlled through log4j.properties.
      -->
     <infoStream>true</infoStream>
  </indexConfig>


  <!-- JMX
       
       This example enables JMX if and only if an existing MBeanServer
       is found, use this if you want to configure JMX through JVM
       parameters. Remove this to disable exposing Solr configuration
       and statistics to JMX.

       For more details see http://wiki.apache.org/solr/SolrJmx
    -->
  <jmx />
  <!-- If you want to connect to a particular server, specify the
       agentId 
    -->
  <!-- <jmx agentId="myAgent" /> -->
  <!-- If you want to start a new MBeanServer, specify the serviceUrl -->
  <!-- <jmx serviceUrl="service:jmx:rmi:///jndi/rmi://localhost:9999/solr"/>
    -->

  <!-- The default high-performance update handler -->
  <updateHandler class="solr.DirectUpdateHandler2">

    <!-- Enables a transaction log, used for real-time get, durability, and
         solr cloud replica recovery.  The log can grow as big as
         uncommitted changes to the index, so use of a hard autoCommit
         is recommended (see below).
         "dir" - the target directory for transaction logs, defaults to the
                solr data directory.  --> 
    <updateLog>
      <str name="dir">${solr.ulog.dir:}</str>
    </updateLog>
 
    <!-- AutoCommit

         Perform a hard commit automatically under certain conditions.
         Instead of enabling autoCommit, consider using "commitWithin"
         when adding documents. 

         http://wiki.apache.org/solr/UpdateXmlMessages

         maxDocs - Maximum number of documents to add since the last
                   commit before automatically triggering a new commit.

         maxTime - Maximum amount of time in ms that is allowed to pass
                   since a document was added before automatically
                   triggering a new commit. 
         openSearcher - if false, the commit causes recent index changes
           to be flushed to stable storage, but does not cause a new
           searcher to be opened to make those changes visible.

         If the updateLog is enabled, then it's highly recommended to
         have some sort of hard autoCommit to limit the log size.
      -->
     <autoCommit> 
       <maxTime>${solr.autoCommit.maxTime:15000}</maxTime> 
       <openSearcher>false</openSearcher> 
     </autoCommit>

    <!-- softAutoCommit is like autoCommit except it causes a
         'soft' commit which only ensures that changes are visible
         but does not ensure that data is synced to disk.  This is
         faster and more near-realtime friendly than a hard commit.
      -->
     <autoSoftCommit> 
       <maxTime>${solr.autoSoftCommit.maxTime:-1}</maxTime> 
     </autoSoftCommit>

  </updateHandler>
  
  <!-- ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
       Query section - these settings control query time things like caches
       ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ -->
  <query>
    <!-- Max Boolean Clauses

         Maximum number of clauses in each BooleanQuery,  an exception
         is thrown if exceeded.

         ** WARNING **
         
         This option actually modifies a global Lucene property that
         will affect all SolrCores.  If multiple solrconfig.xml files
         disagree on this property, the value at any given moment will
         be based on the last SolrCore to be initialized.
         
      -->
    <maxBooleanClauses>1024</maxBooleanClauses>


    <!-- Solr Internal Query Caches

         There are two implementations of cache available for Solr,
         LRUCache, based on a synchronized LinkedHashMap, and
         FastLRUCache, based on a ConcurrentHashMap.  

         FastLRUCache has faster gets and slower puts in single
         threaded operation and thus is generally faster than LRUCache
         when the hit ratio of the cache is high (> 75%), and may be
         faster under other scenarios on multi-cpu systems.
    -->

    <!-- Filter Cache

         Cache used by SolrIndexSearcher for filters (DocSets),
         unordered sets of *all* documents that match a query.  When a
         new searcher is opened, its caches may be prepopulated or
         "autowarmed" using data from caches in the old searcher.
         autowarmCount is the number of items to prepopulate.  For
         LRUCache, the autowarmed items will be the most recently
         accessed items.

         Parameters:
           class - the SolrCache implementation
               (LRUCache or FastLRUCache)
           size - the maximum number of entries in the cache
           initialSize - the initial capacity (number of entries) of
               the cache.  (see java.util.HashMap)
           autowarmCount - the number of entries to prepopulate from
               an old cache.  
      -->
    <filterCache class="solr.FastLRUCache"
                 size="512"
                 initialSize="512"
                 autowarmCount="0"/>

    <!-- Query Result Cache
         
         Caches results of searches - ordered lists of document ids
         (DocList) based on a query, a sort, and the range of documents requested.  
      -->
    <queryResultCache class="solr.LRUCache"
                     size="512"
                     initialSize="512"
                     autowarmCount="0"/>
   
    <!-- Document Cache

         Caches Lucene Document objects (the stored fields for each
         document).  Since Lucene internal document ids are transient,
         this cache will not be autowarmed.  
      -->
    <documentCache class="solr.LRUCache"
                   size="512"
                   initialSize="512"
                   autowarmCount="0"/>
    
    <!-- custom cache currently used by block join --> 
    <cache name="perSegFilter"
      class="solr.search.LRUCache"
      size="10"
      initialSize="0"
      autowarmCount="10"
      regenerator="solr.NoOpRegenerator" />

    <!-- Lazy Field Loading

         If true, stored fields that are not requested will be loaded
         lazily.  This can result in a significant speed improvement
         if the usual case is to not load all stored fields,
         especially if the skipped fields are large compressed text
         fields.
    -->
    <enableLazyFieldLoading>true</enableLazyFieldLoading>

   <!-- Result Window Size

        An optimization for use with the queryResultCache.  When a search
        is requested, a superset of the requested number of document ids
        are collected.  For example, if a search for a particular query
        requests matching documents 10 through 19, and queryWindowSize is 50,
        then documents 0 through 49 will be collected and cached.  Any further
        requests in that range can be satisfied via the cache.  
     -->
   <queryResultWindowSize>20</queryResultWindowSize>

   <!-- Maximum number of documents to cache for any entry in the
        queryResultCache. 
     -->
   <queryResultMaxDocsCached>200</queryResultMaxDocsCached>

    <!-- Use Cold Searcher

         If a search request comes in and there is no current
         registered searcher, then immediately register the still
         warming searcher and use it.  If "false" then all requests
         will block until the first searcher is done warming.
      -->
    <useColdSearcher>false</useColdSearcher>

    <!-- Max Warming Searchers
         
         Maximum number of searchers that may be warming in the
         background concurrently.  An error is returned if this limit
         is exceeded.

         Recommend values of 1-2 for read-only slaves, higher for
         masters w/o cache warming.
      -->
    <maxWarmingSearchers>2</maxWarmingSearchers>

  </query>


  <!-- Request Dispatcher

       This section contains instructions for how the SolrDispatchFilter
       should behave when processing requests for this SolrCore.

       handleSelect is a legacy option that affects the behavior of requests
       such as /select?qt=XXX

       handleSelect="true" will cause the SolrDispatchFilter to process
       the request and dispatch the query to a handler specified by the 
       "qt" param, assuming "/select" isn't already registered.

       handleSelect="false" will cause the SolrDispatchFilter to
       ignore "/select" requests, resulting in a 404 unless a handler
       is explicitly registered with the name "/select"

       handleSelect="true" is not recommended for new users, but is the default
       for backwards compatibility
    -->
  <requestDispatcher handleSelect="false" >
    <!-- Request Parsing

         These settings indicate how Solr Requests may be parsed, and
         what restrictions may be placed on the ContentStreams from
         those requests

         enableRemoteStreaming - enables use of the stream.file
         and stream.url parameters for specifying remote streams.

         multipartUploadLimitInKB - specifies the max size (in KiB) of
         Multipart File Uploads that Solr will allow in a Request.
         
         formdataUploadLimitInKB - specifies the max size (in KiB) of
         form data (application/x-www-form-urlencoded) sent via
         POST. You can use POST to pass request parameters not
         fitting into the URL.
         
         addHttpRequestToContext - if set to true, it will instruct
         the requestParsers to include the original HttpServletRequest
         object in the context map of the SolrQueryRequest under the 
         key "httpRequest". It will not be used by any of the existing
         Solr components, but may be useful when developing custom 
         plugins.
         
         *** WARNING ***
         The settings below authorize Solr to fetch remote files, You
         should make sure your system has some authentication before
         using enableRemoteStreaming="true"

      --> 
    <requestParsers enableRemoteStreaming="true" 
                    multipartUploadLimitInKB="2048000"
                    formdataUploadLimitInKB="2048"
                    addHttpRequestToContext="false"/>

    <!-- HTTP Caching

         Set HTTP caching related parameters (for proxy caches and clients).

         The options below instruct Solr not to output any HTTP Caching
         related headers
      -->
    <httpCaching never304="true" />

  </requestDispatcher>

  <!-- Request Handlers 

       http://wiki.apache.org/solr/SolrRequestHandler

       Incoming queries will be dispatched to a specific handler by name
       based on the path specified in the request.

       Legacy behavior: If the request path uses "/select" but no Request
       Handler has that name, and if handleSelect="true" has been specified in
       the requestDispatcher, then the Request Handler is dispatched based on
       the qt parameter.  Handlers without a leading '/' are accessed this way
       like so: http://host/app/[core/]select?qt=name  If no qt is
       given, then the requestHandler that declares default="true" will be
       used or the one named "standard".

       If a Request Handler is declared with startup="lazy", then it will
       not be initialized until the first request that uses it.

    -->
  <!-- SearchHandler

       http://wiki.apache.org/solr/SearchHandler

       For processing Search Queries, the primary Request Handler
       provided with Solr is "SearchHandler". It delegates to a sequence
       of SearchComponents (see below) and supports distributed
       queries across multiple shards
    -->

  <!--
  <requestHandler name="/dataimport" class="solr.DataImportHandler">
    <lst name="defaults">
      <str name="config">solr-data-config.xml</str>
    </lst>
  </requestHandler>
  -->
  <requestHandler name="/dataimport" class="solr.DataImportHandler">
    <lst name="defaults">
      <str name="config">data-config.xml</str>
    </lst>
  </requestHandler>
    
  <requestHandler name="/select" class="solr.SearchHandler">
    <!-- default values for query parameters can be specified, these
         will be overridden by parameters in the request
      -->
     <lst name="defaults">
       <str name="echoParams">explicit</str>
       <int name="rows">10</int>
     </lst>

    </requestHandler>

  <!-- A request handler that returns indented JSON by default -->
  <requestHandler name="/query" class="solr.SearchHandler">
     <lst name="defaults">
       <str name="echoParams">explicit</str>
       <str name="wt">json</str>
       <str name="indent">true</str>
       <str name="df">text</str>
     </lst>
  </requestHandler>

  <!--
    The export request handler is used to export full sorted result sets.
    Do not change these defaults.
  -->
  <requestHandler name="/export" class="solr.SearchHandler">
    <lst name="invariants">
      <str name="rq">{!xport}</str>
      <str name="wt">xsort</str>
      <str name="distrib">false</str>
    </lst>

    <arr name="components">
      <str>query</str>
    </arr>
  </requestHandler>


  <initParams path="/update/**,/query,/select,/tvrh,/elevate,/spell">
    <lst name="defaults">
      <str name="df">text</str>
    </lst>
  </initParams>

  <!-- Field Analysis Request Handler

       RequestHandler that provides much the same functionality as
       analysis.jsp. Provides the ability to specify multiple field
       types and field names in the same request and outputs
       index-time and query-time analysis for each of them.

       Request parameters are:
       analysis.fieldname - field name whose analyzers are to be used

       analysis.fieldtype - field type whose analyzers are to be used
       analysis.fieldvalue - text for index-time analysis
       q (or analysis.q) - text for query time analysis
       analysis.showmatch (true|false) - When set to true and when
           query analysis is performed, the produced tokens of the
           field value analysis will be marked as "matched" for every
           token that is produced by the query analysis
   -->
  <requestHandler name="/analysis/field" 
                  startup="lazy"
                  class="solr.FieldAnalysisRequestHandler" />


  <!-- Document Analysis Handler

       http://wiki.apache.org/solr/AnalysisRequestHandler

       An analysis handler that provides a breakdown of the analysis
       process of provided documents. This handler expects a (single)
       content stream with the following format:

       <docs>
         <doc>
           <field name="id">1</field>
           <field name="name">The Name</field>
           <field name="text">The Text Value</field>
         </doc>
         <doc>...</doc>
         <doc>...</doc>
         ...
       </docs>

    Note: Each document must contain a field which serves as the
    unique key. This key is used in the returned response to associate
    an analysis breakdown to the analyzed document.

    Like the FieldAnalysisRequestHandler, this handler also supports
    query analysis by sending either an "analysis.query" or "q"
    request parameter that holds the query text to be analyzed. It
    also supports the "analysis.showmatch" parameter which when set to
    true, all field tokens that match the query tokens will be marked
    as a "match". 
  -->
  <requestHandler name="/analysis/document" 
                  class="solr.DocumentAnalysisRequestHandler" 
                  startup="lazy" />

  <!-- Echo the request contents back to the client -->
  <requestHandler name="/debug/dump" class="solr.DumpRequestHandler" >
    <lst name="defaults">
     <str name="echoParams">explicit</str> 
     <str name="echoHandler">true</str>
    </lst>
  </requestHandler>
  


  <!-- Search Components

       Search components are registered to SolrCore and used by 
       instances of SearchHandler (which can access them by name)
       
       By default, the following components are available:
       
       <searchComponent name="query"     class="solr.QueryComponent" />
       <searchComponent name="facet"     class="solr.FacetComponent" />
       <searchComponent name="mlt"       class="solr.MoreLikeThisComponent" />
       <searchComponent name="highlight" class="solr.HighlightComponent" />
       <searchComponent name="stats"     class="solr.StatsComponent" />
       <searchComponent name="debug"     class="solr.DebugComponent" />
       
     -->

  <!-- Terms Component

       http://wiki.apache.org/solr/TermsComponent

       A component to return terms and document frequency of those
       terms
    -->
  <searchComponent name="terms" class="solr.TermsComponent"/>

  <!-- A request handler for demonstrating the terms component -->
  <requestHandler name="/terms" class="solr.SearchHandler" startup="lazy">
     <lst name="defaults">
      <bool name="terms">true</bool>
      <bool name="distrib">false</bool>
    </lst>     
    <arr name="components">
      <str>terms</str>
    </arr>
  </requestHandler>

  <!-- Legacy config for the admin interface -->
  <admin>
    <defaultQuery>*:*</defaultQuery>
  </admin>

</config>

Below I explain the key parts of this file:

lib

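The <lib> directive tells Solr how to load the jar files that Solr plugins depend on. The comments in the sample solrconfig.xml that ships with Solr include usage examples.

The dir attribute is a directory containing jar files; a relative path is resolved against the root directory of the current core. The regex attribute is a regular expression used to filter file names; only jar files whose names match it are loaded.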
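For illustration, <lib> directives might look like the sketch below. The contrib/extraction/lib path and the solr.install.dir property follow the layout used by the sample configs shipped with Solr 5 and are assumptions about your installation; my-plugin.jar is a hypothetical file name.

```xml
<config>
  <!-- Load every .jar file found in the given directory.
       A relative dir is resolved against the core's root directory;
       the path below is an assumed layout, not a requirement. -->
  <lib dir="${solr.install.dir:../../../..}/contrib/extraction/lib" regex=".*\.jar" />

  <!-- Load a single jar by path (hypothetical file name). -->
  <lib path="../lib/my-plugin.jar" />
</config>
```

Using dir together with regex is handy when a plugin brings many dependencies; path is simpler when you only need one jar.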

dataDir parameter

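Specifies the directory that holds Solr's index data; the index that Solr creates is stored under data\index. By default dataDir is relative to the current core's directory (if a core exists under solr_home); if no core exists under solr_home, dataDir is relative to solr_home instead. In practice, dataDir is usually configured in core.properties.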

codecFactory

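Sets the codec factory that defines the format of Lucene's inverted index. The default implementation is the officially provided SchemaCodecFactory class.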

indexConfig Section

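The <indexConfig> section of the sample solrconfig.xml contains many comments describing its options, for example:

<!-- maxFieldLength was removed in 4.0. To get similar behavior, include a
     LimitTokenCountFilterFactory in your fieldType definition. E.g.
 <filter class="solr.LimitTokenCountFilterFactory" maxTokenCount="10000"/>
-->

This tells us that the maxFieldLength option was removed as of version 4.0 and that a similar effect can be achieved by configuring a filter. maxTokenCount means that when a field is tokenized, only the first 10000 tokens are kept and the rest of the field value is discarded. The old maxFieldLength worked the same way: a value of 1000 meant that only the first 1000 terms of a field were indexed.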
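As a minimal sketch of that replacement, the filter goes into the analyzer chain of a field type in schema.xml. The field type name text_limited and the choice of tokenizer and lowercase filter are assumptions made for this example only.

```xml
<!-- schema.xml fragment; "text_limited" is a hypothetical field type name -->
<fieldType name="text_limited" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
    <!-- Keep only the first 10000 tokens of each field value;
         anything after that is dropped before indexing. -->
    <filter class="solr.LimitTokenCountFilterFactory" maxTokenCount="10000"/>
  </analyzer>
</fieldType>
```

Placing the limit last in the chain means it counts the tokens that actually reach the index, after the earlier filters have run.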

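writeLockTimeout is the maximum time an IndexWriter instance waits to acquire the write lock. If the lock still has not been acquired when the timeout expires, the IndexWriter's write operation throws an exception.

maxIndexingThreads is the maximum number of threads used for indexing; by default up to 8 threads index concurrently.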

<useCompoundFile>false</useCompoundFile>

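Whether to enable compound file mode. With compound files enabled, fewer index files are created and therefore fewer file descriptors are used, but at some cost in performance. Lucene enables it by default, whereas Solr has disabled it by default since version 3.6.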

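ramBufferSizeMB is the size, in MB, of the in-memory buffer used while indexing; the default is 100 MB.

maxBufferedDocs is the maximum number of documents buffered in memory before they are written to disk; exceeding it triggers a flush of the index.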
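Put together, these indexing options might appear inside <indexConfig> roughly as follows. The values are illustrative (8 threads and 100 MB match the defaults mentioned above; the timeout and maxBufferedDocs values are assumptions), so check the defaults for your own Solr version before copying them.

```xml
<indexConfig>
  <!-- Max time (ms) to wait for the IndexWriter write lock; illustrative value -->
  <writeLockTimeout>1000</writeLockTimeout>

  <!-- Max number of threads allowed to index concurrently -->
  <maxIndexingThreads>8</maxIndexingThreads>

  <!-- Compound files: fewer files and file descriptors, slower indexing -->
  <useCompoundFile>false</useCompoundFile>

  <!-- Flush when the RAM buffer reaches this size (MB)... -->
  <ramBufferSizeMB>100</ramBufferSizeMB>
  <!-- ...or when this many documents are buffered; illustrative value -->
  <maxBufferedDocs>1000</maxBufferedDocs>
</indexConfig>
```

When both ramBufferSizeMB and maxBufferedDocs are set, a flush happens as soon as either limit is reached.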

<mergePolicy class="org.apache.lucene.index.TieredMergePolicy">
  <int name="maxMergeAtOnce">10</int>
  <int name="segmentsPerTier">10</int>
</mergePolicy>

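This configures Lucene's segment merge policy. It takes two parameters:

maxMergeAtOnce: the maximum number of segments merged at once.

segmentsPerTier: the number of segments allowed per tier. In the source below, each tier contributes up to segsPerTier segments to the allowed segment count, and the segment size of a tier grows by a factor of maxMergeAtOnce from one tier to the next: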

// Compute max allowed segs in the index
long levelSize = minSegmentBytes;
long bytesLeft = totIndexBytes;
double allowedSegCount = 0;
while (true) {
  // How many segments of the current tier size fit in the remaining bytes
  final double segCountLevel = bytesLeft / (double) levelSize;
  if (segCountLevel < segsPerTier) {
    allowedSegCount += Math.ceil(segCountLevel);
    break;
  }
  allowedSegCount += segsPerTier;
  bytesLeft -= segsPerTier * levelSize;
  // The next tier holds segments maxMergeAtOnce times larger
  levelSize *= maxMergeAtOnce;
}
int allowedSegCountInt = (int) allowedSegCount;

To understand what mergeFactor means, let's first look at the explanation given in Lucene in Action:

IndexWriter’s mergeFactor lets you control how many Documents to store in memory
before writing them to the disk, as well as how often to merge multiple index
segments together. (Index segments are covered in appendix B.) With the default
value of 10, Lucene stores 10 Documents in memory before writing them to a single
segment on the disk. The mergeFactor value of 10 also means that once the
number of segments on the disk has reached the power of 10, Lucene merges
these segments into a single segment.
For instance, if you set mergeFactor to 10, a new segment is created on the disk
for every 10 Documents added to the index. When the tenth segment of size 10 is
added, all 10 are merged into a single segment of size 100. When 10 such segments
of size 100 have been added, they’re merged into a single segment containing
1,000 Documents, and so on. Therefore, at any time, there are no more than 9
segments in the index, and the size of each merged segment is the power of 10.
There is a small exception to this rule that has to do with maxMergeDocs,
another IndexWriter instance variable: While merging segments, Lucene ensures
that no segment with more than maxMergeDocs Documents is created. For instance,
suppose you set maxMergeDocs to 1,000. When you add the ten-thousandth Document,
instead of merging multiple segments into a single segment of size 10,000,
Lucene creates the tenth segment of size 1,000 and keeps adding new segments
of size 1,000 for every 1,000 Documents added.

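In other words, IndexWriter's mergeFactor controls both how many documents are buffered in memory before being written to disk and how often segment files are merged. The default is 10: Lucene buffers 10 documents in memory and then writes them to disk as a single segment, and once the number of segments on disk reaches 10, Lucene merges those 10 segments into one. For example, with mergeFactor set to 10, a new segment is created on disk for every 10 documents added to the index. When the tenth such segment is added, the 10 segments are merged into a single segment of 100 documents; when 10 segments of 100 documents have accumulated, they are merged into a new segment of 1,000 documents, and so on. So at any time there are no more than 9 segments of a given size, and the number of documents in each merged segment is a power of 10.

There is one exception, which involves maxMergeDocs: while merging, Lucene ensures that no segment ends up with more than maxMergeDocs documents. The purpose of maxMergeDocs is to keep any single segment from holding too many documents. Suppose maxMergeDocs is 1,000: when the tenth segment of 1,000 documents is created, no merge is triggered. (Without the limit, those 10 segments would be merged into one segment of 10,000 documents, but maxMergeDocs caps a segment at 1,000 documents, so the merge does not happen.) Other parameters also affect segment merging, for example: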

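mergeFactor: merging starts when the number of segments of roughly equal size reaches this value.

minMergeSize: all segments smaller than this value are treated as roughly equal in size and are merged together.

maxMergeSize: a segment larger than this value no longer takes part in merging.

maxMergeDocs: a segment containing more documents than this value no longer takes part in merging.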
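In Solr 5, mergeFactor can be set directly inside <indexConfig>. The fragment below is a sketch based on the commented-out example in the sample solrconfig.xml, whose comments describe mergeFactor as a convenience parameter that, for TieredMergePolicy, sets both maxMergeAtOnce and segmentsPerTier. Treat the exact behavior as something to verify against your Solr version.

```xml
<indexConfig>
  <!-- Merge once 10 segments of similar size have accumulated.
       10 is the value described as the default in the sample config. -->
  <mergeFactor>10</mergeFactor>
</indexConfig>
```

A lower value means fewer segments and faster searches at the cost of more merge work during indexing; a higher value trades the other way.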

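Segment merging happens in two steps:

1. First, the segments to merge are selected. This is decided by the MergePolicy class.

2. Then the merge itself is carried out. This step is handed to the MergeScheduler, and the merge does two main things:

A. Merges stored fields, term vectors, norms (normalization factors), and similar data

B. Merges the inverted index data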

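That was quite a digression. Let's get back to the parameters in solrconfig.xml that affect index creation.

mergeScheduler, mentioned just above, configures the class that carries out segment merges. The default implementation is Lucene's own ConcurrentMergeScheduler.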
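Spelled out explicitly, the default scheduler would be declared like this. It is a minimal sketch that names only the class and sets no tuning parameters.

```xml
<indexConfig>
  <!-- Runs merges in background threads; this is already the default,
       so the element is only needed if you want to be explicit
       or swap in another MergeScheduler implementation. -->
  <mergeScheduler class="org.apache.lucene.index.ConcurrentMergeScheduler"/>
</indexConfig>
```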

<lockType>${solr.lock.type:native}</lockType>

This specifies which Lucene LockFactory implementation to use. The available options are:

single = SingleInstanceLockFactory - suggested for a
         read-only index or when there is no possibility of
         another process trying to modify the index.
native = NativeFSLockFactory - uses OS native file locking.
         Do not use when multiple solr webapps in the same
         JVM are attempting to share a single index.
simple = SimpleFSLockFactory  - uses a plain file for locking

Defaults: 'native' is default for Solr3.6 and later, otherwise
          'simple' is the default

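single: Lucene's SingleInstanceLockFactory. Suggested for a read-only index, or when no other process can possibly modify the index.

native: Lucene's NativeFSLockFactory, which uses the operating system's native file locking.

simple: Lucene's SimpleFSLockFactory, which locks by creating a write.lock file on disk.

Defaults: this is not a value you can set; it describes what is used when lockType is not specified. From Solr 3.6 onward the default is native; in earlier versions it is simple. Which lock implementation you get by default therefore depends on your Solr version.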

<unlockOnStartup>false</unlockOnStartup>

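If this is set to true, any locks held by an IndexWriter or a commit operation are released when Solr starts. This defeats Lucene's locking mechanism, so use it with care. If lockType is set to single, this setting has no effect whether it is true or false.

deletionPolicy configures the index deletion policy. The default is Solr's SolrDeletionPolicy implementation. To use a custom deletion policy, provide your own implementation of Lucene's org.apache.lucene.index.IndexDeletionPolicy (an abstract class in recent Lucene versions).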
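A sketch of the default deletion policy written out explicitly is shown below. The two parameters are the ones used in the sample solrconfig.xml; the values are illustrative.

```xml
<indexConfig>
  <deletionPolicy class="solr.SolrDeletionPolicy">
    <!-- Number of commit points to keep -->
    <str name="maxCommitsToKeep">1</str>
    <!-- Number of optimized commit points to keep -->
    <str name="maxOptimizedCommitsToKeep">0</str>
  </deletionPolicy>
</indexConfig>
```

To plug in a custom policy, you would point the class attribute at your own implementation instead of solr.SolrDeletionPolicy.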

<jmx />

This element enables JMX in Solr. For details, see the Solr wiki at the following address:

http://wiki.apache.org/solr/SolrJmx

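updateHandler specifies the class that handles index updates. DirectUpdateHandler2 is a high-performance update handler that supports soft commits.

<updateLog>
  <str name="dir">${solr.ulog.dir:}</str>
</updateLog>

<updateLog> specifies where the updateHandler above stores its transaction log. The default is Solr's data directory, that is, the directory configured by dataDir.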

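The <query> section holds the settings related to querying the index.

maxBooleanClauses is the maximum number of clauses (sub-queries) a BooleanQuery may contain. When the solrconfig.xml files of different cores set different values, the value from the last core to be initialized wins.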

<filterCache class="solr.FastLRUCache"
             size="512"
             initialSize="512"
             autowarmCount="0"/>

Configures the cache for filters, that is, the unordered sets of documents (DocSets) that match a filter query.

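<queryResultCache class="solr.LRUCache"
                  size="512"
                  initialSize="512"
                  autowarmCount="0"/>

Configures the cache for the result sets returned by queries (the equivalent of Lucene's TopDocs): ordered lists of document ids.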

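<documentCache class="solr.LRUCache"
               size="512"
               initialSize="512"
               autowarmCount="0"/>

Configures the cache for the stored fields of documents, because loading stored field values from disk is expensive every time. Stored fields here means the fields indexed with Store.YES.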

<fieldValueCache class="solr.FastLRUCache"
                 size="512"
                 autowarmCount="128"
                 showItems="32" />

Caches field values so that they can be accessed quickly by document id. This cache is created by default, so it does not need to be configured explicitly.

<cache name="myUserCache"
       class="solr.LRUCache"
       size="4096"
       initialSize="1024"
       autowarmCount="1024"
       regenerator="com.mycompany.MyRegenerator"
       />

This configures a custom cache of your own. Your regenerator class must implement Solr's CacheRegenerator interface.

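enableLazyFieldLoading turns on lazy loading of stored fields. It applies to stored fields that the query did not explicitly ask to have returned.

useFilterForSortedQuery controls whether a filter is used in place of the query when the query is not sorted by score.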
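Both switches live inside the <query> section. A minimal sketch, with illustrative values:

```xml
<query>
  <!-- Load stored fields that were not requested only when they are accessed -->
  <enableLazyFieldLoading>true</enableLazyFieldLoading>

  <!-- When a query is not sorted by score, let Solr answer it from the
       filterCache instead of running it as a scored query -->
  <useFilterForSortedQuery>true</useFilterForSortedQuery>
</query>
```

useFilterForSortedQuery mostly pays off when the same queries are repeated with different non-score sort orders.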

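<listener event="newSearcher" class="solr.QuerySenderListener">
  <arr name="queries">
    <!--
       <lst><str name="q">solr</str><str name="sort">price asc</str></lst>
       <lst><str name="q">rocks</str><str name="sort">weight asc</str></lst>
      -->
  </arr>
</listener>

QuerySenderListener listens for searcher events. When the configured event fires (here newSearcher, meaning a new searcher is being opened), it sends the queries listed under queries to that searcher, which warms its caches before it serves real requests. Each query is given as a list of request parameters, such as the q keyword and the sort order in the example above.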
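The same listener class can also be attached to the firstSearcher event, which fires when the first searcher is opened at startup and there are no old caches to autowarm from. The sketch below uses a placeholder query; the query text and sort field are assumptions to be replaced with queries typical of your own traffic.

```xml
<query>
  <listener event="firstSearcher" class="solr.QuerySenderListener">
    <arr name="queries">
      <!-- Each lst is one warming request: a set of request parameters -->
      <lst>
        <str name="q">solr</str>
        <str name="sort">price asc</str>
      </lst>
    </arr>
  </listener>
</query>
```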

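Setting handleSelect to false tells Solr to ignore /select requests unless a request handler is explicitly registered under the name /select. Without such a handler, a request such as http://localhost:8080/solr/coreName/select?qt=xxxx returns a 404.

Dispatching /select requests by the qt parameter exists only for compatibility with older versions and is no longer recommended.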

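never304="true" means the Solr server never returns a 304 response. What does HTTP status code 304 mean? It is the server telling the client that the requested resource has not been modified, so the client can reuse the copy it cached last time. With never304 set, Solr always returns a fresh response and does not use HTTP caching, whether or not the resource has changed. This is part of the HTTP protocol; if it is unfamiliar, look up the HTTP specification for details.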
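For contrast, here is a sketch of the opposite setup, where Solr does emit caching headers and may answer with 304. The etagSeed value and the cacheControl content are illustrative assumptions.

```xml
<requestDispatcher handleSelect="false">
  <!-- never304="false" lets Solr honor conditional requests.
       Changing etagSeed invalidates ETags that clients already hold. -->
  <httpCaching never304="false" etagSeed="Solr">
    <!-- Sent as the Cache-Control response header -->
    <cacheControl>max-age=30, public</cacheControl>
  </httpCaching>
</requestDispatcher>
```

This is mainly useful when a caching proxy sits in front of Solr and the index changes infrequently.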

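<requestHandler name="/query" class="solr.SearchHandler">
  <lst name="defaults">
    <str name="echoParams">explicit</str>
    <str name="wt">json</str>
    <str name="indent">true</str>
  </lst>
</requestHandler>

This requestHandler maps the request URL /query to the handler class SearchHandler. When you visit http://localhost:8080/solr/coreName/query?q=xxx, the HTTP request is handed to SearchHandler. You can configure parameters to influence how SearchHandler processes it. For example, echoParams controls whether the request parameters are echoed back in the response. wt stands for writer type and selects the format of the returned data, such as json or xml. indent controls whether the returned JSON or XML is indented; without it the response has neither indentation nor line breaks and is hard to read.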

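I'll skip the other requestHandler entries. They are all much the same: a mapping from a request URL to a handler class, much like the mapping between request URLs and Controller classes in Spring MVC.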

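searchComponent configures search components such as SpellCheckComponent for spell checking. I'll leave the detailed spell-check configuration for a later article on SpellCheck.

TermsComponent returns terms together with the document frequency of each term.

HighlightComponent configures keyword highlighting. I'll skip the details of Solr highlighting for now; in this article we only want a rough idea of what each configuration item means, and how to use them is left for later.

I won't go through the remaining searchComponent settings one by one, since there are too many. Read the English comments in the file yourself, and ask me if something is still unclear.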
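To show how a search component and a request handler fit together, here is a minimal sketch modeled on the spell-check setup in the Solr sample configs. The field name text and the field type text_general are assumptions about your schema.

```xml
<config>
  <!-- 1. Register the component under a name -->
  <searchComponent name="spellcheck" class="solr.SpellCheckComponent">
    <str name="queryAnalyzerFieldType">text_general</str>
    <lst name="spellchecker">
      <str name="name">default</str>
      <!-- Field whose indexed terms serve as the dictionary (assumed name) -->
      <str name="field">text</str>
      <str name="classname">solr.DirectSolrSpellChecker</str>
    </lst>
  </searchComponent>

  <!-- 2. Reference it by name from a SearchHandler -->
  <requestHandler name="/spell" class="solr.SearchHandler" startup="lazy">
    <lst name="defaults">
      <str name="spellcheck">on</str>
    </lst>
    <!-- last-components run after the default components (query, facet, ...) -->
    <arr name="last-components">
      <str>spellcheck</str>
    </arr>
  </requestHandler>
</config>
```

The same two-step pattern (declare a searchComponent, then list its name in a handler's components) applies to the terms and highlight components as well.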

<queryResponseWriter name="json" class="solr.JSONResponseWriter">
    <!-- For the purposes of the tutorial, JSON responses are written as
     plain text so that they are easy to read in *any* browser.
     If you expect a MIME type of "application/json" just remove this override.
    -->
    <str name="content-type">text/plain; charset=UTF-8</str>
</queryResponseWriter>

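This configures a Solr response writer. JSONResponseWriter converts the HTTP response data to JSON. content-type is the Content-Type header of the response; here it tells the client that the MIME type of the returned data is text/plain and that the character encoding is UTF-8.

Other built-in response writers include velocity and xslt. If you want a custom writer, say one based on FreeMarker, implement Solr's QueryResponseWriter interface, using the existing implementations as a model, and then add a similar <queryResponseWriter> entry to solrconfig.xml.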
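Registering such a custom writer might look like the sketch below. The class com.mycompany.FreeMarkerResponseWriter and the name freemarker are hypothetical; Solr ships no such class, and you would have to write it and make its jar loadable yourself.

```xml
<config>
  <!-- Hypothetical custom writer implementing QueryResponseWriter -->
  <queryResponseWriter name="freemarker"
                       class="com.mycompany.FreeMarkerResponseWriter"/>
</config>
```

A request would then select it by name through the wt parameter, for example wt=freemarker.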

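Finally, solrconfig.xml contains many custom tags such as <arr>, <lst>, <str>, and <int>. They are explained together here:

This figure is taken from the book Solr in Action. Since it is in English, here is a brief explanation:

arr: short for array. It represents an array, and name is the variable name of the array parameter.

lst: short for list, but note that it holds key-value pairs.

bool: a boolean variable; name is the name of the boolean variable.

Likewise there are int, long, float, str, and so on.

str: short for string. The one thing to note is that str children of an arr have no name attribute, whereas str elements inside a lst do.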
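
The fragment below puts these tags side by side. It is only a sketch: the handler name /example is hypothetical, and it assumes a searchComponent named spellcheck is defined elsewhere in the file.

```xml
<requestHandler name="/example" class="solr.SearchHandler">
  <!-- lst: named key-value pairs, so every child carries a name -->
  <lst name="defaults">
    <str name="df">text</str>
    <int name="rows">10</int>
    <bool name="hl">true</bool>
  </lst>
  <!-- arr: a named array; its str children have no name attribute -->
  <arr name="last-components">
    <str>spellcheck</str>
  </arr>
</requestHandler>
```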

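To summarize:

The configuration items in solrconfig.xml fall into the following main groups:

1. The Lucene version that Solr components adhere to (luceneMatchVersion). It affects how text is indexed and queried, and because index formats are not fully compatible across Lucene versions, a full re-index is recommended after changing it.

2. Index creation settings, such as the index directory and the options corresponding to the IndexWriterConfig class (which determine indexing performance).

3. Load paths for the external jar files that solrconfig.xml depends on.

4. JMX settings.

5. Cache settings, including the filter cache, the query result cache, the document cache, and custom caches.

6. updateHandler settings, that is, settings for index update operations.

7. RequestHandler settings, that is, the handler classes that receive client HTTP requests.

8. Query component settings, such as Highlight and SpellChecker.

9. ResponseWriter settings, which determine the format in which response data is returned to the client.

10. Custom ValueSourceParser settings, used to influence document weighting, scoring, and sorting.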
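
As an illustration of item 10, a custom ValueSourceParser is registered with a single element. The function name myfunc and the class com.example.MyValueSourceParser are hypothetical placeholders; the class would need to extend Solr's ValueSourceParser.

```xml
<!-- hypothetical custom function, callable by name in function queries -->
<valueSourceParser name="myfunc" class="com.example.MyValueSourceParser"/>
```

Once registered, the function can be referenced by name in function queries, for example in a sort or boost expression.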

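That concludes this walkthrough of solrconfig.xml. Understanding these configuration items clears the way for the rest of your Solr learning. Anything I did not cover, or skipped on purpose, is left for you to read and work out yourselves: the file has more than 1,000 lines, explaining every one of them would take too long, and many entries are similar enough that you should be able to follow them.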

