Lucene源码分析 - analysis

位置:lucene-solr/lucene/analysis

analysis结构

Lucene源码分析 - analysis_第1张图片
image.png

需要关注的是
common/src/java
opennlp/src/java
smartcn/src/java

Analysis README file

INTRODUCTION

The Analysis Module provides analysis capabilities to Lucene and Solr
applications.

分析模块为Lucene和SoR提供分析能力。

The Lucene web site is at:
http://lucene.apache.org/

Please join the Lucene-User mailing list by sending a message to:
[email protected]

FILES

lucene-analyzers-common-XX.jar
The primary analysis module library, containing general-purpose analysis
components and support for various languages.

包含通用分析的主分析模块库
各种语言的组件和支持。

lucene-analyzers-icu-XX.jar
An add-on analysis library that provides improved Unicode support via
International Components for Unicode (ICU). Note: this module depends on
the ICU4j jar file (version >= 4.6.0)

通过提供改进的Unicode支持的附加分析库
Unicode(ICU)的国际组件。注意:此模块依赖于
ICU4J jar文件(版本>=4.6)

lucene-analyzers-kuromoji-XX.jar
An analyzer with morphological analysis for Japanese.

lucene-analyzers-morfologik-XX.jar
An analyzer using the Morfologik stemming library.

使用MyfRooik词干库的分析器。

lucene-analyzers-nori-XX.jar
An analyzer with morphological analysis for Korean.

lucene-analyzers-opennlp-XX.jar
An analyzer using the OpenNLP natural-language processing library.

一个使用OpenNLP自然语言处理库的分析器。

lucene-analyzers-phonetic-XX.jar
An add-on analysis library that provides phonetic encoders via Apache
Commons-Codec. Note: this module depends on the commons-codec jar
file

一个通过Apache提供语音编码器的附加分析库
公钥编解码器。注意:这个模块依赖于公钥编译码器jar文件

lucene-analyzers-smartcn-XX.jar
An add-on analysis library that provides word segmentation for Simplified
Chinese.

为简体中文分词提供的分析库。

lucene-analyzers-stempel-XX.jar
An add-on analysis library that contains a universal algorithmic stemmer,
including tables for the Polish language.

common/src/java
icu/src/java
kuromoji/src/java
morfologik/src/java
nori/src/java
opennlp/src/java
phonetic/src/java
smartcn/src/java
stempel/src/java
The source code for the libraries.

common/src/test
icu/src/test
kuromoji/src/test
morfologik/src/test
nori/src/test
opennlp/src/test
phonetic/src/test
smartcn/src/test
stempel/src/test
Unit tests for the libraries.

你可能感兴趣的:(Lucene源码分析 - analysis)