ElasticSearch Source Code Analysis (4): Search

To walk through how ElasticSearch performs a search, let's start with a simple Java search example:

	

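import java.net.InetSocketAddress;

import org.elasticsearch.action.search.SearchResponse;
import org.elasticsearch.action.search.SearchType;
import org.elasticsearch.client.Client;
import org.elasticsearch.client.transport.TransportClient;
import org.elasticsearch.common.settings.Settings;
import org.elasticsearch.common.transport.InetSocketTransportAddress;
import org.elasticsearch.index.query.QueryBuilders;
import org.elasticsearch.search.SearchHit;
import org.elasticsearch.search.SearchHits;

Client client = new TransportClient.Builder()
        .settings(Settings.settingsBuilder()
                .put("discovery.type", "zen") // cluster discovery mechanism
                .put("discovery.zen.minimum_master_nodes", 2) // require at least 2 master-eligible nodes
                .put("discovery.zen.ping_timeout", "200ms") // cluster ping timeout; if too small, network latency may prevent discovery
                .build())
        .build()
        .addTransportAddress(new InetSocketTransportAddress(new InetSocketAddress("localhost", 9300)));

SearchResponse response = client.prepareSearch("users")
        .setTypes("user")
        .setSearchType(SearchType.DFS_QUERY_THEN_FETCH)
        .setQuery(QueryBuilders.termQuery("name", "fox")) // Query
        // Filter: the 2.x API that provides TransportClient.Builder expresses filters as queries,
        // so setPostFilter/QueryBuilders replaces the older setFilter/FilterBuilders calls
        .setPostFilter(QueryBuilders.rangeQuery("age").from(20).to(30))
        .setFrom(0).setSize(60).setExplain(true)
        .execute().actionGet();

SearchHits hits = response.getHits();
for (SearchHit hit : hits.getHits()) {
    System.out.println(hit.getSourceAsString());
}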

The first step of a search is to configure how the client works with the cluster. Settings encapsulates the elasticsearch configuration and is constructed with the Builder pattern.

    public static Builder builder() {
        return new Builder();
    }

    /**
     * Returns a builder to be used in order to build settings.
     */
    public static Builder settingsBuilder() {
        return new Builder();
    }

    /**
     * A builder allowing to put different settings and then {@link #build()} an immutable
     * settings implementation. Use {@link Settings#settingsBuilder()} in order to
     * construct it.
     */
    public static class Builder {

        public static final Settings EMPTY_SETTINGS = new Builder().build();

        private final Map<String, String> map = new LinkedHashMap<>();

        private Builder() {

        }

        // ... put(...) and the other builder methods are omitted in this excerpt ...

        /**
         * Builds a {@link Settings} (underlying uses {@link Settings}) based on everything
         * set on this builder.
         */
        public Settings build() {
            return new Settings(Collections.unmodifiableMap(map));
        }
    }
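
To make the Builder pattern above concrete, here is a minimal, self-contained sketch. MiniSettings and its methods are hypothetical names invented for illustration; this is not the Elasticsearch Settings class. Unlike the excerpt above, the sketch copies the map in build(), so that later changes to the builder cannot show through in an object that was already built.

```java
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical, simplified stand-in for Settings (illustration only)
final class MiniSettings {
    private final Map<String, String> values;

    private MiniSettings(Map<String, String> values) {
        this.values = values;
    }

    // Entry point, mirroring the role of Settings.settingsBuilder()
    static Builder builder() {
        return new Builder();
    }

    // Returns the stored value, or null if the key was never set
    String get(String key) {
        return values.get(key);
    }

    static final class Builder {
        // LinkedHashMap keeps the keys in insertion order
        private final Map<String, String> map = new LinkedHashMap<>();

        private Builder() {
        }

        // Stores the value as a string and returns this builder so calls can be chained
        Builder put(String key, Object value) {
            map.put(key, String.valueOf(value));
            return this;
        }

        // Copies the map, then wraps it so the built object is read-only
        MiniSettings build() {
            return new MiniSettings(Collections.unmodifiableMap(new LinkedHashMap<>(map)));
        }
    }
}
```

A call such as MiniSettings.builder().put("discovery.type", "zen").build() then reads much like the Settings.settingsBuilder() chain in the search example.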

For the individual configuration properties, refer to the official documentation; they are not covered one by one here.

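Once the settings are built, they need to be passed to the client. Here we use TransportClient, which connects directly to the server over a socket. TransportClient is a lightweight way to connect: it talks to the server through the netty networking framework and is easy to maintain.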

The TransportClient source also uses the Builder pattern to create the client object.

public static class Builder {
        /**
         * The settings to configure the transport client with.
         */
        public Builder settings(Settings.Builder settings) {
            return settings(settings.build());
        }

        /**
         * Builds a new instance of the transport client.
         */
        public TransportClient build() {
            Settings settings = InternalSettingsPreparer.prepareSettings(this.settings);
            settings = settingsBuilder()
                    .put(NettyTransport.PING_SCHEDULE, "5s") // ping the server every 5 seconds (heartbeat)
                    .put(settings)
                    .put("network.server", false)
                    .put("node.client", true) // connect as a client
                    .put(CLIENT_TYPE_SETTING, CLIENT_TYPE)
                    .build();

            PluginsService pluginsService = new PluginsService(settings, null, null, pluginClasses);
            this.settings = pluginsService.updatedSettings();

            Version version = Version.CURRENT;

            final ThreadPool threadPool = new ThreadPool(settings); // create the thread pool
            NamedWriteableRegistry namedWriteableRegistry = new NamedWriteableRegistry();

            boolean success = false;
            try {
                ModulesBuilder modules = new ModulesBuilder();
                modules.add(new Version.Module(version));
                // register the modules
                // plugin modules must be added here, before others or we can get crazy injection errors...
                for (Module pluginModule : pluginsService.nodeModules()) {
                    modules.add(pluginModule);
                }
                modules.add(new PluginsModule(pluginsService));
                modules.add(new SettingsModule(this.settings));
                modules.add(new NetworkModule(namedWriteableRegistry));
                modules.add(new ClusterNameModule(this.settings));
                modules.add(new ThreadPoolModule(threadPool));
                modules.add(new TransportModule(this.settings, namedWriteableRegistry));
                modules.add(new SearchModule() {
                    @Override
                    protected void configure() {
                        // noop
                    }
                });
                modules.add(new ActionModule(true));
                modules.add(new ClientTransportModule());
                modules.add(new CircuitBreakerModule(this.settings));

                pluginsService.processModules(modules);

                Injector injector = modules.createInjector();
                final TransportService transportService = injector.getInstance(TransportService.class);
                transportService.start();
                transportService.acceptIncomingRequests();

                TransportClient transportClient = new TransportClient(injector);
                success = true;
                return transportClient;
            } finally {
                if (!success) {
                    ThreadPool.terminate(threadPool, 10, TimeUnit.SECONDS);
                }
            }
        }
}
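
The build() method above guards the thread pool with a success flag: if anything throws before the client is returned, the finally block shuts the pool down so its threads are not leaked. Below is a minimal sketch of that idiom using only java.util.concurrent. GuardedFactory, PooledClient and the failDuringSetup switch are hypothetical names for illustration, and terminate() is only an assumed approximation of what a helper like ThreadPool.terminate might do.

```java
import java.util.concurrent.ExecutorService;
import java.util.concurrent.TimeUnit;

final class GuardedFactory {

    // Hypothetical client that takes ownership of a thread pool
    static final class PooledClient {
        final ExecutorService pool;

        PooledClient(ExecutorService pool) {
            this.pool = pool;
        }
    }

    // Builds a client; if setup fails, the pool is shut down before the exception propagates
    static PooledClient build(ExecutorService pool, boolean failDuringSetup) {
        boolean success = false;
        try {
            if (failDuringSetup) {
                // Stands in for any failure while wiring modules together
                throw new IllegalStateException("setup failed");
            }
            PooledClient client = new PooledClient(pool);
            success = true;
            return client;
        } finally {
            if (!success) {
                terminate(pool, 10, TimeUnit.SECONDS);
            }
        }
    }

    // Graceful shutdown first, forced shutdown if the timeout expires or we are interrupted
    static void terminate(ExecutorService pool, long timeout, TimeUnit unit) {
        pool.shutdown();
        try {
            if (!pool.awaitTermination(timeout, unit)) {
                pool.shutdownNow();
            }
        } catch (InterruptedException e) {
            pool.shutdownNow();
            Thread.currentThread().interrupt();
        }
    }
}
```

The flag is used instead of a catch block so that any kind of failure, checked or unchecked, triggers the cleanup without being caught and rethrown.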

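With the client created, we can run a query. First we configure the query: client.prepareSearch does some preparation, mainly setting the target indices and creating the search request. setTypes() then sets the document types, and setSearchType() sets the search behavior, which affects search precision (for example, computing term frequencies more accurately before sorting). Next we set the filter and the query object (which follows the Query DSL, a domain-specific language), configure pagination, and execute the query. The result is a SearchHits object that holds a set of SearchHit objects, each exposing attributes such as the type, score, source, index and highlighting.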
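
To illustrate how setFrom()/setSize() interact with a search that is spread over several shards, here is a rough sketch of the merge step in a query-then-fetch style search: each shard returns its own scored candidates, and the coordinating side sorts them by score and keeps the window [from, from + size). ShardMergeSketch, ScoredDoc and merge are hypothetical names; this is a simplified illustration under those assumptions, not the Elasticsearch implementation.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

final class ShardMergeSketch {

    // Hypothetical hit record: a document id plus its relevance score
    static final class ScoredDoc {
        final String id;
        final float score;

        ScoredDoc(String id, float score) {
            this.id = id;
            this.score = score;
        }
    }

    // Merges the per-shard candidates, sorts by score (highest first),
    // and returns the page described by from and size
    static List<ScoredDoc> merge(List<List<ScoredDoc>> perShard, int from, int size) {
        List<ScoredDoc> all = new ArrayList<>();
        for (List<ScoredDoc> shardHits : perShard) {
            all.addAll(shardHits);
        }
        all.sort(Comparator.comparingDouble((ScoredDoc d) -> d.score).reversed());

        // Clamp the window so an out-of-range page yields an empty list
        int start = Math.min(Math.max(from, 0), all.size());
        int end = start + Math.min(Math.max(size, 0), all.size() - start);
        return new ArrayList<>(all.subList(start, end));
    }
}
```

Because the ranking is only known after merging, every shard has to supply enough candidates to cover from + size, which is one reason deep pagination gets expensive.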

public interface SearchHit extends Streamable, ToXContent, Iterable<SearchHitField> {

    /**
     * The score.
     */
    float score();

    /**
     * The score.
     */
    float getScore();

    /**
     * The index of the hit.
     */
    String index();

    /**
     * The index of the hit.
     */
    String getIndex();

    /**
     * The id of the document.
     */
    String id();

    /**
     * The id of the document.
     */
    String getId();

    /**
     * The type of the document.
     */
    String type();

    /**
     * The type of the document.
     */
    String getType();

    /**
     * If this is a nested hit then nested reference information is returned otherwise null is returned.
     */
    NestedIdentity getNestedIdentity();

    /**
     * The version of the hit.
     */
    long version();

    /**
     * The version of the hit.
     */
    long getVersion();

    /**
     * Returns bytes reference, also un compress the source if needed.
     */
    BytesReference sourceRef();

    /**
     * Returns bytes reference, also un compress the source if needed.
     */
    BytesReference getSourceRef();

    /**
     * The source of the document (can be null). Note, its a copy of the source
     * into a byte array, consider using {@link #sourceRef()} so there won't be a need to copy.
     */
    byte[] source();

    /**
     * Is the source empty (not available) or not.
     */
    boolean isSourceEmpty();

    /**
     * The source of the document as a map (can be null).
     */
    Map<String, Object> getSource();

    /**
     * The source of the document as string (can be null).
     */
    String sourceAsString();

    /**
     * The source of the document as string (can be null).
     */
    String getSourceAsString();

    /**
     * The source of the document as a map (can be null).
     */
    Map<String, Object> sourceAsMap() throws ElasticsearchParseException;

    /**
     * If enabled, the explanation of the search hit.
     */
    Explanation explanation();

    /**
     * If enabled, the explanation of the search hit.
     */
    Explanation getExplanation();

    /**
     * The hit field matching the given field name.
     */
    public SearchHitField field(String fieldName);

    /**
     * A map of hit fields (from field name to hit fields) if additional fields
     * were required to be loaded.
     */
    Map<String, SearchHitField> fields();

    /**
     * A map of hit fields (from field name to hit fields) if additional fields
     * were required to be loaded.
     */
    Map<String, SearchHitField> getFields();

    /**
     * A map of highlighted fields.
     */
    Map<String, HighlightField> highlightFields();

    /**
     * A map of highlighted fields.
     */
    Map<String, HighlightField> getHighlightFields();

    /**
     * An array of the sort values used.
     */
    Object[] sortValues();

    /**
     * An array of the sort values used.
     */
    Object[] getSortValues();

    /**
     * The set of query and filter names the query matched with. Mainly makes sense for compound filters and queries.
     */
    String[] matchedQueries();

    /**
     * The set of query and filter names the query matched with. Mainly makes sense for compound filters and queries.
     */
    String[] getMatchedQueries();

    /**
     * The shard of the search hit.
     */
    SearchShardTarget shard();

    /**
     * The shard of the search hit.
     */
    SearchShardTarget getShard();

    /**
     * @return Inner hits or null if there are none
     */
    Map<String, SearchHits> getInnerHits();

    /**
     * Encapsulates the nested identity of a hit.
     */
    public interface NestedIdentity {

        /**
         * Returns the nested field in the source this hit originates from
         */
        public Text getField();

        /**
         * Returns the offset in the nested array of objects in the source this hit
         */
        public int getOffset();

        /**
         * Returns the next child nested level if there is any, otherwise null is returned.
         *
         * In the case of mappings with multiple levels of nested object fields
         */
        public NestedIdentity getChild();
    }
}



