DougLeaMrConcurrency

HBase第一天：HBase组件及架构、安装HBase部署集群、HBase的shell操作、HBase数据结构、命名空间、原理、读写流程、flush与合并、hbase-default.xml配置详解

本文目录

第1章 HBase简介

1.1 什么是HBase

1.2 Hbase特点

1.3 HBase架构

1.3 HBase中的角色

1.3.1 HMaster

1.3.2 RegionServer

1.2.3 其他组件

第2章 HBase安装

2.1 Zookeeper正常部署

2.2 Hadoop正常部署

2.3 HBase的解压

2.4 HBase的配置文件

2.5 HBase远程发送到其他集群

2.6 HBase服务的启动

2.7 查看HBase页面

第3章 HBase Shell操作

3.1 基本操作

3.2 表的操作

第4章 HBase数据结构

4.1 RowKey

4.2 Column Family

4.3 Cell

4.4 Time Stamp

4.5 命名空间相当于mysql的数据库

第5 章 HBase原理

5.1 读流程

5.2 写流程

5.3 数据flush过程

5.4 数据合并过程

第1章 HBase简介

1.1 什么是HBase

HBase的原型是Google的BigTable论文，受到了该论文思想的启发，目前作为Hadoop的子项目来开发维护，用于支持结构化的数据存储。

官方网站：http://hbase.apache.org

-- 2006年Google发表BigTable白皮书

-- 2006年开始开发HBase

-- 2008年北京成功开奥运会，程序员默默地将HBase弄成了Hadoop的子项目

-- 2010年HBase成为Apache顶级项目

-- 现在很多公司二次开发出了很多发行版本，你也开始使用了。

HBase是一个高可靠性、高性能、面向列、可伸缩的分布式存储系统，利用HBASE技术可在廉价PC Server上搭建起大规模结构化存储集群。

HBase的目标是存储并处理大型的数据，更具体来说是仅需使用普通的硬件配置，就能够处理由成千上万的行和列所组成的大型数据。

HBase是Google Bigtable的开源实现，但是也有很多不同之处。比如：Google Bigtable利用GFS作为其文件存储系统，HBase利用Hadoop HDFS作为其文件存储系统；Google运行MAPREDUCE来处理Bigtable中的海量数据，HBase同样利用Hadoop MapReduce来处理HBase中的海量数据；Google Bigtable利用Chubby作为协同服务，HBase利用Zookeeper作为对应。

1.2 Hbase特点

1）海量存储

Hbase适合存储PB级别的海量数据，在PB级别的数据以及采用廉价PC存储的情况下，能在几十到百毫秒内返回数据。这与Hbase的极易扩展性息息相关。正式因为Hbase良好的扩展性，才为海量数据的存储提供了便利。

2）列式存储

这里的列式存储其实说的是列族存储，Hbase是根据列族来存储数据的。列族下面可以有非常多的列，列族在创建表的时候就必须指定。

3）极易扩展

Hbase的扩展性主要体现在两个方面，一个是基于上层处理能力（RegionServer）的扩展，一个是基于存储的扩展（HDFS）。
通过横向添加RegionSever的机器，进行水平扩展，提升Hbase上层的处理能力，提升Hbsae服务更多Region的能力。

备注：RegionServer的作用是管理region、承接业务的访问，这个后面会详细的介绍通过横向添加Datanode的机器，进行存储层扩容，提升Hbase的数据存储能力和提升后端存储的读写能力。

4）高并发

由于目前大部分使用Hbase的架构，都是采用的廉价PC，因此单个IO的延迟其实并不小，一般在几十到上百ms之间。这里说的高并发，主要是在并发的情况下，Hbase的单个IO延迟下降并不多。能获得高并发、低延迟的服务。

5）稀疏

稀疏主要是针对Hbase列的灵活性，在列族中，你可以指定任意多的列，在列数据为空的情况下，是不会占用存储空间的。

1.3 HBase架构

Hbase架构如图1所示：

图1 HBase架构图

从图中可以看出Hbase是由Client、Zookeeper、Master、HRegionServer、HDFS等几个组件组成，下面来介绍一下几个组件的相关功能：

1）Client

Client包含了访问Hbase的接口，另外Client还维护了对应的cache来加速Hbase的访问，比如cache的.META.元数据的信息。

2）Zookeeper

HBase通过Zookeeper来做master的高可用、RegionServer的监控、元数据的入口以及集群配置的维护等工作。具体工作如下：

通过Zoopkeeper来保证集群中只有1个master在运行，如果master异常，会通过竞争机制产生新的master提供服务

通过Zoopkeeper来监控RegionServer的状态，当RegionSevrer有异常的时候，通过回调的形式通知Master RegionServer上下线的信息

通过Zoopkeeper存储元数据的统一入口地址（元数据存储在哪里？——master和zk）

3）Hmaster

master节点的主要职责如下：
为RegionServer分配Region
维护整个集群的负载均衡
维护集群的元数据信息
发现失效的Region，并将失效的Region分配到正常的RegionServer上（RegionServer维护对应Hdfs上文件的元数据）
当RegionSever失效的时候，协调对应Hlog的拆分

4）HregionServer

HregionServer直接对接用户的读写请求，是真正的“干活”的节点。它的功能概括如下：
管理master为其分配的Region
处理来自客户端的读写请求
负责和底层HDFS的交互，存储数据到HDFS
负责Region变大以后的拆分
负责Storefile的合并工作

一个regionserver 有一个Hlog hlog 类似fsimagine

Region 类似ReginServer 的一张表，可以被横向切分

Region可以有多个列族（store），一个store 存储一个列族的表，切分之后列族（一个列族可以在多个region下的store中存储）

数据在内存中存的，溢写一次形成一个StoreFile，HFile 是存储形式，StoreFile是存储的组件

Region对应表（一个region只能来自一张表，一张表可以放在多个不同regionserver的region中）

store 对应列族（一个store 只能对应一个列族，但是被切分之后一个列族可以放在多个不同regionserver的region的store中）

5）HDFS

HDFS为Hbase提供最终的底层数据存储服务，同时为HBase提供高可用（Hlog存储在HDFS）的支持，具体功能概括如下：
提供元数据和表数据的底层分布式存储服务
数据多副本，保证的高可靠和高可用性

1.3 HBase中的角色

1.3.1 HMaster

功能

1．监控RegionServer

2．处理RegionServer故障转移

3．处理元数据的变更

4．处理region的分配或转移

5．在空闲时间进行数据的负载均衡

6．通过Zookeeper发布自己的位置给客户端

1.3.2 RegionServer

功能

1．负责存储HBase的实际数据

2．处理分配给它的Region

3．刷新缓存到HDFS

4．维护Hlog

5．执行压缩

6．负责处理Region分片

1.2.3 其他组件

1．Write-Ahead logs

HBase的修改记录，当对HBase读写数据的时候，数据不是直接写进磁盘，它会在内存中保留一段时间（时间以及数据量阈值可以设定）。但把数据保存在内存中可能有更高的概率引起数据丢失，为了解决这个问题，数据会先写在一个叫做Write-Ahead logfile的文件中，然后再写入内存中。所以在系统出现故障的时候，数据可以通过这个日志文件重建。

2．Region

Hbase表的分片，HBase表会根据RowKey值被切分成不同的region存储在RegionServer中，在一个RegionServer中可以有多个不同的region。

3．Store

HFile存储在Store中，一个Store对应HBase表中的一个列族。

4．MemStore

顾名思义，就是内存存储，位于内存中，用来保存当前的数据操作，所以当数据保存在WAL中之后，RegsionServer会在内存中存储键值对。

5．HFile

这是在磁盘上保存原始数据的实际的物理文件，是实际的存储文件。StoreFile是以Hfile的形式存储在HDFS的。

第2章 HBase安装

2.1 Zookeeper正常部署

首先保证Zookeeper集群的正常部署，并启动之：

[atguigu@hadoop102 zookeeper-3.4.10]$ bin/zkServer.sh start

[atguigu@hadoop103 zookeeper-3.4.10]$ bin/zkServer.sh start

[atguigu@hadoop104 zookeeper-3.4.10]$ bin/zkServer.sh start

2.2 Hadoop正常部署

Hadoop集群的正常部署并启动：

[atguigu@hadoop102 hadoop-2.7.2]$ sbin/start-dfs.sh

[atguigu@hadoop103 hadoop-2.7.2]$ sbin/start-yarn.sh

2.3 HBase的解压

准备hadoop-hbase-jar和hbase-bin包，选择hbse-bin包

解压HBase到指定目录：

[atguigu@hadoop102 software]$ tar -zxvf hbase-1.3.1-bin.tar.gz -C /opt/module

重命名

mv hbase-1.3.1 hbase

2.4 HBase的配置文件

修改HBase对应的配置文件。

1）hbase-env.sh修改内容：

27行export JAVA_HOME=/opt/module/jdk1.8.0_144

128行export HBASE_MANAGES_ZK=false

注释掉 46 47两行

2）hbase-site.xml修改内容：


	     
		hbase.rootdir     
		hdfs://hadoop102:9000/hbase   
	

	   
		hbase.cluster.distributed
		true
	

   
	
		hbase.master.port
		16000
	

	   
		hbase.zookeeper.quorum
	     hadoop102:2181,hadoop103:2181,hadoop104:2181
	


	   
		hbase.zookeeper.property.dataDir
	     /opt/module/zookeeper-3.4.10/zkData

3）regionservers：

hadoop102

hadoop103

hadoop104

4）软连接hadoop配置文件到hbase：

[atguigu@hadoop102 module]$ ln -s /opt/module/hadoop-2.7.2/etc/hadoop/core-site.xml /opt/module/hbase/conf/core-site.xml

[atguigu@hadoop102 module]$ ln -s /opt/module/hadoop-2.7.2/etc/hadoop/hdfs-site.xml /opt/module/hbase/conf/hdfs-site.xml

2.5 HBase远程发送到其他集群

[atguigu@hadoop102 module]$ xsync hbase/

2.6 HBase服务的启动

1．启动方式1

[atguigu@hadoop102 hbase]$ bin/hbase-daemon.sh start master

[atguigu@hadoop102 hbase]$ bin/hbase-daemon.sh start regionserver

在hadoop103、104上启动，查看jpsall

提示：如果集群之间的节点时间不同步，会导致regionserver无法启动，抛出ClockOutOfSyncException异常。

修复提示：

a、同步时间服务

请参：Hadoop入门（下）：伪分布式搭建、完全分布式搭建、SSH免密登录、集群分发脚本xsync、集群时间同步、HDFS运行MapReduce、Yarn、jpsall配置、hadoop编译源码、常见错误

b、属性：hbase.master.maxclockskew设置更大的值

hbase.master.maxclockskew

180000

Time difference of regionserver from master

2．启动方式2

[atguigu@hadoop102 hbase]$ bin/start-hbase.sh

对应的停止服务：

[atguigu@hadoop102 hbase]$ bin/stop-hbase.sh

2.7 查看HBase页面

启动成功后，可以通过“host:port”的方式来访问HBase管理页面，例如：

http://hadoop102:16010

第3章 HBase Shell操作

3.1 基本操作

注意：HBase中向左删除是用Ctrl + Backspace或Shift + Backspace组合键删除

1．进入HBase客户端命令行

[atguigu@hadoop102 hbase]$ bin/hbase shell

2．查看帮助命令

hbase(main):001:0> help

3．查看当前数据库中有哪些表

hbase(main):002:0> list

3.2 表的操作

1．创建表

hbase(main):002:0> create 'student','info'

2．插入数据到表

hbase(main):003:0> put 'student','1001','info:sex','male'

hbase(main):004:0> put 'student','1001','info:age','18'

hbase(main):005:0> put 'student','1002','info:name','Janna'

hbase(main):006:0> put 'student','1002','info:sex','female'

hbase(main):007:0> put 'student','1002','info:age','20'

3．扫描查看表数据 (区分大小写)

hbase(main):008:0> scan 'student'

hbase(main):009:0> scan 'student',{STARTROW => '1001', STOPROW => '1001'}

hbase(main):010:0> scan 'student',{STARTROW => '1001'}

4．查看表结构

hbase(main):011:0> describe ‘student’

5．更新指定字段的数据

hbase(main):012:0> put 'student','1001','info:name','Nick'

hbase(main):013:0> put 'student','1001','info:age','100'

6．查看“指定行”或“指定列族:列”的数据

hbase(main):014:0> get 'student','1001'

hbase(main):015:0> get 'student','1001','info:name'

7．统计表数据行数

hbase(main):021:0> count 'student'

8．删除数据

删除某rowkey的全部数据：

hbase(main):016:0> deleteall 'student','1001'

删除某rowkey的某一列数据：

hbase(main):017:0> delete 'student','1002','info:sex'

hbase(main):017:0> delete 'student','1002','info:sex'，时间戳

9．清空表数据

hbase(main):018:0> truncate 'student'

提示：清空表的操作顺序为先disable，然后再truncate。

10．删除表

首先需要先让该表为disable状态：

hbase(main):019:0> disable 'student'

然后才能drop这个表：

hbase(main):020:0> drop 'student'

提示：如果直接drop表，会报错：ERROR: Table student is enabled. Disable it first.

11．变更表信息

将info列族中的数据存放3个版本：

hbase(main):022:0> alter 'student',{NAME=>'info',VERSIONS=>3}

hbase(main):022:0> get 'student','1001',{COLUMN=>'info:name',VERSIONS=>3}

注意：更改版本号之后要重新插入数据测试

测试一下删除，如果不指定时间戳则将所有版本全部删除

hbase(main):012:0> delete 'student','1001','info:name'

第4章 HBase数据结构

4.1 RowKey

与nosql数据库们一样,RowKey是用来检索记录的主键。访问HBASE table中的行，只有三种方式：

1.通过单个RowKey访问

2.通过RowKey的range（正则）

3.全表扫描

RowKey行键 (RowKey)可以是任意字符串(最大长度是64KB，实际应用中长度一般为 10-100bytes)，在HBASE内部，RowKey保存为字节数组。存储时，数据按照RowKey的字典序(byte order)排序存储。设计RowKey时，要充分排序存储这个特性，将经常一起读取的行存储放到一起。(位置相关性)

4.2 Column Family

列族：HBASE表中的每个列，都归属于某个列族。列族是表的schema的一部分(而列不是)，必须在使用表之前定义。列名都以列族作为前缀。例如 courses:history，courses:math都属于courses 这个列族。

4.3 Cell

由{rowkey, column Family:columu, version} 唯一确定的单元。cell中的数据是没有类型的，全部是字节码形式存贮。 Version ——时间戳

关键字：无类型、字节码

4.4 Time Stamp

HBASE 中通过rowkey和columns确定的为一个存贮单元称为cell。每个 cell都保存着同一份数据的多个版本。版本通过时间戳来索引。时间戳的类型是 64位整型。时间戳可以由HBASE(在数据写入时自动 )赋值，此时时间戳是精确到毫秒的当前系统时间。时间戳也可以由客户显式赋值。如果应用程序要避免数据版本冲突，就必须自己生成具有唯一性的时间戳。每个 cell中，不同版本的数据按照时间倒序排序，即最新的数据排在最前面。

为了避免数据存在过多版本造成的的管理 (包括存贮和索引)负担，HBASE提供了两种数据版本回收方式。一是保存数据的最后n个版本，二是保存最近一段时间内的版本（比如最近七天）。用户可以针对每个列族进行设置。

4.5 命名空间相当于mysql的数据库

命名空间的结构:

1) Table：表，所有的表都是命名空间的成员，即表必属于某个命名空间，如果没有指定，则在default默认的命名空间中。

2) RegionServer group：一个命名空间包含了默认的RegionServer Group。

3) Permission：权限，命名空间能够让我们来定义访问控制列表ACL（Access Control List）。例如，创建表，读取表，删除，更新等等操作。

4) Quota：限额，可以强制一个命名空间可包含的region的数量。

操作

创建命名空间：

create_namespace ‘20190902’

查看命名空间

list_namespace

创建一个此命名空间的表

create '20190902:teachers' ,'f1'

删除命名空间

disable ‘20190902:teachers’

drop ‘20190902:teachers’

drop_namespace ‘20190902’

注意：删除命名空间前要删除表

第5 章 HBase原理

5.1 读流程

HBase读数据流程如图3所示

图3所示 HBase读数据流程

Client先访问zookeeper，从meta表读取region的位置，
然后读取meta表中的数据。meta中又存储了用户表的region信息；

3）根据namespace、表名和rowkey在meta表中找到对应的region信息；

4）找到这个region对应的regionserver；

5）查找对应的region；

6）先从MemStore找数据，如果没有，再到BlockCache里面读；

7）BlockCache还没有，再到StoreFile上读(为了读取的效率)；

8）如果是从StoreFile里面读取的数据，不是直接返回给客户端，而是先写入BlockCache，再返回给客户端。

细节分析：

进入102zookeeper目录 bin/zkCli.sh

ls / 找到 hbase

ls /hbase

get /hbase/meta-region-server 可以看到meta表在102上

在hadoop102中关闭HRegionServer

再次查看region-server，已切换到了hadoop103

关掉zkcli quit

5.2 写流程

Hbase写流程如图2所示

图2 HBase写数据流程

1）Client向HregionServer发送写请求；

2）HregionServer将数据写到HLog（write ahead log）。为了数据的持久化和恢复；

3）HregionServer将数据写到内存（MemStore）；

4）反馈Client写成功。

5.3 数据flush过程

1）当MemStore数据达到阈值（默认是128M，老版本是64M），将数据刷到硬盘，将内存中的数据删除，同时删除HLog中的历史数据；

2）并将数据存储到HDFS中；

MemStore数据达到regionserver 的40% 会触发 regionserver级别的flush

5.4 数据合并过程

1）当数据块达到4块（最大能存三块），Hmaster将数据块加载到本地，进行合并；

2）当合并的数据超过256M，进行拆分，将拆分后的Region分配给不同的HregionServer管理；

3）当HregionServer宕机后，将HregionServer上的hlog拆分，然后分配给不同的HregionServer加载，修改.META.；

4）注意：HLog会同步到HDFS。

附：hbase-default.xml注释版（hbase-default.xml源码详解）

  
  
  
      
      
        hbase.tmp.dir  
        ${java.io.tmpdir}/hbase-${user.name}  
        Temporary directory on the local filesystem.  
            Change this setting to point to a location more permanent  
            than '/tmp', the usual resolve for java.io.tmpdir, as the  
            '/tmp' directory is cleared on machine restart.  
          
      
      
      
        hbase.rootdir  
        ${hbase.tmp.dir}/hbase  
        The directory shared by region servers and into  
            which HBase persists. The URL should be 'fully-qualified'  
            to include the filesystem scheme. For example, to specify the  
            HDFS directory '/hbase' where the HDFS instance's namenode is  
            running at namenode.example.org on port 9000, set this value to:  
            hdfs://namenode.example.org:9000/hbase. By default, we write  
            to whatever ${hbase.tmp.dir} is set too -- usually /tmp --  
            so change this configuration or else all data will be lost on  
            machine restart.  
          
      
      
      
        hbase.fs.tmp.dir  
        /user/${user.name}/hbase-staging  
        A staging directory in default file system (HDFS)  
            for keeping temporary data.  
          
      
      
      
        hbase.bulkload.staging.dir  
        ${hbase.fs.tmp.dir}  
        A staging directory in default file system (HDFS)  
            for bulk loading.  
          
      
      
      
        hbase.cluster.distributed  
        false  
        The mode the cluster will be in. Possible values are  
            false for standalone mode and true for distributed mode. If  
            false, startup will run all HBase and ZooKeeper daemons together  
            in the one JVM.  
          
      
      
      
        hbase.zookeeper.quorum  
        localhost  
        Comma separated list of servers in the ZooKeeper ensemble  
            (This config. should have been named hbase.zookeeper.ensemble).  
            For example, "host1.mydomain.com,host2.mydomain.com,host3.mydomain.com".  
            By default this is set to localhost for local and pseudo-distributed  
            modes  
            of operation. For a fully-distributed setup, this should be set to a  
            full  
            list of ZooKeeper ensemble servers. If HBASE_MANAGES_ZK is set in  
            hbase-env.sh  
            this is the list of servers which hbase will start/stop ZooKeeper on as  
            part of cluster start/stop. Client-side, we will take this list of  
            ensemble members and put it together with the  
            hbase.zookeeper.clientPort  
            config. and pass it into zookeeper constructor as the connectString  
            parameter.  
          
      
      
      
        hbase.local.dir  
        ${hbase.tmp.dir}/local/  
        Directory on the local filesystem to be used  
            as a local storage.  
          
      
  
      
      
        hbase.master.port  
        16000  
        The port the HBase Master should bind to.  
      
      
      
        hbase.master.info.port  
        16010  
        The port for the HBase Master web UI.  
            Set to -1 if you do not want a UI instance run.  
          
      
      
      
        hbase.master.info.bindAddress  
        0.0.0.0  
        The bind address for the HBase Master web UI  
          
      
      
      
        hbase.master.logcleaner.plugins  
        org.apache.hadoop.hbase.master.cleaner.TimeToLiveLogCleaner  
          
        A comma-separated list of BaseLogCleanerDelegate invoked  
            by  
            the LogsCleaner service. These WAL cleaners are called in order,  
            so put the cleaner that prunes the most files in front. To  
            implement your own BaseLogCleanerDelegate, just put it in HBase's classpath  
            and add the fully qualified class name here. Always add the above  
            default log cleaners in the list.  
          
      
      
      
        hbase.master.logcleaner.ttl  
        600000  
        Maximum time a WAL can stay in the .oldlogdir directory,  
            after which it will be cleaned by a Master thread.  
          
      
      
        hbase.master.hfilecleaner.plugins  
        org.apache.hadoop.hbase.master.cleaner.TimeToLiveHFileCleaner  
          
        A comma-separated list of BaseHFileCleanerDelegate  
            invoked by  
            the HFileCleaner service. These HFiles cleaners are called in order,  
            so put the cleaner that prunes the most files in front. To  
            implement your own BaseHFileCleanerDelegate, just put it in HBase's classpath  
            and add the fully qualified class name here. Always add the above  
            default log cleaners in the list as they will be overwritten in  
            hbase-site.xml.  
          
      
      
      
        hbase.master.catalog.timeout  
        600000  
        Timeout value for the Catalog Janitor from the master to  
            META.  
          
      
      
      
        hbase.master.infoserver.redirect  
        true  
        Whether or not the Master listens to the Master web  
            UI port (hbase.master.info.port) and redirects requests to the web  
            UI server shared by the Master and RegionServer.  
          
      
  
      
      
        hbase.regionserver.port  
        16020  
        The port the HBase RegionServer binds to.  
      
      
      
        hbase.regionserver.info.port  
        16030  
        The port for the HBase RegionServer web UI  
            Set to -1 if you do not want the RegionServer UI to run.  
          
      
      
      
        hbase.regionserver.info.bindAddress  
        0.0.0.0  
        The address for the HBase RegionServer web UI  
          
      
      
      
        hbase.regionserver.info.port.auto  
        false  
        Whether or not the Master or RegionServer  
            UI should search for a port to bind to. Enables automatic port  
            search if hbase.regionserver.info.port is already in use.  
            Useful for testing, turned off by default.  
          
      
      
      
        hbase.regionserver.handler.count  
        30  
        Count of RPC Listener instances spun up on RegionServers.  
            Same property is used by the Master for count of master handlers.  
          
      
      
      
        hbase.ipc.server.callqueue.handler.factor  
        0.1  
        Factor to determine the number of call queues.  
            A value of 0 means a single queue shared between all the handlers.  
            A value of 1 means that each handler has its own queue.  
          
      
      
      
      
        hbase.ipc.server.callqueue.read.ratio  
        0  
        Split the call queues into read and write queues.  
            The specified interval (which should be between 0.0 and 1.0)  
            will be multiplied by the number of call queues.  
            A value of 0 indicate to not split the call queues, meaning that both  
            read and write  
            requests will be pushed to the same set of queues.  
            A value lower than 0.5 means that there will be less read queues than  
            write queues.  
            A value of 0.5 means there will be the same number of read and write  
            queues.  
            A value greater than 0.5 means that there will be more read queues  
            than write queues.  
            A value of 1.0 means that all the queues except one are used to  
            dispatch read requests.  
  
            Example: Given the total number of call queues being 10  
            a read.ratio of 0 means that: the 10 queues will contain both  
            read/write requests.  
            a read.ratio of 0.3 means that: 3 queues will contain only read  
            requests  
            and 7 queues will contain only write requests.  
            a read.ratio of 0.5 means that: 5 queues will contain only read  
            requests  
            and 5 queues will contain only write requests.  
            a read.ratio of 0.8 means that: 8 queues will contain only read  
            requests  
            and 2 queues will contain only write requests.  
            a read.ratio of 1 means that: 9 queues will contain only read requests  
            and 1 queues will contain only write requests.  
          
      
      
      
        hbase.ipc.server.callqueue.scan.ratio  
        0  
        Given the number of read call queues, calculated from the  
            total number  
            of call queues multiplied by the callqueue.read.ratio, the scan.ratio  
            property  
            will split the read call queues into small-read and long-read queues.  
            A value lower than 0.5 means that there will be less long-read queues  
            than short-read queues.  
            A value of 0.5 means that there will be the same number of short-read  
            and long-read queues.  
            A value greater than 0.5 means that there will be more long-read  
            queues than short-read queues  
            A value of 0 or 1 indicate to use the same set of queues for gets and  
            scans.  
  
            Example: Given the total number of read call queues being 8  
            a scan.ratio of 0 or 1 means that: 8 queues will contain both long and  
            short read requests.  
            a scan.ratio of 0.3 means that: 2 queues will contain only long-read  
            requests  
            and 6 queues will contain only short-read requests.  
            a scan.ratio of 0.5 means that: 4 queues will contain only long-read  
            requests  
            and 4 queues will contain only short-read requests.  
            a scan.ratio of 0.8 means that: 6 queues will contain only long-read  
            requests  
            and 2 queues will contain only short-read requests.  
          
      
      
      
        hbase.regionserver.msginterval  
        3000  
        Interval between messages from the RegionServer to Master  
            in milliseconds.  
          
      
      
      
        hbase.regionserver.logroll.period  
        3600000  
        Period at which we will roll the commit log regardless  
            of how many edits it has.  
          
      
      
      
        hbase.regionserver.logroll.errors.tolerated  
        2  
        The number of consecutive WAL close errors we will allow  
            before triggering a server abort. A setting of 0 will cause the  
            region server to abort if closing the current WAL writer fails during  
            log rolling. Even a small value (2 or 3) will allow a region server  
            to ride over transient HDFS errors.  
          
      
      
      
        hbase.regionserver.hlog.reader.impl  
        org.apache.hadoop.hbase.regionserver.wal.ProtobufLogReader  
          
        The WAL file reader implementation.  
      
      
      
        hbase.regionserver.hlog.writer.impl  
        org.apache.hadoop.hbase.regionserver.wal.ProtobufLogWriter  
          
        The WAL file writer implementation.  
      
      
      
        hbase.regionserver.global.memstore.size  
          
        Maximum size of all memstores in a region server before  
            new  
            updates are blocked and flushes are forced. Defaults to 40% of heap (0.4).  
            Updates are blocked and flushes are forced until size of all  
            memstores  
            in a region server hits  
            hbase.regionserver.global.memstore.size.lower.limit.  
            The default value in this configuration has been intentionally left  
            emtpy in order to  
            honor the old hbase.regionserver.global.memstore.upperLimit property if  
            present.  
          
      
      
      
        hbase.regionserver.global.memstore.size.lower.limit  
          
        Maximum size of all memstores in a region server before  
            flushes are forced.  
            Defaults to 95% of hbase.regionserver.global.memstore.size (0.95).  
            A 100% value for this value causes the minimum possible flushing to  
            occur when updates are  
            blocked due to memstore limiting.  
            The default value in this configuration has been intentionally left  
            emtpy in order to  
            honor the old hbase.regionserver.global.memstore.lowerLimit property if  
            present.  
          
      
      
      
        hbase.regionserver.optionalcacheflushinterval  
        3600000  
          
            Maximum amount of time an edit lives in memory before being automatically  
            flushed.  
            Default 1 hour. Set it to 0 to disable automatic flushing.  
          
      
      
        hbase.regionserver.catalog.timeout  
        600000  
        Timeout value for the Catalog Janitor from the  
            regionserver to META.  
      
      
      
        hbase.regionserver.dns.interface  
        default  
        The name of the Network Interface from which a region  
            server  
            should report its IP address.  
          
      
      
      
        hbase.regionserver.dns.nameserver  
        default  
        The host name or IP address of the name server (DNS)  
            which a region server should use to determine the host name used by  
            the  
            master for communication and display purposes.  
          
      
      
      
        hbase.regionserver.region.split.policy  
        org.apache.hadoop.hbase.regionserver.IncreasingToUpperBoundRegionSplitPolicy  
          
          
            A split policy determines when a region should be split. The various  
            other split policies that  
            are available currently are ConstantSizeRegionSplitPolicy,  
            DisabledRegionSplitPolicy,  
            DelimitedKeyPrefixRegionSplitPolicy, KeyPrefixRegionSplitPolicy etc.  
          
      
      
      
        hbase.regionserver.regionSplitLimit  
        1000  
          
            Limit for the number of regions after which no more region splitting  
            should take place.  
            This is not hard limit for the number of regions but acts as a guideline  
            for the regionserver  
            to stop splitting after a certain limit. Default is set to 1000.  
          
      
  
      
      
        zookeeper.session.timeout  
        90000  
        ZooKeeper session timeout in milliseconds. It is used in  
            two different ways.  
            First, this value is used in the ZK client that HBase uses to connect to  
            the ensemble.  
            It is also used by HBase when it starts a ZK server and it is passed as  
            the 'maxSessionTimeout'. See  
            http://hadoop.apache.org/zookeeper/docs/current/zookeeperProgrammers.html#ch_zkSessions.  
            For example, if a HBase region server connects to a ZK ensemble  
            that's also managed by HBase, then the  
            session timeout will be the one specified by this configuration. But, a  
            region server that connects  
            to an ensemble managed with a different configuration will be subjected  
            that ensemble's maxSessionTimeout. So,  
            even though HBase might propose using 90 seconds, the ensemble can have a  
            max timeout lower than this and  
            it will take precedence. The current default that ZK ships with is 40  
            seconds, which is lower than HBase's.  
          
      
      
      
        zookeeper.znode.parent  
        /hbase  
        Root ZNode for HBase in ZooKeeper. All of HBase's  
            ZooKeeper  
            files that are configured with a relative path will go under this node.  
            By default, all of HBase's ZooKeeper file path are configured with a  
            relative path, so they will all go under this directory unless  
            changed.  
          
      
      
      
        zookeeper.znode.rootserver  
        root-region-server  
        Path to ZNode holding root region location. This is  
            written by  
            the master and read by clients and region servers. If a relative path is  
            given, the parent folder will be ${zookeeper.znode.parent}. By  
            default,  
            this means the root location is stored at /hbase/root-region-server.  
          
      
      
      
        zookeeper.znode.acl.parent  
        acl  
        Root ZNode for access control lists.  
      
      
        hbase.zookeeper.dns.interface  
        default  
        The name of the Network Interface from which a ZooKeeper  
            server  
            should report its IP address.  
          
      
      
        hbase.zookeeper.dns.nameserver  
        default  
        The host name or IP address of the name server (DNS)  
            which a ZooKeeper server should use to determine the host name used  
            by the  
            master for communication and display purposes.  
          
      
      
      
        hbase.zookeeper.peerport  
        2888  
        Port used by ZooKeeper peers to talk to each other.  
            See  
            http://hadoop.apache.org/zookeeper/docs/r3.1.1/zookeeperStarted.html#sc_RunningReplicatedZooKeeper  
            for more information.  
          
      
      
      
        hbase.zookeeper.leaderport  
        3888  
        Port used by ZooKeeper for leader election.  
            See  
            http://hadoop.apache.org/zookeeper/docs/r3.1.1/zookeeperStarted.html#sc_RunningReplicatedZooKeeper  
            for more information.  
          
      
      
      
        hbase.zookeeper.useMulti  
        true  
        Instructs HBase to make use of ZooKeeper's multi-update  
            functionality.  
            This allows certain ZooKeeper operations to complete more quickly and  
            prevents some issues  
            with rare Replication failure scenarios (see the release note of  
            HBASE-2611 for an example).  
            IMPORTANT: only set this to true if all ZooKeeper servers in the cluster are on  
            version 3.4+  
            and will not be downgraded. ZooKeeper versions before 3.4 do not support  
            multi-update and  
            will not fail gracefully if multi-update is invoked (see ZOOKEEPER-1495).  
          
      
      
      
        hbase.config.read.zookeeper.config  
        false  
          
            Set to true to allow HBaseConfiguration to read the  
            zoo.cfg file for ZooKeeper properties. Switching this to true  
            is not recommended, since the functionality of reading ZK  
            properties from a zoo.cfg file has been deprecated.  
          
      
      
        hbase.zookeeper.property.initLimit  
        10  
        Property from ZooKeeper's config zoo.cfg.  
            The number of ticks that the initial synchronization phase can take.  
          
      
      
        hbase.zookeeper.property.syncLimit  
        5  
        Property from ZooKeeper's config zoo.cfg.  
            The number of ticks that can pass between sending a request and getting  
            an  
            acknowledgment.  
          
      
      
        hbase.zookeeper.property.dataDir  
        ${hbase.tmp.dir}/zookeeper  
        Property from ZooKeeper's config zoo.cfg.  
            The directory where the snapshot is stored.  
          
      
      
        hbase.zookeeper.property.clientPort  
        2181  
        Property from ZooKeeper's config zoo.cfg.  
            The port at which the clients will connect.  
          
      
      
        hbase.zookeeper.property.maxClientCnxns  
        300  
        Property from ZooKeeper's config zoo.cfg.  
            Limit on number of concurrent connections (at the socket level) that a  
            single client, identified by IP address, may make to a single member  
            of  
            the ZooKeeper ensemble. Set high to avoid zk connection issues running  
            standalone and pseudo-distributed.  
          
      
  
      
      
      
        hbase.client.write.buffer  
        2097152  
        Default size of the HTable client write buffer in bytes.  
            A bigger buffer takes more memory -- on both the client and server  
            side since server instantiates the passed write buffer to process  
            it -- but a larger buffer size reduces the number of RPCs made.  
            For an estimate of server-side memory-used, evaluate  
            hbase.client.write.buffer * hbase.regionserver.handler.count  
          
      
      
      
        hbase.client.pause  
        100  
        General client pause value. Used mostly as value to wait  
            before running a retry of a failed get, region lookup, etc.  
            See hbase.client.retries.number for description of how we backoff from  
            this initial pause amount and how this pause works w/ retries.  
          
      
      
      
        hbase.client.retries.number  
        35  
        Maximum retries. Used as maximum for all retryable  
            operations such as the getting of a cell's value, starting a row  
            update,  
            etc. Retry interval is a rough function based on hbase.client.pause. At  
            first we retry at this interval but then with backoff, we pretty  
            quickly reach  
            retrying every ten seconds. See HConstants#RETRY_BACKOFF for how the backup  
            ramps up. Change this setting and hbase.client.pause to suit your  
            workload.  
          
      
      
      
        hbase.client.max.total.tasks  
        100  
        The maximum number of concurrent tasks a single HTable  
            instance will  
            send to the cluster.  
          
      
      
      
        hbase.client.max.perserver.tasks  
        5  
        The maximum number of concurrent tasks a single HTable  
            instance will  
            send to a single region server.  
          
      
      
      
        hbase.client.max.perregion.tasks  
        1  
        The maximum number of concurrent connections the client  
            will  
            maintain to a single Region. That is, if there is already  
            hbase.client.max.perregion.tasks writes in progress for this region,  
            new puts  
            won't be sent to this region until some writes finishes.  
          
      
      
      
        hbase.client.scanner.caching  
        2147483647  
        Number of rows that we try to fetch when calling next  
            on a scanner if it is not served from (local, client) memory. This  
            configuration  
            works together with hbase.client.scanner.max.result.size to try and use  
            the  
            network efficiently. The default value is Integer.MAX_VALUE by default so  
            that  
            the network will fill the chunk size defined by  
            hbase.client.scanner.max.result.size  
            rather than be limited by a particular number of rows since the size of  
            rows varies  
            table to table. If you know ahead of time that you will not require more  
            than a certain  
            number of rows from a scan, this configuration should be set to that row  
            limit via  
            Scan#setCaching. Higher caching values will enable faster scanners but will eat up  
            more  
            memory and some calls of next may take longer and longer times when the  
            cache is empty.  
            Do not set this value such that the time between invocations is greater  
            than the scanner  
            timeout; i.e. hbase.client.scanner.timeout.period  
          
      
      
      
        hbase.client.keyvalue.maxsize  
        10485760  
        Specifies the combined maximum allowed size of a KeyValue  
            instance. This is to set an upper boundary for a single entry saved  
            in a  
            storage file. Since they cannot be split it helps avoiding that a region  
            cannot be split any further because the data is too large. It seems  
            wise  
            to set this to a fraction of the maximum region size. Setting it to  
            zero  
            or less disables the check.  
          
      
      
      
        hbase.client.scanner.timeout.period  
        60000  
        Client scanner lease period in milliseconds.  
          
      
      
        hbase.client.localityCheck.threadPoolSize  
        2  
      
      
      
        hbase.bulkload.retries.number  
        10  
        Maximum retries. This is maximum number of iterations  
            to atomic bulk loads are attempted in the face of splitting operations  
            0 means never give up.  
          
      
      
        hbase.balancer.period  
        300000  
        Period at which the region balancer runs in the Master.  
          
      
      
        hbase.normalizer.period  
        1800000  
        Period at which the region normalizer runs in the Master.  
          
      
      
      
        hbase.regions.slop  
        0.2  
        Rebalance if any regionserver has average + (average *  
            slop) regions.  
      
      
      
        hbase.server.thread.wakefrequency  
        10000  
        Time to sleep in between searches for work (in  
            milliseconds).  
            Used as sleep interval by service threads such as log roller.  
          
      
      
        hbase.server.versionfile.writeattempts  
        3  
          
            How many time to retry attempting to write a version file  
            before just aborting. Each attempt is seperated by the  
            hbase.server.thread.wakefrequency milliseconds.  
          
      
      
      
        hbase.hregion.memstore.flush.size  
        134217728  
          
            Memstore will be flushed to disk if size of the memstore  
            exceeds this number of bytes. Value is checked by a thread that runs  
            every hbase.server.thread.wakefrequency.  
          
      
      
        hbase.hregion.percolumnfamilyflush.size.lower.bound  
        16777216  
          
            If FlushLargeStoresPolicy is used, then every time that we hit the  
            total memstore limit, we find out all the column families whose  
            memstores  
            exceed this value, and only flush them, while retaining the others whose  
            memstores are lower than this limit. If none of the families have  
            their  
            memstore size more than this, all the memstores will be flushed  
            (just as usual). This value should be less than half of the total memstore  
            threshold (hbase.hregion.memstore.flush.size).  
          
      
      
      
        hbase.hregion.preclose.flush.size  
        5242880  
          
            If the memstores in a region are this size or larger when we go  
            to close, run a "pre-flush" to clear out memstores before we put up  
            the region closed flag and take the region offline. On close,  
            a flush is run under the close flag to empty memory. During  
            this time the region is offline and we are not taking on any writes.  
            If the memstore content is large, this flush could take a long time to  
            complete. The preflush is meant to clean out the bulk of the memstore  
            before putting up the close flag and taking the region offline so the  
            flush that runs under the close flag has little to do.  
          
      
      
      
        hbase.hregion.memstore.block.multiplier 
        4  
          
            Block updates if memstore has hbase.hregion.memstore.block.multiplier  
            times hbase.hregion.memstore.flush.size bytes. Useful preventing  
            runaway memstore during spikes in update traffic. Without an  
            upper-bound, memstore fills such that when it flushes the  
            resultant flush files take a long time to compact or split, or  
            worse, we OOME.  
          
      
      
      
        hbase.hregion.memstore.mslab.enabled  
        true  
          
            Enables the MemStore-Local Allocation Buffer,  
            a feature which works to prevent heap fragmentation under  
            heavy write loads. This can reduce the frequency of stop-the-world  
            GC pauses on large heaps.  
          
      
      
      
        hbase.hregion.max.filesize  
        10737418240  
          
            Maximum HStoreFile size. If any one of a column families' HStoreFiles has  
            grown to exceed this value, the hosting HRegion is split in two.  
          
      
      
      
        hbase.hregion.majorcompaction  
        604800000  
        The time (in miliseconds) between 'major' compactions of  
            all  
            HStoreFiles in a region. Default: Set to 7 days. Major compactions tend to  
            happen exactly when you need them least so enable them such that they  
            run at  
            off-peak for your deploy; or, since this setting is on a periodicity that is  
            unlikely to match your loading, run the compactions via an external  
            invocation out of a cron job or some such.  
          
      
      
      
        hbase.hregion.majorcompaction.jitter  
        0.50  
        Jitter outer bound for major compactions.  
            On each regionserver, we multiply the hbase.region.majorcompaction  
            interval by some random fraction that is inside the bounds of this  
            maximum. We then add this + or - product to when the next  
            major compaction is to run. The idea is that major compaction  
            does happen on every regionserver at exactly the same time. The  
            smaller this number, the closer the compactions come together.  
          
      
      
      
        hbase.hstore.compactionThreshold  
        3  
          
            If more than this number of HStoreFiles in any one HStore  
            (one HStoreFile is written per flush of memstore) then a compaction  
            is run to rewrite all HStoreFiles files as one. Larger numbers  
            put off compaction but when it runs, it takes longer to complete.  
          
      
      
      
        hbase.hstore.flusher.count  
        2  
          
            The number of flush threads. With less threads, the memstore flushes  
            will be queued. With  
            more threads, the flush will be executed in parallel, increasing the hdfs  
            load. This can  
            lead as well to more compactions.  
          
      
      
      
        hbase.hstore.blockingStoreFiles  
        10  
          
            If more than this number of StoreFiles in any one Store  
            (one StoreFile is written per flush of MemStore) then updates are  
            blocked for this HRegion until a compaction is completed, or  
            until hbase.hstore.blockingWaitTime has been exceeded.  
          
      
      
      
        hbase.hstore.blockingWaitTime  
        90000  
          
            The time an HRegion will block updates for after hitting the StoreFile  
            limit defined by hbase.hstore.blockingStoreFiles.  
            After this time has elapsed, the HRegion will stop blocking updates even  
            if a compaction has not been completed.  
          
      
      
      
        hbase.hstore.compaction.max  
        10  
        Max number of HStoreFiles to compact per 'minor'  
            compaction.  
      
      
      
        hbase.hstore.compaction.kv.max  
        10  
        How many KeyValues to read and then write in a batch when  
            flushing  
            or compacting. Do less if big KeyValues and problems with OOME.  
            Do more if wide, small rows.  
          
      
      
        hbase.hstore.time.to.purge.deletes  
        0  
        The amount of time to delay purging of delete markers  
            with future timestamps. If  
            unset, or set to 0, all delete markers, including those with future  
            timestamps, are purged  
            during the next major compaction. Otherwise, a delete marker is kept until  
            the major compaction  
            which occurs after the marker's timestamp plus the value of this setting,  
            in milliseconds.  
          
      
      
        hbase.storescanner.parallel.seek.enable  
        false  
          
            Enables StoreFileScanner parallel-seeking in StoreScanner,  
            a feature which can reduce response latency under special conditions.  
          
      
      
        hbase.storescanner.parallel.seek.threads  
        10  
          
            The default thread pool size if parallel-seeking feature enabled.  
          
      
      
      
        hfile.block.cache.size  
        0.4  
        Percentage of maximum heap (-Xmx setting) to allocate to  
            block cache  
            used by HFile/StoreFile. Default of 0.4 means allocate 40%.  
            Set to 0 to disable but it's not recommended; you need at least  
            enough cache to hold the storefile indices.  
          
      
      
        hfile.block.index.cacheonwrite  
        false  
        This allows to put non-root multi-level index blocks into  
            the block  
            cache at the time the index is being written.  
          
      
      
        hfile.index.block.max.size  
        131072  
        When the size of a leaf-level, intermediate-level, or  
            root-level  
            index block in a multi-level block index grows to this size, the  
            block is written out and a new block is started.  
          
      
      
      
        hbase.bucketcache.ioengine  
          
        Where to store the contents of the bucketcache. One of:  
            heap,  
            offheap, or file. If a file, set it to file:PATH_TO_FILE. See  
            http://hbase.apache.org/book.html#offheap.blockcache for more  
            information.  
          
      
      
      
        hbase.bucketcache.combinedcache.enabled  
        true  
        Whether or not the bucketcache is used in league with the  
            LRU  
            on-heap block cache. In this mode, indices and blooms are kept in the LRU  
            blockcache and the data blocks are kept in the bucketcache.  
          
      
      
      
        hbase.bucketcache.size  
          
        A float that EITHER represents a percentage of total heap  
            memory  
            size to give to the cache (if < 1.0) OR, it is the total capacity in  
            megabytes of BucketCache. Default: 0.0  
          
      
      
        hbase.bucketcache.sizes  
          
        A comma-separated list of sizes for buckets for the  
            bucketcache.  
            Can be multiple sizes. List block sizes in order from smallest to  
            largest.  
            The sizes you use will depend on your data access patterns.  
            Must be a multiple of 1024 else you will run into  
            'java.io.IOException: Invalid HFile block magic' when you go to read from cache.  
            If you specify no values here, then you pick up the default bucketsizes  
            set  
            in code (See BucketAllocator#DEFAULT_BUCKET_SIZES).  
          
      
      
        hfile.format.version  
        3  
        The HFile format version to use for new files.  
            Version 3 adds support for tags in hfiles (See  
            http://hbase.apache.org/book.html#hbase.tags).  
            Distributed Log Replay requires that tags are enabled. Also see the  
            configuration  
            'hbase.replication.rpc.codec'.  
          
      
      
        hfile.block.bloom.cacheonwrite  
        false  
        Enables cache-on-write for inline blocks of a compound  
            Bloom filter.  
      
      
        io.storefile.bloom.block.size  
        131072  
        The size in bytes of a single block ("chunk") of a  
            compound Bloom  
            filter. This size is approximate, because Bloom blocks can only be  
            inserted at data block boundaries, and the number of keys per data  
            block varies.  
          
      
      
        hbase.rs.cacheblocksonwrite  
        false  
        Whether an HFile block should be added to the block cache  
            when the  
            block is finished.  
          
      
      
      
        hbase.rpc.timeout  
        60000  
        This is for the RPC layer to define how long  
            (millisecond) HBase client applications  
            take for a remote call to time out. It uses pings to check connections  
            but will eventually throw a TimeoutException.  
          
      
      
      
        hbase.client.operation.timeout  
        1200000  
        Operation timeout is a top-level restriction  
            (millisecond) that makes sure a  
            blocking operation in Table will not be blocked more than this. In each  
            operation, if rpc  
            request fails because of timeout or other reason, it will retry until  
            success or throw  
            RetriesExhaustedException. But if the total time being blocking reach the operation timeout  
            before retries exhausted, it will break early and throw  
            SocketTimeoutException.  
          
      
      
        hbase.cells.scanned.per.heartbeat.check  
        10000  
        The number of cells scanned in between heartbeat checks.  
            Heartbeat  
            checks occur during the processing of scans to determine whether or not the  
            server should stop scanning in order to send back a heartbeat message  
            to the  
            client. Heartbeat messages are used to keep the client-server connection  
            alive  
            during long running scans. Small values mean that the heartbeat checks will  
            occur more often and thus will provide a tighter bound on the  
            execution time of  
            the scan. Larger values mean that the heartbeat checks occur less  
            frequently  
          
      
      
        hbase.rpc.shortoperation.timeout  
        10000  
        This is another version of "hbase.rpc.timeout". For those  
            RPC operation  
            within cluster, we rely on this configuration to set a short timeout  
            limitation  
            for short operation. For example, short rpc timeout for region server's  
            trying  
            to report to active master can benefit quicker master failover process.  
          
      
      
        hbase.ipc.client.tcpnodelay  
        true  
        Set no delay on rpc socket connections. See  
            http://docs.oracle.com/javase/1.5.0/docs/api/java/net/Socket.html#getTcpNoDelay()  
          
      
      
        hbase.regionserver.hostname  
          
        This config is for experts: don't set its value unless  
            you really know what you are doing.  
            When set to a non-empty value, this represents the (external facing)  
            hostname for the underlying server.  
            See https://issues.apache.org/jira/browse/HBASE-12954 for details.  
          
      
      
      
        hbase.master.keytab.file  
          
        Full path to the kerberos keytab file to use for logging  
            in  
            the configured HMaster server principal.  
          
      
      
        hbase.master.kerberos.principal  
          
        Ex. "hbase/[email protected]". The kerberos principal  
            name  
            that should be used to run the HMaster process. The principal name should  
            be in the form: user/hostname@DOMAIN. If "_HOST" is used as the  
            hostname  
            portion, it will be replaced with the actual hostname of the running  
            instance.  
          
      
      
        hbase.regionserver.keytab.file  
          
        Full path to the kerberos keytab file to use for logging  
            in  
            the configured HRegionServer server principal.  
          
      
      
        hbase.regionserver.kerberos.principal  
          
        Ex. "hbase/[email protected]". The kerberos principal  
            name  
            that should be used to run the HRegionServer process. The principal name  
            should be in the form: user/hostname@DOMAIN. If "_HOST" is used as  
            the  
            hostname portion, it will be replaced with the actual hostname of the  
            running instance. An entry for this principal must exist in the file  
            specified in hbase.regionserver.keytab.file  
          
      
      
      
        hadoop.policy.file  
        hbase-policy.xml  
        The policy configuration file used by RPC servers to make  
            authorization decisions on client requests. Only used when HBase  
            security is enabled.  
          
      
      
        hbase.superuser  
          
        List of users or groups (comma-separated), who are  
            allowed  
            full privileges, regardless of stored ACLs, across the cluster.  
            Only used when HBase security is enabled.  
          
      
      
        hbase.auth.key.update.interval  
        86400000  
        The update interval for master key for authentication  
            tokens  
            in servers in milliseconds. Only used when HBase security is enabled.  
          
      
      
        hbase.auth.token.max.lifetime  
        604800000  
        The maximum lifetime in milliseconds after which an  
            authentication token expires. Only used when HBase security is  
            enabled.  
          
      
      
        hbase.ipc.client.fallback-to-simple-auth-allowed  
        false  
        When a client is configured to attempt a secure  
            connection, but attempts to  
            connect to an insecure server, that server may instruct the client to  
            switch to SASL SIMPLE (unsecure) authentication. This setting controls  
            whether or not the client will accept this instruction from the  
            server.  
            When false (the default), the client will not allow the fallback to  
            SIMPLE  
            authentication, and will abort the connection.  
          
      
      
        hbase.ipc.server.fallback-to-simple-auth-allowed  
        false  
        When a server is configured to require secure  
            connections, it will  
            reject connection attempts from clients using SASL SIMPLE (unsecure)  
            authentication.  
            This setting allows secure servers to accept SASL SIMPLE connections from  
            clients  
            when the client requests. When false (the default), the server will not  
            allow the fallback  
            to SIMPLE authentication, and will reject the connection. WARNING: This  
            setting should ONLY  
            be used as a temporary measure while converting clients over to secure  
            authentication. It  
            MUST BE DISABLED for secure operation.  
          
      
      
        hbase.coprocessor.enabled  
        true  
        Enables or disables coprocessor loading. If 'false'  
            (disabled), any other coprocessor related configuration will be  
            ignored.  
          
      
      
        hbase.coprocessor.user.enabled  
        true  
        Enables or disables user (aka. table) coprocessor  
            loading.  
            If 'false' (disabled), any table coprocessor attributes in table  
            descriptors will be ignored. If "hbase.coprocessor.enabled" is  
            'false'  
            this setting has no effect.  
          
      
      
        hbase.coprocessor.region.classes  
          
        A comma-separated list of Coprocessors that are loaded by  
            default on all tables. For any override coprocessor method, these  
            classes  
            will be called in order. After implementing your own Coprocessor, just  
            put  
            it in HBase's classpath and add the fully qualified class name here.  
            A coprocessor can also be loaded on demand by setting  
            HTableDescriptor.  
          
      
      
        hbase.rest.port  
        8080  
        The port for the HBase REST server.  
      
      
        hbase.rest.readonly  
        false  
        Defines the mode the REST server will be started in.  
            Possible values are:  
            false: All HTTP methods are permitted - GET/PUT/POST/DELETE.  
            true: Only the GET method is permitted.  
          
      
      
        hbase.rest.threads.max  
        100  
        The maximum number of threads of the REST server thread  
            pool.  
            Threads in the pool are reused to process REST requests. This  
            controls the maximum number of requests processed concurrently.  
            It may help to control the memory used by the REST server to  
            avoid OOM issues. If the thread pool is full, incoming requests  
            will be queued up and wait for some free threads.  
          
      
      
        hbase.rest.threads.min  
        2  
        The minimum number of threads of the REST server thread  
            pool.  
            The thread pool always has at least these number of threads so  
            the REST server is ready to serve incoming requests.  
          
      
      
        hbase.rest.support.proxyuser  
        false  
        Enables running the REST server to support proxy-user  
            mode.  
      
      
        hbase.defaults.for.version  
        1.2.3  
        This defaults file was compiled for version  
            ${project.version}. This variable is used  
            to make sure that a user doesn't have an old version of  
            hbase-default.xml on the  
            classpath.  
          
      
      
        hbase.defaults.for.version.skip  
        false  
        Set to true to skip the 'hbase.defaults.for.version'  
            check.  
            Setting this to true can be useful in contexts other than  
            the other side of a maven generation; i.e. running in an  
            ide. You'll want to set this boolean to true to avoid  
            seeing the RuntimException complaint: "hbase-default.xml file  
            seems to be for and old version of HBase (\${hbase.version}), this  
            version is X.X.X-SNAPSHOT"  
          
      
      
        hbase.coprocessor.master.classes  
          
        A comma-separated list of  
            org.apache.hadoop.hbase.coprocessor.MasterObserver coprocessors that  
            are  
            loaded by default on the active HMaster process. For any implemented  
            coprocessor methods, the listed classes will be called in order.  
            After  
            implementing your own MasterObserver, just put it in HBase's classpath  
            and add the fully qualified class name here.  
          
      
      
        hbase.coprocessor.abortonerror  
        true  
        Set to true to cause the hosting server (master or  
            regionserver)  
            to abort if a coprocessor fails to load, fails to initialize, or throws  
            an  
            unexpected Throwable object. Setting this to false will allow the server to  
            continue execution but the system wide state of the coprocessor in  
            question  
            will become inconsistent as it will be properly executing in only a  
            subset  
            of servers, so this is most useful for debugging only.  
          
      
      
        hbase.online.schema.update.enable  
        true  
        Set true to enable online schema changes.  
      
      
        hbase.table.lock.enable  
        true  
        Set to true to enable locking the table in zookeeper for  
            schema change operations.  
            Table locking from master prevents concurrent schema modifications to  
            corrupt table  
            state.  
          
      
      
      
        hbase.table.max.rowsize  
        1073741824  
          
            Maximum size of single row in bytes (default is 1 Gb) for Get'ting  
            or Scan'ning without in-row scan flag set. If row size exceeds this  
            limit  
            RowTooBigException is thrown to client.  
          
      
      
        hbase.thrift.minWorkerThreads  
        16  
        The "core size" of the thread pool. New threads are  
            created on every  
            connection until this many threads are created.  
          
      
      
        hbase.thrift.maxWorkerThreads  
        1000  
        The maximum size of the thread pool. When the pending  
            request queue  
            overflows, new threads are created until their number reaches this number.  
            After that, the server starts dropping connections.  
          
      
      
        hbase.thrift.maxQueuedRequests  
        1000  
        The maximum number of pending Thrift connections waiting  
            in the queue. If  
            there are no idle threads in the pool, the server queues requests. Only  
            when the queue overflows, new threads are added, up to  
            hbase.thrift.maxQueuedRequests threads.  
          
      
      
        hbase.thrift.htablepool.size.max  
        1000  
        The upper bound for the table pool used in the Thrift  
            gateways server.  
            Since this is per table name, we assume a single table and so with 1000  
            default  
            worker threads max this is set to a matching number. For other workloads  
            this number  
            can be adjusted as needed.  
          
      
      
        hbase.regionserver.thrift.framed  
        false  
        Use Thrift TFramedTransport on the server side.  
            This is the recommended transport for thrift servers and requires a  
            similar setting  
            on the client side. Changing this to false will select the default  
            transport,  
            vulnerable to DoS when malformed requests are issued due to THRIFT-601.  
          
      
      
        hbase.regionserver.thrift.framed.max_frame_size_in_mb  
        2  
        Default frame size when using framed transport  
          
      
      
        hbase.regionserver.thrift.compact  
        false  
        Use Thrift TCompactProtocol binary serialization  
            protocol.  
      
      
        hbase.rootdir.perms  
        700  
        FS Permissions for the root directory in a  
            secure(kerberos) setup.  
            When master starts, it creates the rootdir with this permissions or sets  
            the permissions  
            if it does not match.  
          
      
      
        hbase.data.umask.enable  
        false  
        Enable, if true, that file permissions should be assigned  
            to the files written by the regionserver  
          
      
      
        hbase.data.umask  
        000  
        File permissions that should be used to write data  
            files when hbase.data.umask.enable is true  
          
      
      
        hbase.metrics.showTableName  
        true  
        Whether to include the prefix "tbl.tablename" in  
            per-column family metrics.  
            If true, for each metric M, per-cf metrics will be reported for  
            tbl.T.cf.CF.M, if false,  
            per-cf metrics will be aggregated by column-family across tables, and  
            reported for cf.CF.M.  
            In both cases, the aggregated metric M across tables and cfs will be  
            reported.  
          
      
      
        hbase.metrics.exposeOperationTimes  
        true  
        Whether to report metrics about time taken performing an  
            operation on the region server. Get, Put, Delete, Increment, and  
            Append can all  
            have their times exposed through Hadoop metrics per CF and per region.  
          
      
      
    -->  
      
        hbase.snapshot.enabled  
        true  
        Set to true to allow snapshots to be taken / restored /  
            cloned.  
      
      
      
        hbase.snapshot.restore.take.failsafe.snapshot  
        true  
        Set to true to take a snapshot before the restore  
            operation.  
            The snapshot taken will be used in case of failure, to restore the  
            previous state.  
            At the end of the restore operation this snapshot will be deleted  
          
      
      
        hbase.snapshot.restore.failsafe.name  
        hbase-failsafe-{snapshot.name}-{restore.timestamp}  
        Name of the failsafe snapshot taken by the restore  
            operation.  
            You can use the {snapshot.name}, {table.name} and {restore.timestamp}  
            variables  
            to create a name based on what you are restoring.  
          
      
      
      
        hbase.server.compactchecker.interval.multiplier  
        1000  
        The number that determines how often we scan to see if  
            compaction is necessary.  
            Normally, compactions are done after some events (such as memstore flush), but  
            if  
            region didn't receive a lot of writes for some time, or due to different  
            compaction  
            policies, it may be necessary to check it periodically. The interval between  
            checks is  
            hbase.server.compactchecker.interval.multiplier multiplied by  
            hbase.server.thread.wakefrequency.  
          
      
      
        hbase.lease.recovery.timeout  
        900000  
        How long we wait on dfs lease recovery in total before  
            giving up.  
      
      
        hbase.lease.recovery.dfs.timeout  
        64000  
        How long between dfs recover lease invocations. Should be  
            larger than the sum of  
            the time it takes for the namenode to issue a block recovery command as  
            part of  
            datanode; dfs.heartbeat.interval and the time it takes for the primary  
            datanode, performing block recovery to timeout on a dead datanode;  
            usually  
            dfs.client.socket-timeout. See the end of HBASE-8389 for more.  
          
      
      
      
        hbase.column.max.version  
        1  
        New column family descriptors will use this value as the  
            default number of versions  
            to keep.  
          
      
      
        hbase.dfs.client.read.shortcircuit.buffer.size  
        131072  
        If the DFSClient configuration  
            dfs.client.read.shortcircuit.buffer.size is unset, we will  
            use what is configured here as the short circuit read default  
            direct byte buffer size. DFSClient native default is 1MB; HBase  
            keeps its HDFS files open so number of file blocks * 1MB soon  
            starts to add up and threaten OOME because of a shortage of  
            direct memory. So, we set it down from the default. Make  
            it > the default hbase block size set in the HColumnDescriptor  
            which is usually 64k.  
          
      
      
        hbase.regionserver.checksum.verify  
        true  
          
            If set to true (the default), HBase verifies the checksums for hfile  
            blocks. HBase writes checksums inline with the data when it writes  
            out  
            hfiles. HDFS (as of this writing) writes checksums to a separate file  
            than the data file necessitating extra seeks. Setting this flag saves  
            some on i/o. Checksum verification by HDFS will be internally  
            disabled  
            on hfile streams when this flag is set. If the hbase-checksum  
            verification  
            fails, we will switch back to using HDFS checksums (so do not disable HDFS  
            checksums! And besides this feature applies to hfiles only, not to  
            WALs).  
            If this parameter is set to false, then hbase will not verify any  
            checksums,  
            instead it will depend on checksum verification being done in the HDFS  
            client.  
          
      
      
        hbase.hstore.bytes.per.checksum  
        16384  
          
            Number of bytes in a newly created checksum chunk for HBase-level  
            checksums in hfile blocks.  
          
      
      
        hbase.hstore.checksum.algorithm  
        CRC32C  
          
            Name of an algorithm that is used to compute checksums. Possible values  
            are NULL, CRC32, CRC32C.  
          
      
      
      
        hbase.client.scanner.max.result.size  
        2097152  
        Maximum number of bytes returned when calling a scanner's  
            next method.  
            Note that when a single row is larger than this limit the row is still  
            returned completely.  
            The default value is 2MB, which is good for 1ge networks.  
            With faster and/or high latency networks this value should be increased.  
          
      
      
      
        hbase.server.scanner.max.result.size  
        104857600  
        Maximum number of bytes returned when calling a scanner's  
            next method.  
            Note that when a single row is larger than this limit the row is still  
            returned completely.  
            The default value is 100MB.  
            This is a safety setting to protect the server from OOM situations.  
          
      
      
        hbase.status.published  
        false  
          
            This setting activates the publication by the master of the status of the  
            region server.  
            When a region server dies and its recovery starts, the master will push  
            this information  
            to the client application, to let them cut the connection immediately  
            instead of waiting  
            for a timeout.  
          
      
      
        hbase.status.publisher.class  
        org.apache.hadoop.hbase.master.ClusterStatusPublisher$MulticastPublisher  
          
          
            Implementation of the status publication with a multicast message.  
          
      
      
        hbase.status.listener.class  
        org.apache.hadoop.hbase.client.ClusterStatusListener$MulticastListener  
          
          
            Implementation of the status listener with a multicast message.  
          
      
      
        hbase.status.multicast.address.ip  
        226.1.1.3  
          
            Multicast address to use for the status publication by multicast.  
          
      
      
        hbase.status.multicast.address.port  
        16100  
          
            Multicast port to use for the status publication by multicast.  
          
      
  
      
        hbase.dynamic.jars.dir  
        ${hbase.rootdir}/lib  
          
            The directory from which the custom filter/co-processor jars can be  
            loaded  
            dynamically by the region server without the need to restart. However,  
            an already loaded filter/co-processor class would not be un-loaded. See  
            HBASE-1936 for more details.  
          
      
      
        hbase.security.authentication  
        simple  
          
            Controls whether or not secure authentication is enabled for HBase.  
            Possible values are 'simple' (no authentication), and 'kerberos'.  
          
      
      
        hbase.rest.filter.classes  
        org.apache.hadoop.hbase.rest.filter.GzipFilter  
          
            Servlet filters for REST service.  
          
      
      
        hbase.master.loadbalancer.class  
        org.apache.hadoop.hbase.master.balancer.StochasticLoadBalancer  
          
          
            Class used to execute the regions balancing when the period occurs.  
            See the class comment for more on how it works  
            http://hbase.apache.org/devapidocs/org/apache/hadoop/hbase/master/balancer/StochasticLoadBalancer.html  
            It replaces the DefaultLoadBalancer as the default (since renamed  
            as the SimpleLoadBalancer).  
          
      
      
        hbase.security.exec.permission.checks  
        false  
          
            If this setting is enabled and ACL based access control is active (the  
            AccessController coprocessor is installed either as a system  
            coprocessor  
            or on a table as a table coprocessor) then you must grant all relevant  
            users EXEC privilege if they require the ability to execute  
            coprocessor  
            endpoint calls. EXEC privilege, like any other permission, can be  
            granted globally to a user, or to a user on a per table or per namespace  
            basis. For more information on coprocessor endpoints, see the  
            coprocessor  
            section of the HBase online manual. For more information on granting or  
            revoking permissions using the AccessController, see the security  
            section of the HBase online manual.  
          
      
      
        hbase.procedure.regionserver.classes  
          
        A comma-separated list of  
            org.apache.hadoop.hbase.procedure.RegionServerProcedureManager  
            procedure managers that are  
            loaded by default on the active HRegionServer process. The lifecycle  
            methods (init/start/stop)  
            will be called by the active HRegionServer process to perform the  
            specific globally barriered  
            procedure. After implementing your own RegionServerProcedureManager, just put  
            it in  
            HBase's classpath and add the fully qualified class name here.  
          
      
      
        hbase.procedure.master.classes  
          
        A comma-separated list of  
            org.apache.hadoop.hbase.procedure.MasterProcedureManager procedure  
            managers that are  
            loaded by default on the active HMaster process. A procedure is identified  
            by its signature and  
            users can use the signature and an instant name to trigger an execution of  
            a globally barriered  
            procedure. After implementing your own MasterProcedureManager, just put it in  
            HBase's classpath  
            and add the fully qualified class name here.  
          
      
      
        hbase.coordinated.state.manager.class  
        org.apache.hadoop.hbase.coordination.ZkCoordinatedStateManager  
          
        Fully qualified name of class implementing coordinated  
            state manager.  
      
      
        hbase.regionserver.storefile.refresh.period  
        0  
          
            The period (in milliseconds) for refreshing the store files for the  
            secondary regions. 0  
            means this feature is disabled. Secondary regions sees new files (from  
            flushes and  
            compactions) from primary once the secondary region refreshes the list of files  
            in the  
            region (there is no notification mechanism). But too frequent refreshes  
            might cause  
            extra Namenode pressure. If the files cannot be refreshed for longer than  
            HFile TTL  
            (hbase.master.hfilecleaner.ttl) the requests are rejected. Configuring HFile TTL to a larger  
            value is also recommended with this setting.  
          
      
      
        hbase.region.replica.replication.enabled  
        false  
          
            Whether asynchronous WAL replication to the secondary region replicas is  
            enabled or not.  
            If this is enabled, a replication peer named  
            "region_replica_replication" will be created  
            which will tail the logs and replicate the mutatations to region replicas  
            for tables that  
            have region replication > 1. If this is enabled once, disabling this  
            replication also  
            requires disabling the replication peer using shell or ReplicationAdmin java  
            class.  
            Replication to secondary region replicas works over standard inter-cluster  
            replication.  
            So replication, if disabled explicitly, also has to be enabled by  
            setting "hbase.replication"  
            to true for this feature to work.  
          
      
      
        hbase.http.filter.initializers  
        org.apache.hadoop.hbase.http.lib.StaticUserWebFilter  
          
            A comma separated list of class names. Each class in the list must  
            extend  
            org.apache.hadoop.hbase.http.FilterInitializer. The corresponding Filter will  
            be initialized. Then, the Filter will be applied to all user facing jsp  
            and servlet web pages.  
            The ordering of the list defines the ordering of the filters.  
            The default StaticUserWebFilter add a user principal as defined by the  
            hbase.http.staticuser.user property.  
          
      
      
        hbase.security.visibility.mutations.checkauths  
        false  
          
            This property if enabled, will check whether the labels in the visibility  
            expression are associated  
            with the user issuing the mutation  
          
      
      
        hbase.http.max.threads  
        10  
          
            The maximum number of threads that the HTTP Server will create in its  
            ThreadPool.  
          
      
      
        hbase.replication.rpc.codec  
        org.apache.hadoop.hbase.codec.KeyValueCodecWithTags  
          
            The codec that is to be used when replication is enabled so that  
            the tags are also replicated. This is used along with HFileV3 which  
            supports tags in them. If tags are not used or if the hfile version  
            used  
            is HFileV2 then KeyValueCodec can be used as the replication codec.  
            Note that  
            using KeyValueCodecWithTags for replication when there are no tags causes  
            no harm.  
          
      
      
        hbase.replication.source.maxthreads  
        10  
          
            The maximum number of threads any replication source will use for  
            shipping edits to the sinks in parallel. This also limits the number  
            of  
            chunks each replication batch is broken into.  
            Larger values can improve the replication throughput between the master and  
            slave clusters. The default of 10 will rarely need to be changed.  
          
      
      
      
          
            The user name to filter as, on static web filters  
            while rendering content. An example use is the HDFS  
            web UI (user to be used for browsing files).  
          
        hbase.http.staticuser.user  
        dr.stack  
      
      
        hbase.master.normalizer.class  
        org.apache.hadoop.hbase.master.normalizer.SimpleRegionNormalizer  
          
          
            Class used to execute the region normalization when the period occurs.  
            See the class comment for more on how it works  
            http://hbase.apache.org/devapidocs/org/apache/hadoop/hbase/master/normalizer/SimpleRegionNormalizer.html  
          
      
      
        hbase.regionserver.handler.abort.on.error.percent  
        0.5  
        The percent of region server RPC threads failed to abort  
            RS.  
            -1 Disable aborting; 0 Abort if even a single handler has died;  
            0.x Abort only when this percent of handlers have died;  
            1 Abort only all of the handers have died.  
          
      
      
        hbase.snapshot.master.timeout.millis  
        300000  
          
            Timeout for master for the snapshot procedure execution  
          
      
      
        hbase.snapshot.region.timeout  
        300000  
          
            Timeout for regionservers to keep threads in snapshot request pool waiting

你可能感兴趣的:(Hadoop生态体系,数据库及数据仓库)

解析XML文件及QTableWidget示例 ctrigger xml
解析XML文件及QTableWidget示例#include"mainwindow.h"#include"ui_mainwindow.h"#include#include#includeMainWindow::MainWindow(QWidget*parent):QMainWindow(parent),ui(newUi::MainWindow){ui->setupUi(this);setWindo
比较分析：Windsurf、Cody、Cline、Roo Cline、Copilot 和通义灵码张3蜂开源编程语言与开发技术选型与架构设计 copilot c#AI编程
随着人工智能技术的快速发展，开发者工具变得越来越智能化，特别是在代码生成、辅助编程等领域，市面上涌现了多种AI驱动的工具。本文将从开源性、集成能力、功能覆盖范围、支持的编程语言、生态兼容性、成本、学习曲线、响应速度、离线支持以及与.NETCore的适配性等十个维度对以下几种产品进行比较：Windsurf、Cody、Cline、RooCline、Copilot和通义灵码。1.开源性Windsurf:
每日一题--内存池秋凉づᐇ java 开发语言
内存池（MemoryPool）是一种高效的内存管理技术，通过预先分配并自主管理内存块，减少频繁申请/释放内存的系统开销，提升程序性能。它是高性能编程（如游戏引擎、数据库、网络服务器）中的核心优化手段。内存池的核心原理预先分配：初始化时一次性申请一大块内存（称为“池”），避免程序运行时频繁调用malloc/new。自主管理：将大块内存划分为多个固定或可变大小的内存单元，由程序自行分配和回收。复用机制
如何使用PHP爬虫根据关键词获取Shopee商品列表？数据小爬虫@ php 爬虫 android
在跨境电商领域，Shopee作为东南亚及中国台湾地区领先的电商平台，拥有海量的商品信息。无论是进行市场调研、数据分析，还是寻找热门商品，根据关键词获取Shopee商品列表都是一项极具价值的任务。然而，手动浏览和整理这些信息显然是低效且容易出错的。幸运的是，通过编写PHP爬虫程序，我们可以高效地完成这一任务。本文将详细介绍如何利用PHP爬虫根据关键词获取Shopee商品列表，并提供完整的代码示例。一
【PTA-数据库】《数据库原理与应用B》第二章选择题 .Phoenix. 《数据库原理与应用B》第二章数据库
1.关系模型的数据结构非常简单，只包含单一的数据结构——____C____。A.元组B.属性C.关系D.分量2____A____是一组具有相同数据类型的值的集合。A.域B.属性C.分量D.元组3.一个域允许的不同取值个数称为这个域的___D_____。A.分量B.目C.度D.基数4.若D1域的基数为2，D2域的基数为3，D3域的基数为4，则D1、D2、D3的笛卡尔积的基数为___C_____。A.
如何使用PHP爬虫获取Shopee（虾皮）商品详情？数据小爬虫@ php 爬虫开发语言
在跨境电商领域，Shopee（虾皮）作为东南亚及中国台湾地区领先的电商平台，拥有海量的商品信息。无论是进行市场调研、数据分析，还是寻找热门商品，获取Shopee商品详情都是一项极具价值的任务。然而，手动浏览和整理这些信息显然是低效且容易出错的。幸运的是，通过编写PHP爬虫程序，我们可以高效地完成这一任务。本文将详细介绍如何利用PHP爬虫获取Shopee商品详情，并提供完整的代码示例。一、为什么选择
便民服务一体化的智慧园区开源了 AI服务老曹音视频人工智能自动化运维能源开源
智慧园区场景视频监控平台是一款功能强大且简单易用的实时算法视频监控系统。它的愿景是最底层打通各大芯片厂商相互间的壁垒，省去繁琐重复的适配流程，实现芯片、算法、应用的全流程组合，从而大大减少企业级应用约95%的开发成本。充分利用现有的摄像头设备，无需大规模更换，降低成本同时提升系统的实施效率。用户只需在界面上进行简单的操作，就可以实现全视频的接入及布控。项目搭建地址基础项目搭建地址：yihecode
实现物流行业数字化、智能化管理的新型模式的智慧物流开源了 AI服务老曹开源能源人工智能云计算安全
智慧物流视频监控平台是一款功能强大且简单易用的实时算法视频监控系统。它的愿景是最底层打通各大芯片厂商相互间的壁垒，省去繁琐重复的适配流程，实现芯片、算法、应用的全流程组合，从而大大减少企业级应用约95%的开发成本。构建基于Ai技术的安全监管平台，可逐步实现智能化巡检，针对安全事故隐患进行有效监控预警，降低安全违规行为发生率，节省人工监管成本。用户只需在界面上进行简单的操作，就可以实现全视频的接入及
全流程数字化管理的智慧物流开源了 AI服务老曹开源科技生活人工智能自动化
智慧物流视频监控平台是一款功能强大且简单易用的实时算法视频监控系统。它的愿景是最底层打通各大芯片厂商相互间的壁垒，省去繁琐重复的适配流程，实现芯片、算法、应用的全流程组合，从而大大减少企业级应用约95%的开发成本。构建基于Ai技术的安全监管平台，可逐步实现智能化巡检，针对安全事故隐患进行有效监控预警，降低安全违规行为发生率，节省人工监管成本。用户只需在界面上进行简单的操作，就可以实现全视频的接入及
MCP协议 zhurui_xiaozhuzaizai 入口集锦人工智能自然语言处理
1什么是MCP？MCP（ModelContextProtocol，模型上下文协议）是由Anthropic推出的一种开放标准，旨在统一大型语言模型（LLM）与外部数据源和工具之间的通信协议。MCP的主要目的在于解决当前AI模型因数据孤岛限制而无法充分发挥潜力的难题，MCP使得AI应用能够安全地访问和操作本地及远程数据，为AI应用提供了连接万物的接口。1.1MCP与functioncallMCP是在O
降低成本、提高效率的智慧能源开源了。 ai产品老杨 vue.js 前端 javascript 人工智能安全
一、简介AI视频监控平台,是一款功能强大且简单易用的实时算法视频监控系统。愿景在最底层打通各大芯片厂商相互间的壁垒，省去繁琐重复的适配流程，实现芯片、算法、应用的全流程组合，减少企业级应用约95%的开发成本，在强大视频算法加持下的AR使得远程培训和远程操作指导不仅仅能够实现前后场的简单互动，而且能够实现人机结合，最终实现整个巡检流程的标准化。用户仅需在界面上简单操作，即可实现全视频的接入及布控。通
HTML网页图像标签齐天大荒 HTML html 前端 css
HTML网页图像标签常见的图像格式JPGGIFPNGBMP…一、标签的定义及用法在html中，标签是使用来在网页中嵌入一幅图像。从技术上讲，图像并不是插入到网页中，而是链接到网页中，标签的作用是为被引用的图像创建占位符。标签在网页中很常用，比如，引入一个logo图片、按钮背景图片、工具图标等等。只要是有图片的地方，源代码中基本都有标签（除一些背景图片以外）。二、标签语法格式说明：src属性是用来指
深度学习模型性能全景评估与优化指南 niuTaylor 深度学习人工智能
深度学习模型性能全景评估与优化指南一、算力性能指标体系1.核心算力指标对比指标计算方式适用场景硬件限制TOPS(TeraOperationsPerSecond)每秒万亿次整数运算量化模型推理NVIDIAJetsonNano仅支持FP16/FP32TFLOPS(TeraFLoating-pointOPerationsperSecond)TFLOPS=Cores×FLOPs/Cycle×Frequen
【系统架构设计师-2018年】案例分析-答案及详解数据知道系统架构软考高级系统架构设计师
试题一（25分）阅读以下关于软件系统设计的叙述，在答题纸上回答问题1至问题3。【说明】某文化产业集团委托软件公司开发一套文化用品商城系统，业务涉及文化用品销售、定制、竞拍和点评等板块，以提升商城的信息化建设水平。该软件公司组织项目组完成了需求调研，现已进入到系统架构设计阶段。考虑到系统需求对架构设计决策的影响，项目组先列出了可能影响系统架构设计的部分需求如下：（a）用户界面支持用户的个性化定制；（
React 18 如何定义变量，及赋值与渲染痴心阿文 React react.js javascript 前端
React18中，定义变量、赋值和渲染的方式因变量的用途和作用域不同而有所差异，下面为你详细介绍不同场景下的实现方法。1.函数组件内定义普通变量在函数组件里，你可以像在普通JavaScript函数中一样定义变量，并且这些变量会在每次组件重新渲染时重新创建。importReactfrom'react';constMyComponent=()=>{//定义普通变量并赋值constmessage='He
DNS污染：网络世界的“隐形劫持”与防御 dns劫持dns网络安全
在互联网的底层架构中，DNS（域名系统）如同数字世界的“导航员”，将用户输入的域名翻译成机器可读的IP地址。然而，DNS污染（DNSPoisoning）正像一场无声的“地址篡改”危机，威胁着全球网络的安全与稳定。本文将深入拆解DNS污染的技术原理、现实危害及应对策略，帮助个人与企业构建安全防线。一、DNS污染的本质：一场“地址簿”的篡改DNS污染，指攻击者通过技术手段向DNS服务器注入虚假的域名解
移动端网页布局注意事项及解决 1.winphone系统a、input标签被点击时产生的半透明灰色背景怎么去掉 Ailsa-show
移动端网页布局注意事项及解决1.winphone系统a、input标签被点击时产生的半透明灰色背景怎么去掉1、关闭iOS键盘首字母自动大写2、禁止文本缩放html{-webkit-text-size-adjust:100%;}3、移动端如何清除输入框内阴影在iOS上，输入框默认有内部阴影，但无法使用box-shadow来清除，如果不需要阴影，可以这样关闭：input,textarea{border

HarmonyOS5开发：Ark-TS 深度解析：从状态管理到性能优化，揭秘鸿蒙开发的底层逻辑 harmonyos-next
Ark-TS作为鸿蒙生态的核心开发语言，其设计哲学和技术细节值得让我们一起深入挖掘以下下。这篇文章将会带您和我们一起聚焦Ark-TS的状态管理机制、类型系统优化及声明式UI的底层实现，通过代码示例和原理分析，带您揭开Ark-TS高效开发的神秘面纱。一、状态管理：Ark-TS的“神经中枢”在Ark-TS中，状态管理是驱动UI更新的核心机制。不同的状态装饰器（如@State、@Prop、@Link）各
专利状态查询做一个码农都是奢望学习学习
我们学校没开通电子申请，只能纸质申请，导致信件延迟。所以需要及时掌握专利的状态。查询方法如下：1登录国知局国家知识产权公共服务平台(cnipa.gov.cn)打开中国及多国查询2输入申请号查询结果1：授权了需要我们缴费了！结果2：未缴纳申请费去缴费系统输入需要缴费专利的申请号，进行付款即可。3申请号获得方法查询即可4专利证书下载专利证书下载(cnipa.gov.cn)
使用SQL-PGVector进行PostgreSQL与语义搜索/RAG的结合 fgayif sql postgresql 数据库 python
在现代数据密集型应用中，语义搜索和检索增强生成（RAG）技术越来越受欢迎。通过结合PostgreSQL和pgvector扩展，我们可以实现高效的语义搜索。本文将深入探讨如何配置和使用SQL-PGVector，实现强大的数据查询能力。技术背景介绍PostgreSQL是一个功能强大的开源关系数据库，在处理结构化数据方面具备优势。为了增强其在非结构化数据处理中的能力，我们可以使用pgvector扩展，该
新型铁螯合剂FOT1：靶向铁死亡治疗代谢相关脂肪性肝炎的新突破感冒发烧流鼻涕笔记
摘要：代谢相关脂肪性肝炎（MASH）严重威胁公众健康，目前治疗手段有限。本文聚焦于浙江大学王福俤、闵军霞及温州医科大学郑明华团队的最新研究。该研究通过对MASH患者人群大队列数据的分析，结合多种小鼠MASH疾病模型功能筛选，发现MASH患者肝脏铁过量，且与疾病进展呈强正相关。研究团队开发的新型铁螯合剂FOT1（FerroTerminator1，铁死终结者），在多种MASH模型中表现出色，能够有效逆
【C++】priority_queue的使用及模拟实现（含仿函数介绍）梓䈑 C++学习 c++开发语言
文章目录前言一、priority_queue的介绍二、priority_queue的使用三、仿函数四、priority_queue的模拟实现前言一、priority_queue的介绍（优先级队列是默认使用vector作为其底层存储数据的容器适配器，在vector上又使用了堆算法将vector中元素构造成堆的结构，因此priority_queue就是堆）二、priority_queue的使用及模拟实
在.Net Core（.Net5）中使用开源组件SqlTableDependency来监听ms sqlserver的数据库数据变化 Lingbug 数据库 .netcore .net
文章目录1、本文主要说明在.NetCore（Demo为.Net5）中使用开源组件SqlTableDependency来监听mssqlserver的数据库数据变化2、github地址：https://github.com/IsNemoEqualTrue/monitor-table-change-with-sqltabledependency3、安装nuget包：install-packageSqlT
如何通过 SQLyog 连接远程 MySQL 数据库？（附工具下载）心灵宝贝 oracle 数据库
MySQL数据库管理工具，提供了图形化界面（GUI），方便用户进行数据库的管理、查询和优化。下载安装SQLyog：https://pan.quark.cn/s/28f872a50972SQLyog的主要功能：用户友好界面：简洁直观的界面，适合数据库管理员和开发人员使用。查询浏览器：支持编写和执行SQL查询，提供语法高亮和自动补全功能。数据导入/导出：支持多种格式（如CSV、XML、SQL等）的数据
河南大学数据库实验4 凡巾数据库 oracle
创建一个名为TEST数据库，要求如下：（下面三个表中属性的数据类型需要自己设计合适的数据类型）1、建立专业表speciality，它由专业号specno、专业名specname组成，其中专业号为主键，采用列级定义主键，专业名不能为空。2、建立院系表department，它由院名dname、院长dean、院职工人数dnum组成。其中院名为主属性，采用表级定义主键。3、建立一个“学生”表Student
【总结】常用API架构类型软件测试 API
引言在现代软件开发中，API(应用程序编程接口)已经成为各类系统之间交互的核心。不同的API架构类型适用于不同的业务需求和技术场景，选择合适的架构可以提高系统的性能、可维护性和扩展性。本文将介绍几种常见的API架构类型，并分析它们的特点、适用场景及优缺点。1.RESTfulAPI简介REST(RepresentationalStateTransfer)是一种基于HTTP协议的架构风格，强调使用标准
oceanbase与mysql性能对比_金融业分布式数据库:TDSQL、HotDB、OceanBase等原理、POC性能对比及选择是...... 高中物理宋老师
本帖最后由Amygo于2020-3-1501:33编辑1、分布式的实现，是通过中间件实现分布式，还是源码级别引入分布式算法实现的？解答：(1)分布式数据库是至少由计算节点、存储节点、管理平台、备份还原程序四个部分组成，从数据库系统理论知识上说分成：全局自治和场地自治，也粗略认为：全局可理解为计算节点、场地可理解为存储节点(2)这个问题的标题“中间件实现分布式还是源码级别引入分布式算法”这个说法存在
户储EMS开发|工商业储能EMS/户储EMS/EMS能源管理系统作用与功能|储能ems排名|SmartEMS3823型工商业/户储EMS系统分布式DTU/分散式DTU配电终端能源
户储EMS开发|工商业储能EMS/户储EMS/EMS能源管理系统作用与功能|储能ems排名|SmartEMS3823型工商业/户储EMS系统一：名词解释及背景EMS能量管理系统EMS（EnergyManagementSystem，能源管理系统）是储能系统的总体决策系统。能源管理系统包括电网级能源管理系统和微电网级能源管理系统。储能系统中主要的EMS系统是微电网层面。EMS作为支撑储能系统的信息管理
2025年渗透测试面试题总结-某四字大厂实习面试复盘一面二面三面（题目+回答）独行soc 2025年渗透测试面试指南面试职场和发展安全 web安全红蓝攻防 python
网络安全领域各种资源，学习文档，以及工具分享、前沿信息分享、POC、EXP分享。不定期分享各种好玩的项目及好用的工具，欢迎关注。目录一面1.数组和链表各自的优势和原因2.操作系统层面解析和进程3.线程和进程通信方式及数据安全问题4.线程和多进程的选用场景及原因5.SQL注入绕WAF方式6.FUZZ绕WAF的payload长度通常是多少7.不查资料直接写IPv4正则regex8.Fastjson反序
技术革命、需求升级与商业生态迭代——基于开源AI大模型与智能商业范式的创新研究说私域人工智能开源小程序微信零售
摘要：本文以技术哲学与商业生态系统理论为分析框架，通过质性研究与案例分析法，系统阐释第三次与第四次科技革命如何通过技术范式创新引发用户需求跃迁，进而驱动商业生态系统的结构性变革。研究聚焦开源AI大模型、AI智能名片、S2B2C商城及小程序源码等前沿技术工具，解构其如何重构"技术赋权-需求进化-商业物种爆发"的价值传导链条。研究发现：技术革命通过创造新需求空间、重构价值网络拓扑结构、降低创新参与门槛
SAX解析xml文件小猪猪08 xml
1.创建SAXParserFactory实例 2.通过SAXParserFactory对象获取SAXParser实例 3.创建一个类SAXParserHander继续DefaultHandler，并且实例化这个类 4.SAXParser实例的parse来获取文件 public static void main(String[] args) { //
为什么mysql里的ibdata1文件不断的增长？ brotherlamp linux linux运维 linux资料 linux视频 linux运维自学
我们在 Percona 支持栏目经常收到关于 MySQL 的 ibdata1 文件的这个问题。当监控服务器发送一个关于 MySQL 服务器存储的报警时，恐慌就开始了 —— 就是说磁盘快要满了。一番调查后你意识到大多数地盘空间被 InnoDB 的共享表空间 ibdata1 使用。而你已经启用了 innodbfileper_table，所以问题是： ibdata1存了什么？当你启用了 i
Quartz-quartz.properties配置 eksliang quartz
其实Quartz JAR文件的org.quartz包下就包含了一个quartz.properties属性配置文件并提供了默认设置。如果需要调整默认配置，可以在类路径下建立一个新的quartz.properties，它将自动被Quartz加载并覆盖默认的设置。下面是这些默认值的解释 #-----集群的配置 org.quartz.scheduler.instanceName =
informatica session的使用 18289753290 workflow session log Informatica
如果希望workflow存储最近20次的log，在session里的Config Object设置，log options做配置，save session log :sessions run ;savesessio log for these runs:20 session下面的source 里面有个tracing
Scrapy抓取网页时出现CRC check failed 0x471e6e9a != 0x7c07b839L的错误酷的飞上天空 scrapy
Scrapy版本0.14.4 出现问题现象： ERROR: Error downloading <GET http://xxxxx CRC check failed 解决方法 1.设置网络请求时的header中的属性'Accept-Encoding': '*;q=0' 明确表示不支持任何形式的压缩格式，避免程序的解压
java Swing小集锦永夜-极光 java swing
1.关闭窗体弹出确认对话框 1.1 this.setDefaultCloseOperation (JFrame.DO_NOTHING_ON_CLOSE); 1.2 this.addWindowListener ( new WindowAdapter () { public void windo
强制删除.svn文件夹随便小屋 java
在windows上，从别处复制的项目中可能带有.svn文件夹，手动删除太麻烦，并且每个文件夹下都有。所以写了个程序进行删除。因为.svn文件夹在windows上是只读的，所以用File中的delete()和deleteOnExist()方法都不能将其删除，所以只能采用windows命令方式进行删除
GET和POST有什么区别？及为什么网上的多数答案都是错的。 aijuans get post
如果有人问你，GET和POST，有什么区别？你会如何回答？我的经历前几天有人问我这个问题。我说GET是用于获取数据的，POST，一般用于将数据发给服务器之用。这个答案好像并不是他想要的。于是他继续追问有没有别的区别？我说这就是个名字而已，如果服务器支持，他完全可以把G
谈谈新浪微博背后的那些算法 aoyouzi 谈谈新浪微博背后的那些算法
本文对微博中常见的问题的对应算法进行了简单的介绍，在实际应用中的算法比介绍的要复杂的多。当然，本文覆盖的主题并不全，比如好友推荐、热点跟踪等就没有涉及到。但古人云“窥一斑而见全豹”，希望本文的介绍能帮助大家更好的理解微博这样的社交网络应用。微博是一个很多人都在用的社交应用。天天刷微博的人每天都会进行着这样几个操作：原创、转发、回复、阅读、关注、@等。其中，前四个是针对短博文，最后的关注和@则针
Connection reset 连接被重置的解决方法百合不是茶 java 字符流连接被重置
流是java的核心部分,,昨天在做android服务器连接服务器的时候出了问题,就将代码放到java中执行,结果还是一样连接被重置被重置的代码如下; 客户端代码; package 通信软件服务器; import java.io.BufferedWriter; import java.io.OutputStream; import java.io.O
web.xml配置详解之filter bijian1013 java web.xml filter
一.定义 <filter> <filter-name>encodingfilter</filter-name> <filter-class>com.my.app.EncodingFilter</filter-class> <init-param> <param-name>encoding<
Heritrix Bill_chen 多线程 xml 算法制造配置管理
作为纯Java语言开发的、功能强大的网络爬虫Heritrix，其功能极其强大，且扩展性良好，深受热爱搜索技术的盆友们的喜爱，但它配置较为复杂，且源码不好理解，最近又使劲看了下，结合自己的学习和理解，跟大家分享Heritrix的点点滴滴。 Heritrix的下载（http://sourceforge.net/projects/archive-crawler/）安装、配置，就不罗嗦了，可以自己找找资
【Zookeeper】FAQ bit1129 zookeeper
1.脱离IDE，运行简单的Java客户端程序 #ZkClient是简单的Zookeeper~$ java -cp "./:zookeeper-3.4.6.jar:./lib/*" ZKClient 1. Zookeeper是的Watcher回调是同步操作，需要添加异步处理的代码 2. 如果Zookeeper集群跨越多个机房，那么Leader/
The user specified as a definer ('aaa'@'localhost') does not exist 白糖_ localhost
今天遇到一个客户BUG，当前的jdbc连接用户是root，然后部分删除操作都会报下面这个错误：The user specified as a definer ('aaa'@'localhost') does not exist 最后找原因发现删除操作做了触发器，而触发器里面有这样一句 /*!50017 DEFINER = ''aaa@'localhost' */ 原来最初
javascript中showModelDialog刷新父页面 bozch JavaScript 刷新父页面 showModalDialog
在页面中使用showModalDialog打开模式子页面窗口的时候，如果想在子页面中操作父页面中的某个节点，可以通过如下的进行： window.showModalDialog('url',self,‘status...’); // 首先中间参数使用self 在子页面使用w
编程之美-买书折扣 bylijinnan 编程之美
import java.util.Arrays; public class BookDiscount { /**编程之美买书折扣书上的贪心算法的分析很有意思，我看了半天看不懂，结果作者说，贪心算法在这个问题上是不适用的。。下面用动态规划实现。哈利波特这本书一共有五卷，每卷都是8欧元，如果读者一次购买不同的两卷可扣除5%的折扣，三卷10%，四卷20%，五卷
关于struts2.3.4项目跨站执行脚本以及远程执行漏洞修复概要 chenbowen00 struts WEB安全
因为近期负责的几个银行系统软件，需要交付客户，因此客户专门请了安全公司对系统进行了安全评测，结果发现了诸如跨站执行脚本，远程执行漏洞以及弱口令等问题。下面记录下本次解决的过程以便后续 1、首先从最简单的开始处理，服务器的弱口令问题，首先根据安全工具提供的测试描述中发现应用服务器中存在一个匿名用户，默认是不需要密码的，经过分析发现服务器使用了FTP协议，而使用ftp协议默认会产生一个匿名用
[电力与暖气]煤炭燃烧与电力加温 comsci
在宇宙中,用贝塔射线观测地球某个部分,看上去,好像一个个马蜂窝,又像珊瑚礁一样,原来是某个国家的采煤区..... 不过,这个采煤区的煤炭看来是要用完了.....那么依赖将起燃烧并取暖的城市,在极度严寒的季节中...该怎么办呢? &nbs
oracle O7_DICTIONARY_ACCESSIBILITY参数 daizj oracle
O7_DICTIONARY_ACCESSIBILITY参数控制对数据字典的访问.设置为true,如果用户被授予了如select any table等any table权限,用户即使不是dba或sysdba用户也可以访问数据字典.在9i及以上版本默认为false,8i及以前版本默认为true.如果设置为true就可能会带来安全上的一些问题.这也就为什么O7_DICTIONARY_ACCESSIBIL
比较全面的MySQL优化参考 dengkane mysql
本文整理了一些MySQL的通用优化方法，做个简单的总结分享，旨在帮助那些没有专职MySQL DBA的企业做好基本的优化工作，至于具体的SQL优化，大部分通过加适当的索引即可达到效果，更复杂的就需要具体分析了，可以参考本站的一些优化案例或者联系我，下方有我的联系方式。这是上篇。 1、硬件层相关优化 1.1、CPU相关在服务器的BIOS设置中，可
C语言homework2，有一个逆序打印数字的小算法 dcj3sjt126com c
#h1# 0、完成课堂例子 1、将一个四位数逆序打印 1234 ==> 4321 实现方法一： # include <stdio.h> int main(void) { int i = 1234; int one = i%10; int two = i / 10 % 10; int three = i / 100 % 10;
apacheBench对网站进行压力测试 dcj3sjt126com apachebench
ab 的全称是 ApacheBench ，是 Apache 附带的一个小工具，专门用于 HTTP Server 的 benchmark testing ，可以同时模拟多个并发请求。前段时间看到公司的开发人员也在用它作一些测试，看起来也不错，很简单，也很容易使用，所以今天花一点时间看了一下。通过下面的一个简单的例子和注释，相信大家可以更容易理解这个工具的使用。
2种办法让HashMap线程安全 flyfoxs java jdk jni
多线程之--2种办法让HashMap线程安全多线程之--synchronized 和reentrantlock的优缺点多线程之--2种JAVA乐观锁的比较( NonfairSync VS. FairSync) HashMap不是线程安全的,往往在写程序时需要通过一些方法来回避.其实JDK原生的提供了2种方法让HashMap支持线程安全.
Spring Security（04）——认证简介 234390216 Spring Security 认证过程
认证简介目录 1.1 认证过程 1.2 Web应用的认证过程 1.2.1 ExceptionTranslationFilter 1.2.2 在request之间共享SecurityContext 1
Java 位运算 Javahuhui java 位运算
// 左移( << ) 低位补0 // 0000 0000 0000 0000 0000 0000 0000 0110 然后左移2位后，低位补0： // 0000 0000 0000 0000 0000 0000 0001 1000 System.out.println(6 << 2);// 运行结果是24 // 右移( >> ) 高位补"
mysql免安装版配置 ldzyz007 mysql
1、my-small.ini是为了小型数据库而设计的。不应该把这个模型用于含有一些常用项目的数据库。 2、my-medium.ini是为中等规模的数据库而设计的。如果你正在企业中使用RHEL,可能会比这个操作系统的最小RAM需求(256MB)明显多得多的物理内存。由此可见，如果有那么多RAM内存可以使用，自然可以在同一台机器上运行其它服务。 3、my-large.ini是为专用于一个SQL数据
MFC和ado数据库使用时遇到的问题你不认识的休道人 sql C++mfc
=================================================================== 第一个 =================================================================== try{ CString sql; sql.Format("select * from p
表单重复提交Double Submits rensanning double
可能发生的场景： *多次点击提交按钮 *刷新页面 *点击浏览器回退按钮 *直接访问收藏夹中的地址 *重复发送HTTP请求（Ajax）（1）点击按钮后disable该按钮一会儿，这样能避免急躁的用户频繁点击按钮。这种方法确实有些粗暴，友好一点的可以把按钮的文字变一下做个提示，比如Bootstrap的做法： http://getbootstrap.co
Java String 十大常见问题 tomcat_oracle java 正则表达式
　1.字符串比较，使用“==”还是equals()? 　　"=="判断两个引用的是不是同一个内存地址(同一个物理对象)。　　equals()判断两个字符串的值是否相等。　　除非你想判断两个string引用是否同一个对象，否则应该总是使用equals()方法。　　如果你了解字符串的驻留(String Interning)则会更好地理解这个问题。　　
SpringMVC 登陆拦截器实现登陆控制 xp9802 springMVC
思路，先登陆后，将登陆信息存储在session中，然后通过拦截器，对系统中的页面和资源进行访问拦截，同时对于登陆本身相关的页面和资源不拦截。实现方法： 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23

HBase第一天：HBase组件及架构、安装HBase部署集群、HBase的shell操作、HBase数据结构、命名空间、原理、读写流程、flush与合并、hbase-default.xml配置详解

本文目录

第1章 HBase简介

1.1 什么是HBase

1.2 Hbase特点

1.3 HBase架构

1.3 HBase中的角色

1.3.1 HMaster

1.3.2 RegionServer

1.2.3 其他组件

第2章 HBase安装

2.1 Zookeeper正常部署

2.2 Hadoop正常部署

2.3 HBase的解压

2.4 HBase的配置文件

2.5 HBase远程发送到其他集群

2.6 HBase服务的启动

2.7 查看HBase页面

第3章 HBase Shell操作

3.1 基本操作

3.2 表的操作

第4章 HBase数据结构

4.1 RowKey

4.2 Column Family

4.3 Cell

4.4 Time Stamp

4.5 命名空间 相当于mysql的数据库

第5 章 HBase原理

5.1 读流程

5.2 写流程

5.3 数据flush过程

5.4 数据合并过程

你可能感兴趣的:(Hadoop生态体系,数据库及数据仓库)

4.5 命名空间相当于mysql的数据库