Oracle 分析及动态采样

之前在说Oracle Optimizer中的CBO时讲到，当表没有做分析的时候，Oracle 会使用动态采样来收集统计信息。获取准确的段对象（表，表分区，索引等）的分析数据，是CBO存在的基石，CBO的机制就是收集尽可能多的对象信息和系统信息，通过对这些信息进行计算，分析，评估，最终得出一个成本最低的执行计划。所以对于CBO，数据段的分析就非常重要。

Oracle Optimizer CBO RBO

http://blog.csdn.net/tianlesoftware/archive/2010/08/19/5824886.aspx

一．先演示一个示例，来理解分析的作用

1.1创建表

SQL> create table t as select object_id,object_name from dba_objects where 1=2;

表已创建。

SQL> create index index_t on t(object_id);

索引已创建。

SQL> insert into t select object_id,object_name from dba_objects;

已创建72926行。

SQL> commit;

提交完成。

1.2查看分的分析及执行计划

SQL> select num_rows,avg_row_len,blocks,last_analyzed from user_tables where table_name='T';

NUM_ROWS AVG_ROW_LEN BLOCKS LAST_ANALYZED

---------- ----------- ---------- --------------

SQL> select blevel,leaf_blocks,distinct_keys,last_analyzed from user_indexes where table_name='T';

BLEVEL LEAF_BLOCKS DISTINCT_KEYS LAST_ANALYZED

---------- ----------- ------------- --------------

0 0 0 25-8月 -10

从查询结果看出，表的行数，行长，占用的数据块数及最后的分析时间都是空。索引的相关信息也没有，说明这个表和说因都没有被分析，如果此时有一条SQL 对表做查询，CBO 由于无法获取这些信息，很可能生成错误的执行计划，如：

SQL> set linesize 200

SQL> set autot trace exp;

SQL> select /*+dynamic_sampling(t 0) */ * from t where object_id>30;

执行计划

----------------------------------------------------------

Plan hash value: 80339723

---------------------------------------------------------------------------------------

---------------------------------------------------------------------------------------

| 0 | SELECT STATEMENT | | 4 | 316 | 0 (0)| 00:00:01 |

| 1 | TABLE ACCESS BY INDEX ROWID| T | 4 | 316 | 0 (0)| 00:00:01 |

|* 2 | INDEX RANGE SCAN | INDEX_T | 1 | | 0 (0)| 00:00:01 |

---------------------------------------------------------------------------------------

Predicate Information (identified by operation id):

---------------------------------------------------

2 - access("OBJECT_ID">30)

SQL>

在Oracle 10g以后，如果一个表没有做分析，数据库将自动对它做动态采样分析，所以这里采用hint的方式将动态采样的级别设置为0，即不使用动态采样。

从这个执行计划，看书CBO 估计出表中满足条件的记录为4条，索引使用了索引。我们对表做一下分析，用结果比较一下。

1.3 分析表及查看分析之后的执行计划

分析可以通过两中方式：

一种是analyze 命令，如：

analyze table tablename compute statistics for all indexes;

还有一种就是通过DBMS_STATS包来分析，从9i 开始，Oracle 推荐使用DBMS_STATS包对表进行分析操作，因为DBMS_STATS 提供了更多的功能，以及灵活的操作方式。

SQL> exec dbms_stats.gather_table_stats('SYS','T');

PL/SQL 过程已成功完成。

SQL> select blevel,leaf_blocks,distinct_keys,last_analyzed from user_indexes where table_name='T';

BLEVEL LEAF_BLOCKS DISTINCT_KEYS LAST_ANALYZED

---------- ----------- ------------- --------------

1 263 72926 25-8月 -10

SQL> select num_rows,avg_row_len,blocks,last_analyzed from user_tables where table_name='T';

NUM_ROWS AVG_ROW_LEN BLOCKS LAST_ANALYZED

---------- ----------- ---------- --------------

72926 29 345 25-8月 -10

从上面的结果，可以看出DBMS_STATS.gather_table_stats已经对表和索引都做了分析。现在我们在来看一下执行计划。

SQL> set autot trace exp;

SQL> select * from t where object_id>30;

执行计划

----------------------------------------------------------

Plan hash value: 1601196873

--------------------------------------------------------------------------

--------------------------------------------------------------------------

| 0 | SELECT STATEMENT | | 72899 | 2064K| 96 (2)| 00:00:02 |

|* 1 | TABLE ACCESS FULL| T | 72899 | 2064K| 96 (2)| 00:00:02 |

--------------------------------------------------------------------------

Predicate Information (identified by operation id):

---------------------------------------------------

1 - filter("OBJECT_ID">30)

从这个计划，我们看出CBO 估算出的结果是72899 条记录，与实际的72926很近。此时选择全表扫描更优。通过这个例子，我们也看出了分析对执行计划的重要性。

二．直方图（Histogram）

DBMS_STATS 包对段表的分析有三个层次：

（1）表自身的分析：包括表中的行数，数据块数，行长等信息。

（2）列的分析：包括列值的重复数，列上的空值，数据在列上的分布情况。

（3）索引的分析：包括索引叶块的数量，索引的深度，索引的聚合因子等。

直方图就是列分析中数据在列上的分布情况。

当Oracle 做直方图分析时，会将要分析的列上的数据分成很多数量相同的部分，每一部分称为一个bucket,这样CBO就可以非常容易地知道这个列上的数的分布情况，这种数据的分布将作为一个非常重要的因素纳入到执行计划成本的计算当中。

对于数据分布非常倾斜的表，做直方图是非常有用的。如：　1,10,20,30,40,50. 那么在一个数值范围（bucket）内，它的数据记录基本上一样。如果是：1,5,5,5,5,10,10,20,50,100. 那么它在bucket内，数据分布就是严重的倾斜。

直方图有时对于CBO非常重要，特别是对于有字段数据非常倾斜的表，做直方图分析尤为重要。可以用dbms_stats包来分析。默认情况下，dbms_stats 包会对所有的列做直方图分析。如：

SQL> exec dbms_stats.gather_table_stats('SYS','T',cascade=>true);

PL/SQL 过程已成功完成。

然后从user_histograms视图上查看到相关的信息：

SQL> select table_name,column_name,endpoint_number,endpoint_value from user_histograms where table_name='T';

TABLE_NAME COLUMN_NAME ENDPOINT_NUMBER ENDPOINT_VALUE

------------------------------ -------------------- --------------- --------------

T OBJECT_ID 0 2

T OBJECT_NAME 0 2.4504E+35

T OBJECT_ID 1 76685

T OBJECT_NAME 1 1.0886E+36

如果一个列上的数据有比较严重的倾斜，对这个列做直方图是必要的，但是，Oracle 对数据分析是需要消耗资源的，特别是对于一些很大的段对象，分析的时间尤其长。对于OLAP系统，可能需要几个小时才能完成。

所以做不做分析就需要DBA 权衡好了。但有一点要注意，不要在生产环境中随便修改分析方案，除非你有十足的把握。否则可能导致非常严重的后果。

三． DBMS_STATS包

DBMS_STAS包不仅能够对表进行分析，它还可以对数据库分析进行管理。按照功能可以分一下几类：

（1）性能数据的收集

（2）性能数据的设置

（3）性能数据的删除

（4）性能数据的备份和恢

更多信息参考Oracle 联机文档：

11g DBMS_STATS

http://download.oracle.com/docs/cd/E11882_01/appdev.112/e10577/d_stats.htm#ARPLS68486

10g DBMS_STATS

http://download.oracle.com/docs/cd/B19306_01/appdev.102/b14258/d_stats.htm#i1036461

3.1 DBMS_STATS包的几个常用功能：性能的手机，设定和删除

性能数据的收集包含这样几个存储过程：

GATHER_DATABASE_STATS Procedures
GATHER_DICTIONARY_STATS Procedure
GATHER_FIXED_OBJECTS_STATS Procedure
GATHER_INDEX_STATS Procedure
GATHER_SCHEMA_STATS Procedures
GATHER_SYSTEM_STATS Procedure
GATHER_TABLE_STATS Procedure

从名字也可以看出各自的作用，这些存储过程用来收集数据库不同级别对象的性能数据，包括：数据库，数据字典，表，索引，SCHEMA的性能等。

3.1.1 GATHER_TABLE_STATS Procedure 存储过程

在10g中, GATHER_TABLE_STATS的参数如下：

DBMS_STATS.GATHER_TABLE_STATS (

ownname VARCHAR2,

tabname VARCHAR2,

partname VARCHAR2 DEFAULT NULL,

estimate_percent NUMBER DEFAULT to_estimate_percent_type

(get_param('ESTIMATE_PERCENT')),

block_sample BOOLEAN DEFAULT FALSE,

method_opt VARCHAR2 DEFAULT get_param('METHOD_OPT'),

degree NUMBER DEFAULT to_degree_type(get_param('DEGREE')),

granularity VARCHAR2 DEFAULT GET_PARAM('GRANULARITY'),

cascade BOOLEAN DEFAULT to_cascade_type(get_param('CASCADE')),

stattab VARCHAR2 DEFAULT NULL,

statid VARCHAR2 DEFAULT NULL,

statown VARCHAR2 DEFAULT NULL,

no_invalidate BOOLEAN DEFAULT to_no_invalidate_type (

get_param('NO_INVALIDATE')),

force BOOLEAN DEFAULT FALSE);

到了11g，对参数做了调整：

DBMS_STATS.GATHER_TABLE_STATS (

ownname VARCHAR2,

tabname VARCHAR2,

partname VARCHAR2 DEFAULT NULL,

estimate_percent NUMBER DEFAULT to_estimate_percent_type

(get_param('ESTIMATE_PERCENT')),

block_sample BOOLEAN DEFAULT FALSE,

method_opt VARCHAR2 DEFAULT get_param('METHOD_OPT'),

degree NUMBER DEFAULT to_degree_type(get_param('DEGREE')),

granularity VARCHAR2 DEFAULT GET_PARAM('GRANULARITY'),

cascade BOOLEAN DEFAULT to_cascade_type(get_param('CASCADE')),

stattab VARCHAR2 DEFAULT NULL,

statid VARCHAR2 DEFAULT NULL,

statown VARCHAR2 DEFAULT NULL,

no_invalidate BOOLEAN DEFAULT to_no_invalidate_type (

get_param('NO_INVALIDATE')),

force BOOLEAN DEFAULT FALSE);

对参数的说明：

Parameter	Description
ownname	Schema of table to analyze
tabname	Name of table
partname	Name of partition
estimate_percent	Percentage of rows to estimate (NULL means compute) The valid range is [0.000001,100]. Use the constant DBMS_STATS.AUTO_SAMPLE_SIZE to have Oracle determine the appropriate sample size for good statistics. This is the default.The default value can be changed using the SET_PARAM Procedure.
block_sample	Whether or not to use random block sampling instead of random row sampling. Random block sampling is more efficient, but if the data is not randomly distributed on disk, then the sample values may be somewhat correlated. Only pertinent when doing an estimate statistics.
method_opt	Accepts: FOR ALL [INDEXED \| HIDDEN] COLUMNS [size_clause] FOR COLUMNS [size clause] column\|attribute [size_clause] [,column\|attribute [size_clause]...] size_clause is defined as size_clause := SIZE {integer \| REPEAT \| AUTO \| SKEWONLY} - integer : Number of histogram buckets. Must be in the range [1,254]. - REPEAT : Collects histograms only on the columns that already have histograms. - AUTO : Oracle determines the columns to collect histograms based on data distribution and the workload of the columns. - SKEWONLY : Oracle determines the columns to collect histograms based on the data distribution of the columns. The default is FOR ALL COLUMNS SIZE AUTO.The default value can be changed using the SET_PARAM Procedure.
degree	Degree of parallelism. The default for degree is NULL. The default value can be changed using the SET_PARAM Procedure NULL means use the table default value specified by the DEGREE clause in the CREATE TABLE or ALTER TABLE statement. Use the constant DBMS_STATS.DEFAULT_DEGREE to specify the default value based on the initialization parameters. The AUTO_DEGREE value determines the degree of parallelism automatically. This is either 1 (serial execution) or DEFAULT_DEGREE (the system default value based on number of CPUs and initialization parameters) according to size of the object.
granularity	Granularity of statistics to collect (only pertinent if the table is partitioned). 'ALL' - gathers all (subpartition, partition, and global) statistics 'AUTO'- determines the granularity based on the partitioning type. This is the default value. 'DEFAULT' - gathers global and partition-level statistics. This option is obsolete, and while currently supported, it is included in the documentation for legacy reasons only. You should use the 'GLOBAL AND PARTITION' for this functionality. Note that the default value is now 'AUTO'. 'GLOBAL' - gathers global statistics 'GLOBAL AND PARTITION' - gathers the global and partition level statistics. No subpartition level statistics are gathered even if it is a composite partitioned object. 'PARTITION '- gathers partition-level statistics 'SUBPARTITION' - gathers subpartition-level statistics.
cascade	Gather statistics on the indexes for this table. Index statistics gathering is not parallelized. Using this option is equivalent to running the GATHER_INDEX_STATS Procedure on each of the table's indexes. Use the constant DBMS_STATS.AUTO_CASCADE to have Oracle determine whether index statistics to be collected or not. This is the default. The default value can be changed using theSET_PARAM Procedure.
stattab	User statistics table identifier describing where to save the current statistics
statid	Identifier (optional) to associate with these statistics within stattab
statown	Schema containing stattab (if different than ownname)
no_invalidate	Does not invalidate the dependent cursors if set to TRUE. The procedure invalidates the dependent cursors immediately if set to FALSE. Use DBMS_STATS.AUTO_INVALIDATE. to have Oracle decide when to invalidate dependent cursors. This is the default. The default can be changed using the SET_PARAM Procedure.
force	Gather statistics of table even if it is locked

在gather_table_stats 存储过程的所有参数中，除了ownname和tabname，其他的参数都有默认值。所以我们在调用这个存储过程时，Oracle 会使用参数的默认值对表进行分析。如：

SQL> exec dbms_stats.gather_table_STATS('SYS','T');

PL/SQL 过程已成功完成。

如果想查看当前的默认值，可以使用dbms_stats.get_param函数来获取：

SQL> select dbms_stats.get_param('method_opt') from dual;

DBMS_STATS.GET_PARAM('METHOD_OPT')

------------------------------------------------------------

FOR ALL COLUMNS SIZE AUTO

结合上面对参数的说明：

- AUTO : Oracle determines the columns to collect histograms based on data distribution and the workload of the columns.

我们可以看出，就是对所有的列做直方图分析，直方图设置的bucket值由Oracle自己决定。

3.1.1.1 estimate_percent 参数

这个参数是一个百分比值，它告诉分析包需要使用表中数据的多大比例来做分析。

理论上来讲，采样的数据越多，得到的信息就越接近于实际，CBO做出的执行计划就越优化，但是，采样越多，消耗的系统资源必然越多。对系统的影响也越大。所以对于这个值的设置，要根据业务情况来。如果数据的直方图分布比较均匀，就可以使用默认值：AUTO_SAMPLE_SIZE，即让Oracle 自己来判断采样的比例。有时，特别是对于批量加载的表，我们可以预估表中的数据量，可以人工地设置一个合理的值。一般，对于一个有1000万数据的表分区，可以把这个参数设置为0.000001.

3.1. 1.2 Method_option 参数

这个参数用来定义直方图分析的一些值。

FOR ALL [INDEXED | HIDDEN] COLUMNS [size_clause]

FOR COLUMNS [size clause] column|attribute [size_clause] [,column|attribute [size_clause]...]

这里给出了4种指定哪些列进行分析的方式：

（1）所有列：for all column

（2）索引列：只对有索引的列进行分析，for all indexed columns

（3）影藏列：只对影藏的列进行分析，for all hidden columns

（4）显示指定列：显示的指定那些列进行分析，for columns columns_name

该参数默认值：for all columns size auto.

3.1. 1.3 degree 参数

用来指定分析时使用的并行度。有以下这些设置：

(1) Null：如果设置为null，Oracle 将使用被分析表属性的并行度，比如表在创建时指定的并行度，或者后者使用alter table 重新设置的并行度。

(2) 一个数值：可以显示地指定分析时使用的并行度。

(3) Default_degree: 如果设置为default，Oracle 将根据初始化参数中相关参数的设置来决定使用的并行度。

这个参数的默认值是Null，即通过表上的并行度属性来决定分析使用的并行度。当需要分析的表或表分区非常大，并且系统资源比较充分的时候，就可以考虑使用并行的方式来做分析，这样就会大大提高分析的速度。相反，如果你的系统资源比较吃紧，那么启用并行可能会适得其反。

3.1. 1.4 Granularity

分析的粒度，有以下几个配置：

（1） ALL : 将会对表的全局（global），分区，子分区的数据都做分析

（2） AUTO: Oracle 根据分区的类型，自动决定做哪一种粒度的分析。

（3） GLOBAL：只做全局级别的分析。

（4） GLOBAL AND PARTITION: 只对全局和分区级别做分析，对子分区不做分析，这是和ALL的一个区别。

（5） PARTITION: 只在分区级别做分析。

（6） SUBPARTITION: 只在子分区做分析。

在生产环境中，特别是OLAP 或者数据仓库的环境中，这个参数的设置会直接影响到CBO的执行计划选择。

在OLAP或者数据仓库系统中，经常有这样的事情，新创建一个分区，将批量的数据（通常是很大的数据）加载到分区中，对分区做分析，然后做报表或者数据挖掘。在理想的情况下，对表的全局，分区都做分析，这样才能得到最充足的数据，但是通常这样的表都非常大，如果每增加一个分区都需要做一次全局分析，那么会消耗极大的系统资源。但是如果只对新加入的分区进行分区而不做全局分析，oracle 在全局范围内的信息就会不准确。

该参数在默认情况下，DBMS_STATS 包会对表级（全局），分区级（对应参数partition）都会进行分析。如果把cascade 设置为true，相应索引的全局和分区级别也都会被分析。如果只对分区级进行分析，而全局没有分析，那么全局信息没有更新，依然会导致CBO 作出错误的执行计划。

所以当一些新的数据插入到表中时，如果对这些新的数据进行分析，是一个非常重要的问题。一般参考如下原则：

（1）看一下新插入的数据在全表中所占的比例，如果所占比例不是很大，那么可以考虑不做全局分析，否则就需要考虑，一句是业务的实际运行情况。

（2）采样比例。如果载入的数据量非常大，比如上千万或者更大，就要把采样比例压缩的尽可能地小，但底线是不能影响CBO做出正确的执行计划，采样比例的上线是不能消耗太多的资源而影响到业务的正常运行。

（3）新加载的数据应该要做分区级的数据分析。至于是否需要直方图分析，以及设置多少个buckets（size参数指定），需要DBA一句数据的分布情况进行考虑，关键是视数据的倾斜程度而定。

3.1.2 GATHER_SCHEMA_STATS 存储过程

这个存储过程用于对某个用户下所有的对象进行分析。如果你的数据用户对象非常多，单独对每个对象进行分析设定会非常不方便，这个存储过程就很方便。它的好处在于如果需要分析的对象非常多，将可以大大降低DBA的工作量，不足之处是所有分析使用相同的分析策略，可能会导致分析不是最优。所以要根据实际情况来决定。

该存储过程参数如下：

DBMS_STATS.GATHER_SCHEMA_STATS (

ownname VARCHAR2,

estimate_percent NUMBER DEFAULT to_estimate_percent_type

(get_param('ESTIMATE_PERCENT')),

block_sample BOOLEAN DEFAULT FALSE,

method_opt VARCHAR2 DEFAULT get_param('METHOD_OPT'),

degree NUMBER DEFAULT to_degree_type(get_param('DEGREE')),

granularity VARCHAR2 DEFAULT GET_PARAM('GRANULARITY'),

cascade BOOLEAN DEFAULT to_cascade_type(get_param('CASCADE')),

stattab VARCHAR2 DEFAULT NULL,

statid VARCHAR2 DEFAULT NULL,

options VARCHAR2 DEFAULT 'GATHER',

objlist OUT ObjectTab,

statown VARCHAR2 DEFAULT NULL,

no_invalidate BOOLEAN DEFAULT to_no_invalidate_type (

get_param('NO_INVALIDATE')),

force BOOLEAN DEFAULT FALSE,

obj_filter_list ObjectTab DEFAULT NULL);

参数说明如下：

Parameter	Description
`ownname`	Schema to analyze (`NULL` means current schema)
`estimate_percent`	Percentage of rows to estimate (`NULL` means compute): The valid range is [0.000001,100]. Use the constant `DBMS_STATS`.`AUTO_SAMPLE_SIZE` to have Oracle determine the appropriate sample size for good statistics. This is the default.The default value can be changed using the SET_DATABASE_PREFS Procedure, SET_GLOBAL_PREFS Procedure, SET_SCHEMA_PREFS Procedure and SET_TABLE_PREFS Procedure.
`block_sample`	Whether or not to use random block sampling instead of random row sampling. Random block sampling is more efficient, but if the data is not randomly distributed on disk, then the sample values may be somewhat correlated. Only pertinent when doing an estimate statistics.
`method_opt`	Accepts: `FOR ALL [INDEXED \| HIDDEN] COLUMNS` `[size_clause]` size_clause is defined as size_clause := `SIZE` {integer \| `REPEAT` \| `AUTO` \| `SKEWONLY`} `- integer` : Number of histogram buckets. Must be in the range [1,254]. `- REPEAT` : Collects histograms only on the columns that already have histograms. `- AUTO` : Oracle determines the columns to collect histograms based on data distribution and the workload of the columns. `- SKEWONLY` : Oracle determines the columns to collect histograms based on the data distribution of the columns. The default is `FOR ALL COLUMNS SIZE AUTO`.The default value can be changed using the SET_DATABASE_PREFS Procedure, SET_GLOBAL_PREFS Procedure, SET_SCHEMA_PREFS Procedure and SET_TABLE_PREFS Procedure.
`degree`	Degree of parallelism. The default for `degree` is `NULL`. The default value can be changed using the SET_DATABASE_PREFS Procedure, SET_GLOBAL_PREFS Procedure, SET_SCHEMA_PREFS Procedure and SET_TABLE_PREFS Procedure. `NULL` means use the table default value specified by the `DEGREE` clause in the `CREATE TABLE` or `ALTER TABLE` statement. Use the constant `DBMS_STATS.DEFAULT_DEGREE` to specify the default value based on the initialization parameters.The `AUTO_DEGREE` `value determines the degree of parallelism automatically. This is either 1 (serial execution) or DEFAULT_DEGREE` (the system default value based on number of CPUs and initialization parameters) according to size of the object.
`granularity`	Granularity of statistics to collect (only pertinent if the table is partitioned). `'ALL'` - gathers all (subpartition, partition, and global) statistics `'AUTO'`- determines the granularity based on the partitioning type. This is the default value. `'DEFAULT'` - gathers global and partition-level statistics. This option is obsolete, and while currently supported, it is included in the documentation for legacy reasons only. You should use the '`GLOBAL AND PARTITION`' for this functionality. Note that the default value is now '`AUTO`'. `'GLOBAL'` - gathers global statistics '`GLOBAL AND PARTITION`' - gathers the global and partition level statistics. No subpartition level statistics are gathered even if it is a composite partitioned object. `'PARTITION` '- gathers partition-level statistics `'SUBPARTITION'` - gathers subpartition-level statistics.
`cascade`	Gather statistics on the indexes as well. Using this option is equivalent to running the GATHER_INDEX_STATS Procedure on each of the indexes in the schema in addition to gathering table and column statistics. Use the constant `DBMS_STATS.AUTO_CASCADE` to have Oracle determine whether index statistics to be collected or not. This is the default. The default value can be changed using the SET_DATABASE_PREFS Procedure, SET_GLOBAL_PREFS Procedure, SET_SCHEMA_PREFS Procedure and SET_TABLE_PREFS Procedure.
`stattab`	User statistics table identifier describing where to save the current statistics
`statid`	Identifier (optional) to associate with these statistics within `stattab`
`options`	Further specification of which objects to gather statistics for: `GATHER`: Gathers statistics on all objects in the schema. `GATHER` `AUTO`: Gathers all necessary statistics automatically. Oracle implicitly determines which objects need new statistics, and determines how to gather those statistics. When `GATHER AUTO` is specified, the only additional valid parameters are `ownname`, `stattab`, `statid`, `objlist` and `statown`; all other parameter settings are ignored. Returns a list of processed objects. `GATHER` `STALE`: Gathers statistics on stale objects as determined by looking at the `_tab_modifications` views. Also, return a list of objects found to be stale. `GATHER` `EMPTY`: Gathers statistics on objects which currently have no statistics. also, return a list of objects found to have no statistics. `LIST AUTO`: Returns a list of objects to be processed with `GATHER AUTO`. `LIST` `STALE`: Returns list of stale objects as determined by looking at the `_tab_modifications` views. `LIST` `EMPTY`: Returns list of objects which currently have no statistics.
`objlist`	List of objects found to be stale or empty
`statown`	Schema containing `stattab` (if different than `ownname`)
`no_invalidate`	Does not invalidate the dependent cursors if set to `TRUE`. The procedure invalidates the dependent cursors immediately if set to `FALSE`. Use `DBMS_STATS`.`AUTO_INVALIDATE`. to have Oracle decide when to invalidate dependent cursors. This is the default. The default can be changed using the SET_DATABASE_PREFS Procedure, SET_GLOBAL_PREFS Procedure, SET_SCHEMA_PREFS Procedure and SET_TABLE_PREFS Procedure.
`force`	Gather statistics on objects even if they are locked
`obj_filter_list`	A list of object filters. When provided, `GATHER_SCHEMA_STATS` will gather statistics only on objects which satisfy at least one object filter in the list as needed. In a single object filter, we can specify the constraints on the object attributes. The attribute values specified in the object filter are case- insensitive unless double-quoted. Wildcard is allowed in the attribute values. Suppose non-`NULL` values s1, s2, ... are specified for attributes a1, a2, ... in one object filter. An object o is said to satisfy this object filter if (o.a1 like s1) and (o.a2 like s2) and ... is true. See Applying an Object Filter List.

3.1.3 DBMS_STATS.GATHER_INDEX_STATS 存储过程

该存储过程用于对索引的分析，如果我们在使用DBMS_STATS.GATHER_TABLES_STATS的分析时设置参数cascade=>true。那么Oracle会同时执行这个存储过程来对索引进行分析。

存储过程参数：

DBMS_STATS.GATHER_INDEX_STATS (

ownname VARCHAR2,

indname VARCHAR2,

partname VARCHAR2 DEFAULT NULL,

estimate_percent NUMBER DEFAULT to_estimate_percent_type

(GET_PARAM('ESTIMATE_PERCENT')),

stattab VARCHAR2 DEFAULT NULL,

statid VARCHAR2 DEFAULT NULL,

statown VARCHAR2 DEFAULT NULL,

degree NUMBER DEFAULT to_degree_type(get_param('DEGREE')),

granularity VARCHAR2 DEFAULT GET_PARAM('GRANULARITY'),

no_invalidate BOOLEAN DEFAULT to_no_invalidate_type

(GET_PARAM('NO_INVALIDATE')),

force BOOLEAN DEFAULT FALSE);

Parameter	Description
`ownname`	Schema of index to analyze
`indname`	Name of index
`partname`	Name of partition
`estimate_percent`	Percentage of rows to estimate (`NULL` means compute). The valid range is `[0.000001,100]`. Use the constant `DBMS_STATS`.`AUTO_SAMPLE_SIZE` to have Oracle determine the appropriate sample size for good statistics. This is the default.The default value can be changed using the SET_DATABASE_PREFS Procedure, SET_GLOBAL_PREFS Procedure, SET_SCHEMA_PREFS Procedure and SET_TABLE_PREFS Procedure.
`stattab`	User statistics table identifier describing where to save the current statistics
`statid`	Identifier (optional) to associate with these statistics within `stattab`
`statown`	Schema containing `stattab` (if different than `ownname`)
`degree`	Degree of parallelism. The default for `degree` is `NULL`. The default value can be changed using the SET_DATABASE_PREFS Procedure, SET_GLOBAL_PREFS Procedure, SET_SCHEMA_PREFS Procedure and SET_TABLE_PREFS Procedure. `NULL` means use of table default value that was specified by the `DEGREE` clause in the `CREATE/ALTER INDEX` statement. Use the constant `DBMS_STATS.DEFAULT_DEGREE` for the default value based on the initialization parameters. The AUTO_DEGREE value determines the degree of parallelism automatically. This is either 1 (serial execution) or `DEFAULT_DEGREE (the system default value based on number of CPUs and initialization parameters) according to size of the object`.
`granularity`	Granularity of statistics to collect (only pertinent if the table is partitioned). `'ALL'` - gathers all (subpartition, partition, and global) statistics `'AUTO'`- determines the granularity based on the partitioning type. This is the default value. `'DEFAULT'` - gathers global and partition-level statistics. This option is obsolete, and while currently supported, it is included in the documentation for legacy reasons only. You should use the '`GLOBAL AND PARTITION`' for this functionality. Note that the default value is now '`AUTO`'. `'GLOBAL'` - gathers global statistics '`GLOBAL AND PARTITION`' - gathers the global and partition level statistics. No subpartition level statistics are gathered even if it is a composite partitioned object. `'PARTITION` '- gathers partition-level statistics `'SUBPARTITION'` - gathers subpartition-level statistics.
`no_invalidate`	Does not invalidate the dependent cursors if set to `TRUE`. The procedure invalidates the dependent cursors immediately if set to `FALSE`. Use `DBMS_STATS`.`AUTO_INVALIDATE`. to have Oracle decide when to invalidate dependent cursors. This is the default. The default can be changed using the SET_DATABASE_PREFS Procedure, SET_GLOBAL_PREFS Procedure, SET_SCHEMA_PREFS Procedure and SET_TABLE_PREFS Procedure.
`force`	Gather statistics on object even if it is locked

上面讨论了三个常用的存储过程。分析对CBO 来说非常重要，如果不能按照自己的系统指定出切合实际的数据分析方案，可能会导致如下问题的发生：

（1）分析信息不充分导致CBO 产生错误的执行计划，导致SQL执行效率低下。

（2）过多的分析工具带来系统性能的严重下降。

3.2 DBMS_STATS包管理功能

3.2.1 获取分析数据

GET_COLUMN_STATS Procedures
GET_INDEX_STATS Procedures
GET_SYSTEM_STATS Procedure
GET_TABLE_STATS Procedure

这四个存储过程分别为用户获取字段，索引，表和系统的统计信息。它的用法是首先定义要获取性能指标的变量，然后使用存储过程将性能指标的值赋给变量，最后将变量的值输出。如：

SQL> set serveroutput on

SQL> declare

2 dist number;

3 dens number;

4 ncnt number;

5 orec dbms_stats.statrec;

6 avgc number;

7 begin

8 dbms_stats.get_column_stats('SYS','T','object_ID',distcnt=>dist,density=>dens,nullcnt=>ncnt,srec=>orec,avgclen=>avgc);

9 dbms_output.put_line('the distcnt is:' ||to_char(dist));

10 dbms_output.put_line('the density is:' ||to_char(dens));

11 dbms_output.put_line('the nullcnt is:' ||to_char(ncnt));

12 dbms_output.put_line('the srec is:' ||to_char(ncnt));

13 dbms_output.put_line('the avgclen is:' ||to_char(avgc));

14 end;

15 /

the distcnt is:72926

the density is:.0000137125305103804

the nullcnt is:0

the srec is:0

the avgclen is:5

PL/SQL 过程已成功完成。

3.2.2 设置分析数据

SET_COLUMN_STATS Procedures
SET_INDEX_STATS Procedures
SET_SYSTEM_STATS Procedure
SET_TABLE_STATS Procedure

这几个存储过程允许我们手工地为字段，索引，表和系统性能数据赋值。它的一个用处是当相应的指标不准确导致执行计划失败时，可以使用这种方法手工地来为这些性能数据赋值。在极端情况下，这也不失为一个解决问题的方法。

关于这4个存储过程的绝提用法参考 oracle 联机文档：

http://download.oracle.com/docs/cd/B19306_01/appdev.102/b14258/d_stats.htm#i1036461

3.2.3 删除分析数据

DELETE_COLUMN_STATS Procedure
DELETE_DATABASE_STATS Procedure
DELETE_DICTIONARY_STATS Procedure
DELETE_FIXED_OBJECTS_STATS Procedure
DELETE_INDEX_STATS Procedure
DELETE_SCHEMA_STATS Procedure
DELETE_SYSTEM_STATS Procedure
DELETE_TABLE_STATS Procedure

当性能数据出现异常导致CBO判断错误时，为了立刻修正这个错误，删除性能数据也是一种补救的方法，比如删除了表的数据，让CBO重新对表做动态采样分析，得到一个正确的结果。

它可以删除字段，数据库，数据字典，基表，索引，表等级别的性能数据。

具体参考oracle 联机文档：

http://download.oracle.com/docs/cd/B19306_01/appdev.102/b14258/d_stats.htm#i1036461

3.2.4 保存分析数据

CREATE_STAT_TABLE Procedure
DROP_STAT_TABLE Procedure

可以用这两个存储过程创建一个表，用于存放性能数据，这样有利于对性能数据的管理，也可以删除这个表。

具体参考oracle 联机文档：

http://download.oracle.com/docs/cd/B19306_01/appdev.102/b14258/d_stats.htm#i1036461

3.2.5 导入和导出分析数据

EXPORT_COLUMN_STATS Procedure
EXPORT_DATABASE_STATS Procedure
EXPORT_DICTIONARY_STATS Procedure
EXPORT_FIXED_OBJECTS_STATS Procedure
EXPORT_INDEX_STATS Procedure
EXPORT_SCHEMA_STATS Procedure
EXPORT_SYSTEM_STATS Procedure
EXPORT_TABLE_STATS Procedure

IMPORT_COLUMN_STATS Procedure
IMPORT_DATABASE_STATS Procedure
IMPORT_DICTIONARY_STATS Procedure
IMPORT_FIXED_OBJECTS_STATS Procedure
IMPORT_INDEX_STATS Procedure
IMPORT_SCHEMA_STATS Procedure
IMPORT_SYSTEM_STATS Procedure
IMPORT_TABLE_STATS Procedure

这些存储过程可以将已经有的性能指标导入到用户创建好的表中存放，需要时，可以从表中倒回来。

具体参考oracle 联机文档：

http://download.oracle.com/docs/cd/B19306_01/appdev.102/b14258/d_stats.htm#i1036461

3.2.6 锁定分析数据

LOCK_SCHEMA_STATS Procedure
LOCK_TABLE_STATS Procedure

UNLOCK_SCHEMA_STATS Procedure
UNLOCK_TABLE_STATS Procedure

The LOCK_* procedures either freeze the current set of the statistics or to keep the statistics empty (uncollected).When statistics on a table are locked, all the statistics depending on the table, including table statistics, column statistics, histograms and statistics on all dependent indexes, are considered to be locked.

可能在某些时候，我们觉得当前的统计信息非常好，执行计划很准确，并且表中数据几乎不变化，那么可以使用LOCK_TABLE_STATS Procedure 来锁定表的统计信息，不允许对表做分析或者设定分析数据。当表的分析数据被锁定之后，相关的所有分析数据，包括表级，列级，直方图，索引的分析数据都将被锁定，不允许被更新。

具体参考oracle 联机文档：

http://download.oracle.com/docs/cd/B19306_01/appdev.102/b14258/d_stats.htm#i1036461

3.2.7 分析数据的恢复

RESET_PARAM_DEFAULTS Procedure
RESTORE_DICTIONARY_STATS Procedure
RESTORE_FIXED_OBJECTS_STATS Procedure
RESTORE_SCHEMA_STATS Procedure
RESTORE_SYSTEM_STATS Procedure
RESTORE_TABLE_STATS Procedure

Whenever statistics in dictionary are modified, old versions of statistics are saved automatically for future restoring. The old statistics are purged automatically at regular intervals based on the statistics history retention setting and the time of recent statistics gathering performed in the system. Retention is configurable using the ALTER_STATS_HISTORY_RETENTION Procedure.

比如我们重新分析了表，发现分析的数据导致了CBO选择了错误的执行计划，为了挽救这种局面，可以将统计信息恢复到从前的那个时间点，也就是CBO执行计划正确的时间点，先解决这个问题，再来分析问题的原因。

具体参考oracle 联机文档：

http://download.oracle.com/docs/cd/B19306_01/appdev.102/b14258/d_stats.htm#i1036461

四．动态采样

4.1 什么是动态采样

动态采样（Dynamic Sampling）技术的最初提出是在Oracle 9i R2，在段（表，索引，分区）没有分析的情况下，为了使CBO 优化器得到足够的信息以保证做出正确的执行计划而发明的一种技术，可以把它看做分析手段的一种补充。

当段对象没有统计信息时（即没有做分析），动态采样技术可以通过直接从需要分析的对象上收集数据块（采样）来获得CBO需要的统计信息。

一个简单的例子：

创建表：

SQL> create table t

2 as

3 select owner,object_type from all_objects;

表已创建。

查看表的记录数：

SQL> select count(*) from t;

COUNT(*)

----------

72236 -- 记录数

这里创建了一张普通表，没有做分析，我们在hint中用0级来限制动态采样，此时CBO 唯一可以使用的信息就是表存储在数据字典中的一些信息，如有多少个extent，有多少个block，但是这些信息是不够的。

SQL> set autot traceonly explain

SQL> select /*+dynamic_sampling(t 0) */ * from t;

执行计划

----------------------------------------------------------

Plan hash value: 1601196873

--------------------------------------------------------------------------

--------------------------------------------------------------------------

| 0 | SELECT STATEMENT | | 15928 | 435K| 55 (0)| 00:00:01 |

| 1 | TABLE ACCESS FULL| T | 15928 | 435K| 55 (0)| 00:00:01 |

在没有做动态分析的情况下，CBO 估计的记录数是15928条，与真实的72236 相差甚远。

我们用动态分析来查看一下：

SQL> select * from t;

执行计划

----------------------------------------------------------

Plan hash value: 1601196873

--------------------------------------------------------------------------

--------------------------------------------------------------------------

| 0 | SELECT STATEMENT | | 80232 | 2193K| 56 (2)| 00:00:01 |

| 1 | TABLE ACCESS FULL| T | 80232 | 2193K| 56 (2)| 00:00:01 |

--------------------------------------------------------------------------

Note

-----

- dynamic sampling used for this statement (level=2)

在Oracle 10g中默认对没有分析的段做动态采样，上面的查询结果显示使用了Level 2级的动态采样，CBO 估计的结果是80232 与72236 很接近了。

注意一点：

在没有动态采样的情况下，对于没有分析过的段，CBO也可能错误地将结果判断的程度扩大话。如：

SQL> delete from t;

已删除72236行。

SQL> commit;

提交完成。

SQL> select /*+dynamic_sampling(t 0) */ * from t;

执行计划

----------------------------------------------------------

Plan hash value: 1601196873

--------------------------------------------------------------------------

--------------------------------------------------------------------------

| 0 | SELECT STATEMENT | | 15928 | 435K| 55 (0)| 00:00:01 |

| 1 | TABLE ACCESS FULL| T | 15928 | 435K| 55 (0)| 00:00:01 |

--------------------------------------------------------------------------

SQL> select * from t;

执行计划

----------------------------------------------------------

Plan hash value: 1601196873

--------------------------------------------------------------------------

--------------------------------------------------------------------------

| 0 | SELECT STATEMENT | | 1 | 28 | 55 (0)| 00:00:01 |

| 1 | TABLE ACCESS FULL| T | 1 | 28 | 55 (0)| 00:00:01 |

--------------------------------------------------------------------------

Note

-----

- dynamic sampling used for this statement (level=2)

如果细心一点，可能看出2个执行计划的差别。在没有采用动态分析的情况下，CBO 对t表估计的还是15928行记录，但是用动态分析就显示1条记录。而表中的数据在查询之前已经删除掉了。出现这种情况的原因是因为高水位。虽然表的数据已经删除，但是表分配的extent 和block 没有被回收，所以在这种情况下CBO 依然认为有那么多的数据在那。

通过这一点，我们可以看出，此时CBO能够使用的信息非常有限，也就是这个表有几个extent，有几个block。但动态采样之后，Oracle 立即发现，原来数据块中都是空的。

关于Oracle 高水位，参考我的blog：Oracle 高水位(HWM)

http://blog.csdn.net/tianlesoftware/archive/2009/10/22/4707900.aspx

动态采样有两方面的作用：

（1） CBO 依赖的是充分的统计分析信息，但是并不是每个用户都会非常认真，及时地去对每个表做分析。为了保证执行计划都尽可能地正确，Oracle 需要使用动态采样技术来帮助CBO 获取尽可能多的信息。

（2）全局临时表。通常来讲，临时表的数据是不做分析的，因为它存放的数据是临时性的，可能很快就释放了，但是当一个查询关联到这样的临时表时，CBO要想获得临时表上的统计信息分析数据，就只能依赖于动态采样了。

动态采样除了可以在段对象没有分析时，给CBO提供分析数据之外，还有一个独特的能力，它可以对不同列之间的相关性做统计。

相对的，表分析的信息是独立的。如：

（1）表的行数，平均行长。

（2）表的每个列的最大值，最小值，重复率，也可能包含直方图。

（3）索引的聚合因子，索引叶的块数目，索引的高度等。

尽管看到动态采样的优点，但是它的缺点也是显而易见，否则Oracle 一定会一直使用动态采样来取代数据分析：

（1）采样的数据块有限，对于海量数据的表，结果难免有偏差。

（2）采样会消耗系统资源，特别是OLTP数据库，尤其不推荐使用动态采样。

4.2 动态采样的级别

Oracle 为动态采样划分了11个级别，在Oracle 的官网上详细的介绍。

13.5.7.4 Dynamic Sampling Levels

http://download.oracle.com/docs/cd/E11882_01/server.112/e10821/stats.htm#PFGRF94760

The sampling levels are as follows if the dynamic sampling level used is from a cursor hint or from the OPTIMIZER_DYNAMIC_SAMPLING initialization parameter:

Level 0: Do not use dynamic sampling.

Level 1: Sample all tables that have not been analyzed if the following criteria are met: (1) there is at least 1 unanalyzed table in the query; (2) this unanalyzed table is joined to another table or appears in a subquery or non-mergeable view; (3) this unanalyzed table has no indexes; (4) this unanalyzed table has more blocks than the number of blocks that would be used for dynamic sampling of this table. The number of blocks sampled is the default number of dynamic sampling blocks (32).

Level 2: Apply dynamic sampling to all unanalyzed tables. The number of blocks sampled is two times the default number of dynamic sampling blocks.

Level 3: Apply dynamic sampling to all tables that meet Level 2 criteria, plus all tables for which standard selectivity estimation used a guess for a predicate that is a potential dynamic sampling predicate. The number of blocks sampled is the default number of dynamic sampling blocks. For unanalyzed tables, the number of blocks sampled is twice the default number of dynamic sampling blocks.

Level 4: Apply dynamic sampling to all tables that meet Level 3 criteria, plus all tables that have single-table predicates that reference 2 or more columns. The number of blocks sampled is the default number of dynamic sampling blocks. For unanalyzed tables, the number of blocks sampled is two times the default number of dynamic sampling blocks.

Levels 5, 6, 7, 8, and 9: Apply dynamic sampling to all tables that meet the previous level criteria using 2, 4, 8, 32, or 128 times the default number of dynamic sampling blocks respectively.

Level 10: Apply dynamic sampling to all tables that meet the Level 9 criteria using all blocks in the table.

The sampling levels are as follows if the dynamic sampling level for a table is set using the DYNAMIC_SAMPLING optimizer hint:

Level 0: Do not use dynamic sampling.

Level 1: The number of blocks sampled is the default number of dynamic sampling blocks (32).

Levels 2, 3, 4, 5, 6, 7, 8, and 9: The number of blocks sampled is 2, 4, 8, 16, 32, 64, 128, or 256 times the default number of dynamic sampling blocks respectively.

Level 10: Read all blocks in the table.

4.2.1 Level 0

不做动态分析

4.2.2 Level 1

Oracle 对没有分析的表进行动态采样，但需要同时满足以下4个条件。

（1） SQL中至少有一个未分析的表

（2）未分析的表出现在关联查询或者子查询中

（3）未分析的表没有索引

（4）未分析的表占用的数据块要大于动态采样默认的数据块（32个）

4.2.3 Level 2

对所有的未分析表做分析，动态采样的数据块是默认数据块的2倍。

4.2.4 Level 3

采样的表包含满足Level 2定义的所有表，同时包括，那些谓词有可能潜在地需要动态采样的表，这些动态采样的数据块为默认数据块，对没有分析的表，动态采样的默认块为默认数据块的2倍。

4.2.5 Level 4

采样的表包含满足Level 3定义的表，同时还包括一些表，他们包含一个单表的谓词会引用另外的2个列或者更多的列；采样的块数是动态采样默认数据块数；对没有分析的表，动态采样的数据块为默认数据块的2倍。

4.2.6 Level 5，6，7，8，9

采样的表包含满足Level 4定义的表，同时分别使用动态采样默认数据块的2，4，8，32，128 倍的数量来做动态分析。

4.2.7 Level 10

采样的表包含满足Level 9定义的所有表，同时对表的所有数据进行动态采样。

采样的数据块越多，得到的分析数据就越接近与真实，但同时伴随着资源消耗的也越大。

4.3 什么时候使用动态采样

动态采样也需要额外的消耗数据库资源，所以，如果 SQL 被反复执行，变量被绑定，硬分析很少，在这样一个环境中，是不宜使用动态采样的，就像OLTP系统。动态采样发生在硬分析时，如果很少有硬分析发生，动态采样的意义就不大。

而在OLAP或者数据仓库环境下，SQL执行消耗的资源要远远大于SQL解析，那么让解析在消耗多一点资源做一些动态采样分析，从而做出一个最优的执行计划是非常值得的。实际上在这样的环境中，硬分析消耗的资源几乎是可以忽略的。

所以，一般在OLAP 或者数据仓库环境中，将动态采样的level 设置为3或者4 比较好。相反，在OLTP系统下，不应该使用动态采样。

整理自《让Oracle 跑的更快》

------------------------------------------------------------------------------

Blog： http://blog.csdn.net/tianlesoftware

网上资源： http://tianlesoftware.download.csdn.net

相关视频：http://blog.csdn.net/tianlesoftware/archive/2009/11/27/4886500.aspx

DBA1 群：62697716(满); DBA2 群：62697977(满)

DBA3 群：63306533; 聊天群：40132017

你可能感兴趣的:(oracle)

SYSAUX表空间WRH$_ACTIVE_SESSION_HISTORY占用空间过大的清理办法 jcsx 数据库 oracle
SYSAUX表空间WRH$_ACTIVE_SESSION_HISTORY占用空间过大的清理办法一、查看@$ORACLE_HOME/rdbms/admin/awrinfo.sql一般是truncate旧分区。查看snapshotsqlplus/assysdbasetlinesize1000;setpagesize200;colbegin_interval_timeformata30;colend_i
Python 数据库自动化操作指南老胖闲聊 Python python 数据库自动化
本指南详细讲解如何使用Python操作MySQL、Oracle和MicrosoftSQLServer数据库，涵盖常用库、基础操作、高级功能及完整代码示例。目录MySQL操作详解Oracle操作详解MicrosoftSQLServer操作详解通用注意事项一、MySQL操作详解1.常用库mysql-connector-python（官方驱动）安装：pipinstallmysql-connector-p
Oracle建表 java-王森笔记
今天要介绍的是在oracle中建表。先用管理员权限创建一个表空间：createtablespacehellen_spacedatafile‘/opt/oracle/oradata/orcl/hellen_space01.dbf’size20m;查看创建的表空间：[oracle@mopheeorcl]$cd/opt/oracle/oradata/orcl/[oracle@mopheeorcl]$ls
IvorySQL 初始化（initdb）过程深度解析 IvorySQL IvorySQL postgresql 数据库
作为一款深度兼容Oracle的开源数据库，IvorySQL在初始化阶段通过多模式架构设计，实现从底层到应用层的灵活兼容。以下是其核心流程的拆解：一、初始化模式：PG与Oracle的“双面基因”1.模式选择与参数设计通过initdb命令的-m参数，用户可指定数据库的初始兼容模式：#初始化Oracle兼容模式（默认）./initdb-D/data-moracle#初始化PostgreSQL原生模式./
NoSQL 数据库的应用场景与挑战无界探索数据库 nosql
```htmlNoSQL数据库的应用场景与挑战随着互联网的快速发展，数据量呈爆炸式增长，传统的关系型数据库（如MySQL、Oracle等）在处理大规模数据时遇到了瓶颈。NoSQL数据库应运而生，它以其灵活的数据模型和强大的可扩展性，满足了现代应用对大数据存储和处理的需求。应用场景高并发读写场景：NoSQL数据库通过分布式架构设计，能够轻松应对高并发读写请求。例如，在电商网站中，用户浏览商品、下单购
Oracle数据库数据编程SQL＜2.2 DDL 视图、序列＞ Tyler先森 Oracle 数据库 oracle sql
目录一、Oracle视图(Views)（一）Oracle视图特点（二）Oracle视图创建语法关键参数：（三）Oracle视图类型1、普通视图2、连接视图（可更新）3、对象视图4、物化视图（MaterializedViews）（四）Oracle视图数据字典（五）Oracle可更新视图规则（六）视图的优缺点1、视图的优点：2、视图的缺点：3、视图和表的区别二、Oracle序列(Sequences)（
Oracle数据库数据编程SQL＜2.3 DML增、删、改及merge into＞ Tyler先森 Oracle 数据库 oracle sql
目录一、DML数据操纵语言（AateManipulationLanguage)二、【insert】插入数据1、单行插入2、批量插入3、将数据同时插入到多张表insertall/insertfirst三、【update】更新数据1、语法2、举例3、update使用注意事项：四、【delete】删除数据---多用于删除特定数据1、语法2、deletefrom表不加条件则删除全部数据五、delete和t
Oracle数据库数据编程SQL＜2.1 DDL、DCL表、列及约束＞ Tyler先森 Oracle 数据库 oracle sql
目录一、对表的操作（一）复制表1、语法2、练习3、仅复制表格式--在where后加一个不成立的条件（二）自建表1、数据类型（1）字符类型：char2、varchar/varchar2char（数）固定长度类型varchar/varchar2（数）可变长度类型（2）数值类型：number、intnumber（数1，数2）int（数）（3）日期类型：date、timestampdate不用加长度tim
Oracle数据库数据编程SQL＜1.4 表连接、子查询＞ Tyler先森 Oracle sql 数据库大数据 oracle
目录一、表连接（一）内连接innerjoin，等值连接（二）外连接outerjoin，等值连接1、左外连接left{outer}join2、右外连接right{outer}join3、全外连接full{outer}join（三）不等值连接（四）自连接（五）用where的方式进行表连接1、显示两张表共有的部分，没有(+)加号是内连接（innerjoin）2、显示左表全部的信息，(+)加号在等号右边是
阿里开源的免费数据集成工具——DataX 遇码大数据开源 datax 数据集成大数据 seatunnel kettle flinkcdc
企业里真实的数据流转是什么样子的呢？左侧描述了一个企业真实的样子，我们总是需要把数据从一个地方搬到另一个地方，最后就是搬来搬去搬成了一张张解不开的网。右侧则表达了使用DataX为中心实现数据的同步。什么是DataXDataX是一个异构数据源离线同步工具，致力于实现包括关系型数据库(MySQL、Oracle等)、HDFS、Hive、ODPS、HBase、FTP等各种异构数据源之间稳定高效的数据同步功
GaussDB与传统关系型数据库Oracle在架构设计和应用场景上的核心差异笑远数据库 gaussdb oracle
理解GaussDB与传统关系型数据库Oracle在架构设计和应用场景上的核心差异，对于企业选择合适的数据库解决方案至关重要。以下将从多个维度深入解析两者的主要区别，以帮助您全面了解它们在现代数据管理中的定位和优势。1.架构设计上的核心差异1.1分布式架构vs.单体架构GaussDB：分布式架构：GaussDB（以华为GaussDB为例）采用分布式架构，能够横向扩展以处理海量数据和高并发请求。其设计
IvorySQL 初始化（initdb）过程深度解析数据库
作为一款深度兼容Oracle的开源数据库，IvorySQL在初始化阶段通过多模式架构设计，实现从底层到应用层的灵活兼容。以下是其核心流程的拆解：一、初始化模式：PG与Oracle的“双面基因”1.模式选择与参数设计通过initdb命令的-m参数，用户可指定数据库的初始兼容模式：#初始化Oracle兼容模式（默认）./initdb-D/data-moracle#初始化PostgreSQL原生模式./
【赵渝强老师】Oracle数据库的闪回查询数据库oracle
Oracle数据库的闪回查询（FlashbackQuery）是对查询语句select的扩展，它会从还原数据中提取所需要的历史数据以反映数据在历史的某个时间段上的状态。视频讲解如下：https://www.bilibili.com/video/BV1raDGYTEDQ/?aid=113434884575...一、闪回查询简介使用闪回查询可以用于查询在特定时间点存在的所有历史数据。使用闪回查询功能，可
ORACLE创建用户给予权限刘寰运营 oracle 数据库 mysql
–CreatetheusercreateuserMKJK--创建用户identifiedby“”;----密码–Grant/RevokeobjectprivilegesgrantselectonHISDB.EXAM_TA_BILLtoMKJK;grantselectonHISDB.EXAM_TA_BOOKtoMKJK;grantselectonHISDB.EXAM_TA_REPtoMKJK;gra
基于oracle linux的 DBI/DBD 标准化安装文档(二) oracle
一、安装DBIDBI(DatabaseInterface)是perl连接数据库的接口。其是perl连接数据库的最优方法，他支持包括Orcale,Sybase,mysql,db2等绝大多数的数据库，下面将简要介绍其安装方法。1.1解压tar-zxvfDBI-1.616_901.tar.gz1.2安装依赖yuminstallperl-ExtUtils-CBuilderperl-ExtUtils-Mak
java：关于 Java 技术 Katie。 Java 实战项目 java 开发语言
Java技术详解一、前言Java作为一种跨平台、面向对象的编程语言，自1995年由SunMicrosystems（后被Oracle收购）推出以来，便以其简单易学、稳定安全和高性能等优点风靡全球。经过二十余年的不断发展，Java已经成为企业级应用开发、移动互联网、分布式系统、大数据以及云计算等多个领域的主流技术之一。本文将对Java技术进行全面而深入的介绍，从语言基本语法到高级特性，从JVM架构到企
基于oracle linux的 DBI/DBD 标准化安装文档(二) oracle
一、安装DBIDBI(DatabaseInterface)是perl连接数据库的接口。其是perl连接数据库的最优方法，他支持包括Orcale,Sybase,mysql,db2等绝大多数的数据库，下面将简要介绍其安装方法。1.1解压tar-zxvfDBI-1.616_901.tar.gz1.2安装依赖yuminstallperl-ExtUtils-CBuilderperl-ExtUtils-Mak
oracle密码过期 ORA-28001: the password has expired 程序羊喜羊羊 oracle oracle 数据库
oracle数据库默认profile的密码有效期规则是default，有效期为180天，到期之后的密码就不能使用了，可以通过修改密码有效期或者修改密码后再次使用。1.使用sqlplus连接数据库（或者使用navicat等可视化工具通过管理用户连接）sqlplus"/assysdba"2.查看用户的proifle是哪个，一般是defaultSELECTusername,PROFILEFROMdba_
Oracle ORA-28001: the password has expired解决办法 idomyway Oracle oracle ora 28001 expired
前言Oracle提示错误消息ORA-28001:thepasswordhasexpired，是由于Oracle11G的新特性所致，Oracle11G创建用户时缺省密码过期限制是180天（即6个月），如果超过180天用户密码未做修改则该用户无法登录。解决方法1、修改方法ALTERUSER用户名IDENTIFIEDBY密码;修改密码后，会发现该账户会被锁定，这时需要通过如下SQL语句进行解锁：ALTE
思庄oracle技术分享-ORA-28001: the password has expired duanweifang oracle数据库 oracle 数据库
问题描述：trace文件中发现存在ora-28001告警，如下所示：数据库：oracle11.2.0.464位MonOct1704:26:022022Errorsinfiled:\app\administrator\diag\rdbms\orcl\orcl\trace\orcl_ora_1228.trc(incident=169673):ORA-00600:internalerrorcode,ar
ORA-28001: the password has expired解决办法飞奔的yah 数据库
登录数据库一、window登录oracle1）打开cmd,输入：sqlplus/nolog输入：connusername/passworld@数据库名称2）当然还有其他的方式：sql>conn/assyddba;即可登录oracle超级管理员用户（不需要用户和密码）。sql>connusername/password;通过输入用户名和密码的形式可以登录到普通用户。sql>connusername/
【赵渝强老师】Oracle数据库的闪回技术数据库oracle
在Oracle数据库的操作过程中，会不可避免地出现操作失误或者用户失误，例如不小心删除了一个表或者提交了一个错误的事务等。这些失误和错误可能会造成重要数据的丢失，最终导致Oracle数据库停止。在传统意义上，当发生数据丢失、数据错误问题时，解决的主要办法是数据的导入导出或者使用备份恢复技术。但是这些方法都需要在发生错误前，有一个正确的备份才能进行恢复。为了减少这方面的损失，Oracle提供了闪回技
基于oracle linux的 DBI/DBD 标准化安装文档(三) linux
一、安装DBIDBI(DatabaseInterface)是perl连接数据库的接口。其是perl连接数据库的最优方法，他支持包括Orcale,Sybase,mysql,db2等绝大多数的数据库，下面将简要介绍其安装方法。1.1解压tar-zxvfDBI-1.616_901.tar.gz1.2安装依赖yuminstallperl-ExtUtils-CBuilderperl-ExtUtils-Mak
oracle19c容器数据库,Oracle EBS支持19c容器数据库张兴艺 oracle19c容器数据库
OracleEBS的官方blog，有OracleEBS团队负责更新和维护。记录了OracleEBS的相关特性、认证信息、发布日期等信息等，https://blogs.oracle.com/ebstech/ebs-resourcesEBS发布日期Release12.012.112.2FirstRelease12.0.1(4/2007)12.1.1(4/2009)12.2.2(9/2013)Relea
OEL5.8 x64 安装oracle数据库环境配置脚本 weixin_33972649 数据库 awk
平时要搭建大量的oracle的测试环境，重复多了也感觉的到麻烦了，干脆整个脚本来创建安装oracle之前的一些环境变量等相关配置，提高安装oracle10g效率，也可以稍改改用于11g的安装前环境配置，整理自用。本文出自:http://koumm.blog.51cto.com本文适用环境：RHEL/CentOS/OEL5.8X64安装过程中选中图形界面，开发包，开发库，老的软件开发包等。脚本如下：
redhat安装oracle 12.0.1 我命由我liu 数据库
1.关闭服务并disableSelinuxNetworkManagerFirewall2.配置yum源[oracle@oracle12c-70~]$[oracle@oracle12c-70~]$cat/etc/yum.repos.d/local.repo[local-yum]name=CentOS-$releasever-Mediabaseurl=http://yum.cloud1.sip.sh.
《深度剖析：MySQL、Oracle、SQL Server分页语法大揭秘》人工智能
在数据处理的庞大版图中，分页查询宛如一座桥梁，连接着海量数据与高效数据展示的彼岸。无论是搭建面向用户的应用程序，还是构建复杂的数据管理系统，分页查询都扮演着不可或缺的角色。对于开发者而言，熟练驾驭不同数据库的分页语法，不仅是技术能力的体现，更是在实际项目中优化数据处理效率的关键。今天，就让我们一同深入MySQL、Oracle、SQLServer这三大主流数据库的分页世界，探索它们独特的分页之道。M
Java：企业级开发的王者 java
1.1Java简介Java由SunMicrosystems（现属Oracle）于1995年推出，是一种面向对象、跨平台的编程语言。凭借"WriteOnce,RunAnywhere"（一次编写，到处运行）的理念，Java成为企业级开发的首选语言。Java的核心优势✔跨平台性（JVM实现）✔强大的生态系统（Spring、Hibernate等框架）✔内存自动管理（GC垃圾回收）✔高并发支持（多线程、NI
MySQL学习所念皆成. JAVA WEB mysql 数据库学习
MySQL一、MySQL数据库相关概念1.1什么是MySQL?MySQL是一个关系型数据库管理系统，由瑞典MySQLAB公司开发，目前属于Oracle旗下产品。MySQL是最流行的关系型数据库管理系统之一，在WEB应用方面，MySQL是最好的RDBMS(RelationalDatabaseManagementSystem，关系数据库管理系统)应用软件之一。1.2MySQL的优点？数据库体积小、速度
Oracle AI应用的LLM模型典型配置后端
最近在做一些基于Oracle的一些AI应用测试工作，AI肯定离不开配置LLM相关，虽然是简单配置类，但实际还是遇到一些卡点，记录下来供今后参考。1.配置Embedding模型2.特殊语法传参JSON格式3.测试Embedding有效4.修改MAX_STRING_SIZE5.配置为DeepSeek的LLM6.测试Chat和Showsql有效m.ximalaya.com/sound/825946205
辗转相处求最大公约数沐刃青蛟 C++漏洞
无言面对”江东父老“了，接触编程一年了，今天发现还不会辗转相除法求最大公约数。惭愧惭愧！为此，总结一下以方便日后忘了好查找。 1.输入要比较的两个数a,b 忽略：2.比较大小（因为后面要的是大的数对小的数做%操作） 3.辗转相除（用循环不停的取余，如a%b,直至b=0） 4.最后的a为两数的最大公约数 &
F5负载均衡会话保持技术及原理技术白皮书 bijian1013 F5 负载均衡
一.什么是会话保持？在大多数电子商务的应用系统或者需要进行用户身份认证的在线系统中，一个客户与服务器经常经过好几次的交互过程才能完成一笔交易或者是一个请求的完成。由于这几次交互过程是密切相关的，服务器在进行这些交互过程的某一个交互步骤时，往往需要了解上一次交互过程的处理结果，或者上几步的交互过程结果，服务器进行下
Object.equals方法：重载还是覆盖 Cwind java generics override overload
本文译自StackOverflow上对此问题的讨论。原问题链接在阅读Joshua Bloch的《Effective Java（第二版）》第8条“覆盖equals时请遵守通用约定”时对如下论述有疑问： “不要将equals声明中的Object对象替换为其他的类型。程序员编写出下面这样的equals方法并不鲜见，这会使程序员花上数个小时都搞不清它为什么不能正常工作：” pu
初始线程 15700786134
暑假学习的第一课是讲线程，任务是是界面上的一条线运动起来。既然是在界面上，那必定得先有一个界面，所以第一步就是，自己的类继承JAVA中的JFrame，在新建的类中写一个界面，代码如下： public class ShapeFr
Linux的tcpdump 被触发 tcpdump
用简单的话来定义tcpdump，就是：dump the traffic on a network，根据使用者的定义对网络上的数据包进行截获的包分析工具。 tcpdump可以将网络中传送的数据包的“头”完全截获下来提供分析。它支持针对网络层、协议、主机、网络或端口的过滤，并提供and、or、not等逻辑语句来帮助你去掉无用的信息。实用命令实例默认启动 tcpdump 普通情况下，直
安卓程序listview优化后还是卡顿肆无忌惮_ ListView
最近用eclipse开发一个安卓app，listview使用baseadapter，里面有一个ImageView和两个TextView。使用了Holder内部类进行优化了还是很卡顿。后来发现是图片资源的问题。把一张分辨率高的图片放在了drawable-mdpi文件夹下，当我在每个item中显示，他都要进行缩放，导致很卡顿。解决办法是把这个高分辨率图片放到drawable-xxhdpi下。 &nb
扩展easyUI tab控件，添加加载遮罩效果知了ing jquery
(function () { $.extend($.fn.tabs.methods, { //显示遮罩 loading: function (jq, msg) { return jq.each(function () { var panel = $(this).tabs(&
gradle上传jar到nexus 矮蛋蛋 gradle
原文地址： https://docs.gradle.org/current/userguide/maven_plugin.html configurations { deployerJars } dependencies { deployerJars "org.apache.maven.wagon
千万条数据外网导入数据库的解决方案。 alleni123 sql mysql
从某网上爬了数千万的数据，存在文本中。然后要导入mysql数据库。悲剧的是数据库和我存数据的服务器不在一个内网里面。。 ping了一下， 19ms的延迟。于是下面的代码是没用的。 ps = con.prepareStatement(sql); ps.setString(1, info.getYear())............; ps.exec
JAVA IO InputStreamReader和OutputStreamReader 百合不是茶 JAVA.io操作字符流
这是第三篇关于java.io的文章了，从开始对io的不了解-->熟悉--->模糊，是这几天来对文件操作中最大的感受，本来自己认为的熟悉了的，刚刚在回想起前面学的好像又不是很清晰了，模糊对我现在或许是最好的鼓励我会更加的去学加油！： JAVA的API提供了另外一种数据保存途径，使用字符流来保存的，字符流只能保存字符形式的流字节流和字符的难点：a,怎么将读到的数据
MO、MT解读 bijian1013 GSM
MO= Mobile originate，上行，即用户上发给SP的信息。MT= Mobile Terminate，下行，即SP端下发给用户的信息；上行:mo提交短信到短信中心下行:mt短信中心向特定的用户转发短信，你的短信是这样的，你所提交的短信，投递的地址是短信中心。短信中心收到你的短信后，存储转发，转发的时候就会根据你填写的接收方号码寻找路由，下发。在彩信领域是一样的道理。下行业务：由SP
五个JavaScript基础问题 bijian1013 JavaScript call apply this Hoisting
下面是五个关于前端相关的基础问题，但却很能体现JavaScript的基本功底。问题1：Scope作用范围考虑下面的代码： (function() { var a = b = 5; })(); console.log(b); 什么会被打印在控制台上？回答：上面的代码会打印 5。 &nbs
【Thrift二】Thrift Hello World bit1129 Hello world
本篇，不考虑细节问题和为什么，先照葫芦画瓢写一个Thrift版本的Hello World，了解Thrift RPC服务开发的基本流程 1. 在Intellij中创建一个Maven模块，加入对Thrift的依赖，同时还要加上slf4j依赖，如果不加slf4j依赖，在后面启动Thrift Server时会报错 <dependency>
【Avro一】Avro入门 bit1129 入门
本文的目的主要是总结下基于Avro Schema代码生成，然后进行序列化和反序列化开发的基本流程。需要指出的是，Avro并不要求一定得根据Schema文件生成代码，这对于动态类型语言很有用。 1. 添加Maven依赖 <?xml version="1.0" encoding="UTF-8"?> <proj
安装nginx+ngx_lua支持WAF防护功能 ronin47
需要的软件:LuaJIT-2.0.0.tar.gz nginx-1.4.4.tar.gz &nb
java-5.查找最小的K个元素-使用最大堆 bylijinnan java
import java.util.Arrays; import java.util.Random; public class MinKElement { /** * 5.最小的K个元素 * I would like to use MaxHeap. * using QuickSort is also OK */ public static void
TCP的TIME-WAIT bylijinnan socket
原文连接： http://vincent.bernat.im/en/blog/2014-tcp-time-wait-state-linux.html 以下为对原文的阅读笔记说明：主动关闭的一方称为local end，被动关闭的一方称为remote end 本地IP、本地端口、远端IP、远端端口这一“四元组”称为quadruplet，也称为socket 1、TIME_WA
jquery ajax 序列化表单 coder_xpf Jquery ajax 序列化
checkbox 如果不设定值，默认选中值为on；设定值之后，选中则为设定的值 <input type="checkbox" name="favor" id="favor" checked="checked"/> $("#favor&quo
Apache集群乱码和最高并发控制 cuisuqiang apache tomcat 并发集群乱码
都知道如果使用Http访问，那么在Connector中增加URIEncoding即可，其实使用AJP时也一样，增加useBodyEncodingForURI和URIEncoding即可。最大连接数也是一样的，增加maxThreads属性即可，如下，配置如下： <Connector maxThreads="300" port="8019" prot
websocket dalan_123 websocket
一、低延迟的客户端-服务器和服务器-客户端的连接很多时候所谓的http的请求、响应的模式，都是客户端加载一个网页，直到用户在进行下一次点击的时候，什么都不会发生。并且所有的http的通信都是客户端控制的，这时候就需要用户的互动或定期轮训的，以便从服务器端加载新的数据。通常采用的技术比如推送和comet（使用http长连接、无需安装浏览器安装插件的两种方式：基于ajax的长
菜鸟分析网络执法官 dcj3sjt126com 网络
最近在论坛上看到很多贴子在讨论网络执法官的问题。菜鸟我正好知道这回事情.人道"人之患好为人师" 手里忍不住,就写点东西吧. 我也很忙.又没有MM,又没有MONEY....晕倒有点跑题. OK,闲话少说,切如正题. 要了解网络执法官的原理. 就要先了解局域网的通信的原理. 前面我们看到了.在以太网上传输的都是具有以太网头的数据包.
Android相对布局属性全集 dcj3sjt126com android
RelativeLayout布局android:layout_marginTop="25dip" //顶部距离android:gravity="left" //空间布局位置android:layout_marginLeft="15dip //距离左边距 // 相对于给定ID控件android:layout_above 将该控件的底部置于给定ID的
Tomcat内存设置详解 eksliang jvm tomcat tomcat内存设置
Java内存溢出详解一、常见的Java内存溢出有以下三种： 1. java.lang.OutOfMemoryError: Java heap space ----JVM Heap（堆）溢出JVM在启动的时候会自动设置JVM Heap的值，其初始空间(即-Xms)是物理内存的1/64，最大空间(-Xmx)不可超过物理内存。可以利用JVM提
Java6 JVM参数选项 greatwqs java HotSpot jvm jvm参数 JVM Options
Java 6 JVM参数选项大全（中文版）作者：Ken Wu Email: ken.wug@gmail.com 转载本文档请注明原文链接 http://kenwublog.com/docs/java6-jvm-options-chinese-edition.htm！本文是基于最新的SUN官方文档Java SE 6 Hotspot VM Opt
weblogic创建JMC i5land weblogic jms
进入 weblogic控制太 1.创建持久化存储 --Services--Persistant Stores--new--Create FileStores--name随便起--target默认--Directory写入在本机建立的文件夹的路径--ok 2.创建JMS服务器 --Services--Messaging--JMS Servers--new--name随便起--Pers
基于 DHT 网络的磁力链接和BT种子的搜索引擎架构 justjavac DHT
上周开发了一个磁力链接和 BT 种子的搜索引擎 {Magnet & Torrent}，本文简单介绍一下主要的系统功能和用到的技术。系统包括几个独立的部分：使用 Python 的 Scrapy 框架开发的网络爬虫，用来爬取磁力链接和种子；使用 PHP CI 框架开发的简易网站；搜索引擎目前直接使用的 MySQL，将来可以考虑使
sql添加、删除表中的列 macroli sql
添加没有默认值：alter table Test add BazaarType char(1) 有默认值的添加列：alter table Test add BazaarType char(1) default(0) 删除没有默认值的列：alter table Test drop COLUMN BazaarType 删除有默认值的列：先删除约束（默认值）alter table Test DRO
PHP中二维数组的排序方法 abc123456789cba 排序二维数组 PHP
<?php/*** @package BugFree* @version $Id: FunctionsMain.inc.php,v 1.32 2005/09/24 11:38:37 wwccss Exp $*** Sort an two-dimension array by some level
hive优化之------控制hive任务中的map数和reduce数 superlxw1234 hive hive优化
一、控制hive任务中的map数: 1. 通常情况下，作业会通过input的目录产生一个或者多个map任务。主要的决定因素有： input的文件总个数，input的文件大小，集群设置的文件块大小(目前为128M, 可在hive中通过set dfs.block.size;命令查看到，该参数不能自定义修改)；2.
Spring Boot 1.2.4 发布 wiselyman spring boot
Spring Boot 1.2.4已于6.4日发布，repo.spring.io and Maven Central可以下载(推荐使用maven或者gradle构建下载)。这是一个维护版本，包含了一些修复small number of fixes,建议所有的用户升级。 Spring Boot 1.3的第一个里程碑版本将在几天后发布，包含许多