J.Kuchiki

【OpenGauss源码学习 —— 列存储（创建表）】

列存储

什么是列存储？
语法实现
- 语法格式
- 参数说明
- 示例
- 源码分析（创建表）
- - 语法层（Gram.y）
  - 子模块（utility.cpp）
总结

声明：本文的部分内容参考了他人的文章。在编写过程中，我们尊重他人的知识产权和学术成果，力求遵循合理使用原则，并在适用的情况下注明引用来源。
本文主要参考了 OpenGauss1.1.0 的开源代码和《OpenGauss数据库源码解析》一书以及OpenGauss社区学习文档

什么是列存储？

列存储是一种优化技术，用于在数据库系统中存储和查询大量数据。与传统的行存储方式不同，列存储将每个列的数据分别存储在独立的存储单元中，而不是按照行的方式存储。这种存储方式在分析性查询、聚合操作和大规模数据处理等场景下具有很大的优势。
行、列存储模型各有优劣，建议根据实际情况选择。通常openGauss用于OLTP（联机事务处理）场景的数据库，默认使用行存储，仅对执行复杂查询且数据量大的OLAP（联机分析处理）场景时，才使用列存储。默认情况下，创建的表为行存储。行存储和列存储的差异如下图所示：

上图中，左上为行存表，右上为行存表在硬盘上的存储方式。左下为列存表，右下为列存表在硬盘上的存储方式。

列存储的特点和优势：

压缩效率高：由于相同类型的数据在列中是连续存储的，可以采用更加高效的压缩算法，从而减少存储空间的使用。

数据读取效率高：在查询中只加载需要的列，减少了不必要的数据传输，提高了查询效率。

聚合操作效率高：在列存储中，同一列的数据相邻存储，这样在进行聚合操作时只需要对该列中的数据进行计算，减少了不必要的读取和计算。

列存储适合分析性查询：分析性查询通常涉及多个列的聚合和筛选操作，列存储的存储方式更适合这种场景，可以提高查询效率。

适用于大规模数据处理：列存储在大规模数据处理、数据仓库等场景中具有明显的性能优势，能够更好地支持复杂的分析任务。

列存储相比于行存储的优点和缺点如下：

存储模型	优点	缺点
行存	数据被保存在一起。INSERT/UPDATE容易。	选择(SELECT)时即使只涉及某几列，所有数据也都会被读取。
列存	1. 查询时只有涉及到的列会被读取。 2. 投影(Projection)很高效。 3. 任何列都能作为索引。	1. 选择完成时，被选择的列要重新组装。 2. INSERT/UPDATE比较麻烦。

一般情况下，如果表的字段比较多（大宽表），查询中涉及到的列不多的情况下，适合列存储。如果表的字段个数比较少，查询大部分字段，那么选择行存储比较好。

存储类型	适用场景
行存	1. 点查询(返回记录少，基于索引的简单查询)。 2. 增、删、改操作较多的场景。 3. 频繁的更新、少量的插入。
列存	1. 统计分析类查询 (关联、分组操作较多的场景)。 2. 即席查询（查询条件不确定，行存表扫描难以使用索引）。 3. 一次性大批量插入。 4. 表列数较多，建议使用列存表。 5. 如果每次查询时，只涉及了表的少数（<50%总列数）几个列，建议使用列存表。

语法实现

语法格式

CREATE TABLE table_name 
    (column_name data_type [, ... ])
    [ WITH ( ORIENTATION  = value) ];

参数说明

参数	说明
table_name	要创建的表名。
column_name	新表中要创建的字段名。
data_type	字段的数据类型。
ORIENTATION	指定表数据的存储方式，即行存方式、列存方式，该参数设置成功后就不再支持修改。取值范围： ROW，表示表的数据将以行式存储。行存储适合于OLTP业务，适用于点查询或者增删操作较多的场景。 ROW，表示表的数据将以行式存储。列存储适合于数据仓库业务，此类型的表上会做大量的汇聚计算，且涉及的列操作较少。

示例

来看一下官方文档给出的两个实际案例：

不指定ORIENTATION参数时，表默认为行存表。例如：

openGauss=# CREATE TABLE customer_test1
(
  state_ID   CHAR(2),
  state_NAME VARCHAR2(40),
  area_ID    NUMBER
);

--删除表
openGauss=# DROP TABLE customer_test1;

创建列存表时，需要指定ORIENTATION参数。例如：

openGauss=# CREATE TABLE customer_test2
(
  state_ID   CHAR(2),
  state_NAME VARCHAR2(40),
  area_ID    NUMBER
)
WITH (ORIENTATION = COLUMN);

--删除表
openGauss=# DROP TABLE customer_test2;

源码分析（创建表）

语法层（Gram.y）

接下来从代码实现层面来看看吧，创建列存表所涉及的语法代码如下：

注：Gram.y文件是YACC（Yet Another Compiler Compiler）工具生成的语法分析器的输入文件，用于解析SQL语句或其他领域特定语言。

columnDef:	ColId Typename ColCmprsMode create_generic_options ColQualList
				{
					ColumnDef *n = makeNode(ColumnDef);
					n->colname = $1;
					n->typname = $2;
					n->inhcount = 0;
					n->is_local = true;
					n->is_not_null = false;
					n->is_from_type = false;
					n->storage = 0;
					n->cmprs_mode = $3;
					n->raw_default = NULL;
					n->cooked_default = NULL;
					n->collOid = InvalidOid;
					n->fdwoptions = $4;
                    n->clientLogicColumnRef=NULL;
					
                    SplitColQualList($5, &n->constraints, &n->collClause,&n->clientLogicColumnRef, yyscanner);

					$$ = (Node *)n;
				}
		;

下面我们来分析一下这段代码：

columnDef:：这是一个非终结符，表示列定义的语法规则开始。

ColId Typename ColCmprsMode create_generic_options ColQualList：这是规则的产生式，由一系列非终结符组成，代表列定义的各个部分。

{ }：这是动作部分的开始和结束，包含在花括号内的代码会在解析这个规则时执行。

ColumnDef *n = makeNode(ColumnDef);：在这里，创建了一个 ColumnDef 类型的节点，并将其指针赋值给 n。

n->colname = $1;：将解析得到的列名（通过 $1 表示）赋值给列定义的节点的 colname 字段。

n->typname = $2;：将解析得到的类型名赋值给列定义的节点的 typname 字段。

n->inhcount = 0;：将继承计数字段初始化为 0。

n->is_local = true;：设置 is_local 字段为 true。

n->is_not_null = false;：设置 is_not_null 字段为 false。

n->is_from_type = false;：设置 is_from_type 字段为 false。

n->storage = 0;：将存储字段初始化为 0。

n->cmprs_mode = $3;：将解析得到的压缩模式赋值给 cmprs_mode 字段。

n->raw_default = NULL;：将默认原始值字段初始化为 NULL。

n->cooked_default = NULL;：将默认经过处理的值字段初始化为 NULL。

n->collOid = InvalidOid;：将排序规则 OID 初始化为 InvalidOid。

n->fdwoptions = $4;：将解析得到的外部数据包含选项赋值给 fdwoptions 字段。

n->clientLogicColumnRef=NULL;：将客户逻辑列引用字段初始化为 NULL。

SplitColQualList($5, &n->constraints, &n->collClause, &n->clientLogicColumnRef, yyscanner);：调用函数 SplitColQualList，将解析得到的列限制、排序规则和客户逻辑列引用传递给相应的字段。

$$ = (Node *)n;：将构造的列定义节点 n 赋值给规则的结果。

;：表示语法规则结束。

其中，ColumnDef 结构一般在数据库的源代码中进行定义。它通常是作为系统内部数据结构的一部分，用于表示用户在创建表时定义的列的属性。
ColumnDef 结构源码如下：（路径：src/include/nodes/parsenodes_common.h）

/*
 * ColumnDef - 列定义（用于各种创建操作）
 *
 * 如果列有默认值，我们可以在“原始”形式（未经转换的解析树）或“处理过”形式（经过解析分析的可执行表达式树）中拥有该值的表达式，
 * 这取决于如何创建此 ColumnDef 节点（通过解析还是从现有关系继承）。在同一个节点中不应同时存在两者！
 *
 * 类似地，我们可以在原始形式（表示为 CollateClause，arg==NULL）或处理过形式（校对的 OID）中拥有 COLLATE 规范。
 *
 * 约束列表可能在由 gram.y 生成的原始解析树中包含 CONSTR_DEFAULT 项，但 transformCreateStmt 将删除该项并设置 raw_default。
 * CONSTR_DEFAULT 项不应出现在任何后续处理中。
 */
typedef struct ColumnDef {
    NodeTag type;              /* 结点类型标记 */
    char *colname;             /* 列名 */
    TypeName *typname;         /* 列的数据类型 */
    int kvtype;                /* 如果使用 KV 存储，kv 属性类型 */
    int inhcount;              /* 列继承的次数 */
    bool is_local;             /* 列是否有本地（非继承）定义 */
    bool is_not_null;          /* 是否指定 NOT NULL 约束？ */
    bool is_from_type;         /* 列定义来自表类型 */
    bool is_serial;            /* 列是否是序列类型 */
    char storage;              /* attstorage 设置，或默认为 0 */
    int8 cmprs_mode;           /* 应用于此列的压缩方法 */
    Node *raw_default;         /* 默认值（未经转换的解析树） */
    Node *cooked_default;      /* 默认值（经过转换的表达式树） */
    CollateClause *collClause; /* 未经转换的 COLLATE 规范，如果有的话 */
    Oid collOid;               /* 校对 OID（如果未设置，则为 InvalidOid） */
    List *constraints;         /* 列的其他约束 */
    List *fdwoptions;          /* 每列的 FDW 选项 */
    ClientLogicColumnRef *clientLogicColumnRef; /* 客户端逻辑引用 */
    Position *position;
    Form_pg_attribute dropped_attr; /* 在创建类似表 OE 过程中被删除的属性的结构 */
} ColumnDef;

这里重点来看看n->cmprs_mode = $3;也就是列的压缩方法是如何定义的：

ColCmprsMode:    /* 列压缩模式规则 */
    DELTA           {$$ = ATT_CMPR_DELTA;}        /* delta 压缩 */
    | PREFIX        {$$ = ATT_CMPR_PREFIX;}       /* 前缀压缩 */
    | DICTIONARY    {$$ = ATT_CMPR_DICTIONARY;}   /* 字典压缩 */
    | NUMSTR        {$$ = ATT_CMPR_NUMSTR;}       /* 数字-字符串压缩 */
    | NOCOMPRESS    {$$ = ATT_CMPR_NOCOMPRESS;}   /* 不压缩 */
    | /* EMPTY */   {$$ = ATT_CMPR_UNDEFINED;}    /* 用户未指定 */
;

以上代码是 opengauss 数据库系统中定义列压缩模式的规则。每行代码对应了一种列压缩模式，例如 DELTA 压缩、前缀压缩、字典压缩等。在解析和创建表的过程中，用户可以通过指定列的压缩模式来定义对该列的数据压缩方式。根据语法规则，解析器会将不同的压缩模式转换为对应的内部表示值，以便在内部进行处理。

子模块（utility.cpp）

函数 CreateCommand（路径：src/gausskernel/process/tcop/utility.cpp），用于处理创建表（CREATE 命令）的操作，源码如下：

/*
 * Notice: parse_tree could be from cached plan, do not modify it under other memory context
 */
#ifdef PGXC
void CreateCommand(CreateStmt *parse_tree, const char *query_string, ParamListInfo params, 
                   bool is_top_level, bool sent_to_remote)
#else
void CreateCommand(CreateStmt* parse_tree, const char* query_string, ParamListInfo params, bool is_top_level)
#endif

{
    List* stmts = NIL;
    ListCell* l = NULL;
    Oid rel_oid;
#ifdef PGXC
    bool is_temp = false;
    bool is_object_temp = false;
    PGXCSubCluster* sub_cluster = NULL;
    char* tablespace_name = NULL;
    char relpersistence = RELPERSISTENCE_PERMANENT;
    bool table_is_exist = false;
    char* internal_data = NULL;
    List* uuids = (List*)copyObject(parse_tree->uuids);

    char* first_exec_node = NULL;
    bool is_first_node = false;
    char* query_string_with_info = (char*)query_string;
    char* query_string_with_data = (char*)query_string;

    if (IS_PGXC_COORDINATOR && !IsConnFromCoord()) {
        first_exec_node = find_first_exec_cn();
        is_first_node = (strcmp(first_exec_node, g_instance.attr.attr_common.PGXCNodeName) == 0);
    }
#endif

    /*
     * DefineRelation() needs to know "isTopLevel"
     * by "DfsDDLIsTopLevelXact" to prevent "create hdfs table" running
     * inside a transaction block.
     */
    if (IS_PGXC_COORDINATOR && !IsConnFromCoord())
        u_sess->exec_cxt.DfsDDLIsTopLevelXact = is_top_level;

    /* Run parse analysis ... */
    if (u_sess->attr.attr_sql.enable_parallel_ddl)
        stmts = transformCreateStmt((CreateStmt*)parse_tree, query_string, NIL, true, is_first_node);
    else
        stmts = transformCreateStmt((CreateStmt*)parse_tree, query_string, NIL, false);

    /*
     * If stmts is NULL, then the table is exists.
     * we need record that for searching the group of table.
     */
    if (stmts == NIL) {
        table_is_exist = true;
        /*
         * Just return here, if we continue
         * to send if not exists stmt, may
         * cause the inconsistency of metadata.
         * If we under xc_maintenance_mode, we can do
         * this to slove some problem of inconsistency.
         */
        if (u_sess->attr.attr_common.xc_maintenance_mode == false)
            return;
    }

#ifdef PGXC
    if (IS_MAIN_COORDINATOR) {
        /*
         * Scan the list of objects.
         * Temporary tables are created on Datanodes only.
         * Non-temporary objects are created on all nodes.
         * In case temporary and non-temporary objects are mized return an error.
         */
        bool is_first = true;

        foreach (l, stmts) {
            Node* stmt = (Node*)lfirst(l);

            if (IsA(stmt, CreateStmt)) {
                CreateStmt* stmt_loc = (CreateStmt*)stmt;
                sub_cluster = stmt_loc->subcluster;
                tablespace_name = stmt_loc->tablespacename;
                relpersistence = stmt_loc->relation->relpersistence;
                is_object_temp = stmt_loc->relation->relpersistence == RELPERSISTENCE_TEMP;
                internal_data = stmt_loc->internalData;
                if (is_object_temp)
                    u_sess->exec_cxt.hasTempObject = true;

                if (is_first) {
                    is_first = false;
                    if (is_object_temp)
                        is_temp = true;
                } else {
                    if (is_object_temp != is_temp)
                        ereport(ERROR,
                            (errcode(ERRCODE_FEATURE_NOT_SUPPORTED),
                                errmsg("CREATE not supported for TEMP and non-TEMP objects"),
                                errdetail("You should separate TEMP and non-TEMP objects")));
                }
            } else if (IsA(stmt, CreateForeignTableStmt)) {
#ifdef ENABLE_MULTIPLE_NODES
                validate_streaming_engine_status(stmt);
#endif
                if (in_logic_cluster()) {
                    CreateStmt* stmt_loc = (CreateStmt*)stmt;
                    sub_cluster = stmt_loc->subcluster;
                }

                /* There are no temporary foreign tables */
                if (is_first) {
                    is_first = false;
                } else {
                    if (!is_temp)
                        ereport(ERROR,
                            (errcode(ERRCODE_FEATURE_NOT_SUPPORTED),
                                errmsg("CREATE not supported for TEMP and non-TEMP objects"),
                                errdetail("You should separate TEMP and non-TEMP objects")));
                }
            } else if (IsA(stmt, CreateSeqStmt)) {
                CreateSeqStmt* sstmt = (CreateSeqStmt*)stmt;

                Const* n = makeConst(INT8OID, -1, InvalidOid, sizeof(int64), Int64GetDatum(sstmt->uuid), false, true);

                uuids = lappend(uuids, n);
            }
        }

        /* Package the internalData after the query_string */
        if (internal_data != NULL) {
            query_string_with_data = append_internal_data_to_query(internal_data, query_string);
        }

        /*
         * Now package the uuids message that create table on RemoteNode need.
         */
        if (uuids != NIL) {
            char* uuid_info = nodeToString(uuids);
            AssembleHybridMessage(&query_string_with_info, query_string_with_data, uuid_info);
        } else
            query_string_with_info = query_string_with_data;
    }

    /*
     * If I am the main execute CN but not CCN,
     * Notify the CCN to create firstly, and then notify other CNs except me.
     */
    if (IS_PGXC_COORDINATOR && !IsConnFromCoord()) {
        if (u_sess->attr.attr_sql.enable_parallel_ddl && !is_first_node) {
            if (!sent_to_remote) {
                RemoteQuery* step = makeNode(RemoteQuery);
                step->combine_type = COMBINE_TYPE_SAME;
                step->sql_statement = (char*)query_string_with_info;

                if (is_object_temp)
                    step->exec_type = EXEC_ON_NONE;
                else
                    step->exec_type = EXEC_ON_COORDS;

                step->exec_nodes = NULL;
                step->is_temp = is_temp;
                ExecRemoteUtility_ParallelDDLMode(step, first_exec_node);
                pfree_ext(step);
            }
        }
    }

    if (u_sess->attr.attr_sql.enable_parallel_ddl) {
        if (IS_PGXC_COORDINATOR && !IsConnFromCoord() && !is_first_node)
            stmts = transformCreateStmt((CreateStmt*)parse_tree, query_string, uuids, false);
    }
#endif

#ifdef PGXC
    /*
     * Add a RemoteQuery node for a query at top level on a remote
     * Coordinator, if not already done so
     */
    if (!sent_to_remote) {
        if (u_sess->attr.attr_sql.enable_parallel_ddl && !is_first_node)
            stmts = AddRemoteQueryNode(stmts, query_string_with_info, EXEC_ON_DATANODES, is_temp);
        else
            stmts = AddRemoteQueryNode(stmts, query_string_with_info, CHOOSE_EXEC_NODES(is_object_temp), is_temp);

        if (IS_PGXC_COORDINATOR && !IsConnFromCoord() &&
            (sub_cluster == NULL || sub_cluster->clustertype == SUBCLUSTER_GROUP)) {
            const char* group_name = NULL;
            Oid group_oid = InvalidOid;

            /*
             * If TO-GROUP clause is specified when creating table, we
             * only have to add required datanode in remote DDL execution
             */
            if (sub_cluster != NULL) {
                ListCell* lc = NULL;
                foreach (lc, sub_cluster->members) {
                    group_name = strVal(lfirst(lc));
                }
            } else if (in_logic_cluster() && !table_is_exist) {
                /*
                 *  for CreateForeignTableStmt ,
                 *  CreateTableStmt with user not attached to logic cluster
                 */
                group_name = PgxcGroupGetCurrentLogicCluster();
                if (group_name == NULL) {
                    ereport(ERROR, (errcode(ERRCODE_UNDEFINED_OBJECT), errmsg("Cannot find logic cluster.")));
                }
            } else {
                Oid tablespace_id = InvalidOid;
                bool dfs_tablespace = false;

                if (tablespace_name != NULL) {
                    tablespace_id = get_tablespace_oid(tablespace_name, false);
                } else {
                    tablespace_id = GetDefaultTablespace(relpersistence);
                }

                /* Determine if we are working on a HDFS table. */
                dfs_tablespace = IsSpecifiedTblspc(tablespace_id, FILESYSTEM_HDFS);

                /*
                 * If TO-GROUP clause is not specified we are using the installation group to
                 * distribute table.
                 *
                 * For HDFS table/Foreign Table we don't refer default_storage_nodegroup
                 * to make table creation.
                 */
                if (table_is_exist) {
                    Oid rel_id = RangeVarGetRelid(((CreateStmt*)parse_tree)->relation, NoLock, true);
                    if (OidIsValid(rel_id)) {
                        Oid table_groupoid = get_pgxc_class_groupoid(rel_id);
                        if (OidIsValid(table_groupoid)) {
                            group_name = get_pgxc_groupname(table_groupoid);
                        }
                    }
                    if (group_name == NULL) {
                        group_name = PgxcGroupGetInstallationGroup();
                    }
                } else if (dfs_tablespace || IsA(parse_tree, CreateForeignTableStmt)) {
                    group_name = PgxcGroupGetInstallationGroup();
                } else if (strcmp(u_sess->attr.attr_sql.default_storage_nodegroup, INSTALLATION_MODE) == 0 ||
                           u_sess->attr.attr_common.IsInplaceUpgrade) {
                    group_name = PgxcGroupGetInstallationGroup();
                } else {
                    group_name = u_sess->attr.attr_sql.default_storage_nodegroup;
                }

                /* If we didn't identify an installation node group error it out out */
                if (group_name == NULL) {
                    ereport(ERROR,
                        (errcode(ERRCODE_UNDEFINED_OBJECT),
                            errmsg("Installation node group is not defined in current cluster")));
                }
            }

            /* Fetch group name */
            group_oid = get_pgxc_groupoid(group_name);
            if (!OidIsValid(group_oid)) {
                ereport(ERROR,
                    (errcode(ERRCODE_UNDEFINED_OBJECT), errmsg("Target node group \"%s\" doesn't exist", group_name)));
            }

            if (in_logic_cluster()) {
                check_logic_cluster_create_priv(group_oid, group_name);
            } else {
                /* No limit in logic cluster mode */
                /* check to block non-redistribution process creating table to old group */
                if (!u_sess->attr.attr_sql.enable_cluster_resize) {
                    char in_redistribution = get_pgxc_group_redistributionstatus(group_oid);
                    if (in_redistribution == 'y') {
                        ereport(ERROR,
                            (errcode(ERRCODE_FEATURE_NOT_SUPPORTED),
                                errmsg("Unable to create table on old installation group \"%s\" while in cluster "
                                       "resizing.",
                                    group_name)));
                    }
                }
            }

            /* Build exec_nodes to table creation */
            const int total_len = list_length(stmts);
            Node* node = (Node*)list_nth(stmts, (total_len - 1));

            // *node* should be a RemoteQuery Node
            AssertEreport(query_string != NULL, MOD_EXECUTOR, "Node type is not remote type");
            RemoteQuery* rquery = (RemoteQuery*)node;
            // *exec_nodes* should be a NULL pointer
            AssertEreport(!rquery->exec_nodes, MOD_EXECUTOR, "remote query is not DN");
            rquery->exec_nodes = makeNode(ExecNodes);
            /* Set group oid here for sending bucket map to dn */
            rquery->exec_nodes->distribution.group_oid = group_oid;
            if (find_hashbucket_options(stmts)) {
                rquery->is_send_bucket_map = true;
            }
            /*
             * Check node group permissions, we only do such kind of ACL check
             * for user-defined nodegroup(none-installation)
             */
            AclResult acl_result = pg_nodegroup_aclcheck(group_oid, GetUserId(), ACL_CREATE);
            if (acl_result != ACLCHECK_OK) {
                aclcheck_error(acl_result, ACL_KIND_NODEGROUP, group_name);
            }

            /*
             * Notice!!
             * In cluster resizing stage we need special processing logics in table creation as:
             *	[1]. create table delete_delta ... to group old_group on all DN
             *	[2]. display pgxc_group.group_members
             *	[3]. drop table delete_delta ==> drop delete_delta on all DN
             *
             * So, as normal, when target node group's status is marked as 'installation' or
             * 'redistribution', we have to issue a full-DN create table request, remeber
             * pgxc_class.group_members still reflects table's logic distribution to tell pgxc
             * planner to build Scan operator in multi_nodegroup way. The reason we have to so is
             * to be compatible with current gs_switch_relfilenode() invokation in cluster expand
             * and shrunk mechanism.
             */
            if (need_full_dn_execution(group_name)) {
                /* Sepcial path, issue full-DN create table request */
                rquery->exec_nodes->nodeList = GetAllDataNodes();
            } else {
                /* Normal path, issue only needs DNs in create table request */
                Oid* members = NULL;
                int nmembers = 0;
                nmembers = get_pgxc_groupmembers(group_oid, &members);

                /* Append nodeId to exec_nodes */
                rquery->exec_nodes->nodeList = GetNodeGroupNodeList(members, nmembers);
                pfree_ext(members);

                if (uuids && nmembers < u_sess->pgxc_cxt.NumDataNodes) {
                    char* create_seqs;
                    RemoteQuery* step;

                    /* Create table in NodeGroup with sequence. */
                    create_seqs = assemble_create_sequence_msg(stmts, uuids);
                    step = make_remote_query_for_seq(rquery->exec_nodes, create_seqs);
                    stmts = lappend(stmts, step);
                }
            }
        }
    }
#endif

    if (uuids != NIL) {
        list_free_deep(uuids);
        uuids = NIL;
    }

    /* ... and do it */
    foreach (l, stmts) {
        Node* stmt = (Node*)lfirst(l);

        if (IsA(stmt, CreateStmt)) {
            Datum toast_options;
            static const char* const validnsps[] = HEAP_RELOPT_NAMESPACES;

            /* forbid user to set or change inner options */
            ForbidOutUsersToSetInnerOptions(((CreateStmt*)stmt)->options);

            /* Create the table itself */
            rel_oid = DefineRelation((CreateStmt*)stmt,
                                    ((CreateStmt*)stmt)->relkind == RELKIND_MATVIEW ?
                                                                    RELKIND_MATVIEW : RELKIND_RELATION,
                                    InvalidOid);
            /*
             * Let AlterTableCreateToastTable decide if this one
             * needs a secondary relation too.
             */
            CommandCounterIncrement();

            /* parse and validate reloptions for the toast table */
            toast_options =
                transformRelOptions((Datum)0, ((CreateStmt*)stmt)->options, "toast", validnsps, true, false);

            (void)heap_reloptions(RELKIND_TOASTVALUE, toast_options, true);

            AlterTableCreateToastTable(rel_oid, toast_options, ((CreateStmt *)stmt)->oldToastNode);
            AlterCStoreCreateTables(rel_oid, toast_options, (CreateStmt*)stmt);
            AlterDfsCreateTables(rel_oid, toast_options, (CreateStmt*)stmt);
#ifdef ENABLE_MULTIPLE_NODES
            Datum reloptions = transformRelOptions(
                (Datum)0, ((CreateStmt*)stmt)->options, NULL, validnsps, true, false);
            StdRdOptions* std_opt = (StdRdOptions*)heap_reloptions(RELKIND_RELATION, reloptions, true);
            if (StdRelOptIsTsStore(std_opt)) {
                create_ts_store_tables(rel_oid, toast_options);
            }
            /* create partition policy if ttl or period defined */
            create_part_policy_if_needed((CreateStmt*)stmt, rel_oid);
#endif   /* ENABLE_MULTIPLE_NODES */
        } else if (IsA(stmt, CreateForeignTableStmt)) {
            /* forbid user to set or change inner options */
            ForbidOutUsersToSetInnerOptions(((CreateStmt*)stmt)->options);

            /* if this is a log ft, check its definition */
            check_log_ft_definition((CreateForeignTableStmt*)stmt);

            /* Create the table itself */
            if (pg_strcasecmp(((CreateForeignTableStmt *)stmt)->servername, 
                STREAMING_SERVER) == 0) {
                /* Create stream */
                rel_oid = DefineRelation((CreateStmt*)stmt, RELKIND_STREAM, InvalidOid);
            } else {
                /* Create foreign table */
                rel_oid = DefineRelation((CreateStmt*)stmt, RELKIND_FOREIGN_TABLE, InvalidOid);
            }
            CreateForeignTable((CreateForeignTableStmt*)stmt, rel_oid);
        } else {
            if (IsA(stmt, AlterTableStmt))
                ((AlterTableStmt*)stmt)->fromCreate = true;

            /* Recurse for anything else */
            ProcessUtility(stmt,
                query_string_with_info,
                params,
                false,
                None_Receiver,
#ifdef PGXC
                true,
#endif /* PGXC */
                NULL);
        }

        /* Need CCI between commands */
        if (lnext(l) != NULL)
            CommandCounterIncrement();
    }

    /* reset */
    t_thrd.xact_cxt.inheritFileNode = false;
    parse_tree->uuids = NIL;
}

CreateCommand 函数负责处理 CREATE TABLE、CREATE FOREIGN TABLE 等创建表的 SQL 语句。下面简单介绍一下CreateCommand 函数的执行流程：

在开始之前，根据宏定义，函数有不同的参数，具体分为 PGXC（PostgreSQL扩展性集群）模式和非 PGXC 模式。在 PGXC 模式下，还有一些额外的变量用于并行 DDL（数据定义语言）执行和集群扩展/缩减。

这个函数首先初始化一些变量，包括一些用于 PGXC 模式下的信息，例如集群信息、表空间名、表的持久性等。

设置当前会话的状态，以便 DefineRelation() 函数判断是否需要执行 DDL 语句。对于 PGXC 模式，还会设置并行 DDL 的状态。

进行解析分析，将原始的 parse_tree 转化为一个列表 stmts，其中包含了各种 DDL 语句。解析分析是数据库执行 DDL 语句的第一步，将原始的语法树转换为可以执行的逻辑语句。

如果 stmts 为空，意味着表已经存在，会标记 table_is_exist 为真。这可能会在集群中有一些特殊的处理，具体操作可能会终止或返回。

在 PGXC 模式下，根据一些条件判断，选择性地设置 query_string_with_info，可能包含集群信息和UUID等。

在 PGXC 模式下，如果当前节点是主协调器且不是从协调器连接的，会根据条件发送远程查询，进行表的创建操作，具体取决于表的临时性质和是否启用并行 DDL。

在 PGXC 模式下，如果启用了并行 DDL，会再次进行解析分析，为了在并行 DDL 模式下对每个节点进行处理。

进行迭代处理 stmts 列表中的每个语句，根据语句类型分别执行相应的操作：

如果是 CreateStmt，调用 DefineRelation 函数定义表，然后根据情况创建相应的关联表（如 TOAST 表、列存储表、分布式表等）。

如果是 CreateForeignTableStmt，调用 DefineRelation 函数定义外部表，然后根据情况创建相应的外部表。

对于其他类型的语句，进行递归处理。

在语句执行之间，增加 CommandCounter，确保在不同语句之间的数据一致性。

最后，清理和释放一些资源，包括清空 uuids 列表和重置相关状态。

其中，函数 DefineRelation 是用于创建新表及其元数据的核心函数，它涵盖了与表的物理存储和逻辑结构相关的各种操作，并确保表的定义符合数据库系统的要求。
DefineRelation 函数源码如下：（路径：src/gausskernel/optimizer/commands/tablecmds.cpp）

/* ----------------------------------------------------------------
 *		DefineRelation
 *				Creates a new relation.
 *
 * stmt carries parsetree information from an ordinary CREATE TABLE statement.
 * The other arguments are used to extend the behavior for other cases:
 * relkind: relkind to assign to the new relation
 * ownerId: if not InvalidOid, use this as the new relation's owner.
 *
 * Note that permissions checks are done against current user regardless of
 * ownerId.  A nonzero ownerId is used when someone is creating a relation
 * "on behalf of" someone else, so we still want to see that the current user
 * has permissions to do it.
 *
 * If successful, returns the OID of the new relation.
 * ----------------------------------------------------------------
 */
Oid DefineRelation(CreateStmt* stmt, char relkind, Oid ownerId)
{
    char relname[NAMEDATALEN];
    Oid namespaceId;
    List* schema = stmt->tableElts;
    Oid relationId;
    Oid tablespaceId;
    Relation rel;
    TupleDesc descriptor;
    List* inheritOids = NIL;
    List* old_constraints = NIL;
    bool localHasOids = false;
    int parentOidCount;
    List* rawDefaults = NIL;
    List* cookedDefaults = NIL;
    List *ceLst = NIL;
    Datum reloptions;
    ListCell* listptr = NULL;
    AttrNumber attnum;
    static const char* const validnsps[] = HEAP_RELOPT_NAMESPACES;
    Oid ofTypeId;
    Node* orientedFrom = NULL;
    char* storeChar = ORIENTATION_ROW;
    bool timeseries_checked = false;
    bool dfsTablespace = false;
    bool isInitdbOnDN = false;
    HashBucketInfo* bucketinfo = NULL;
    DistributionType distType;

    /*
     * isalter is true, change the owner of the objects as the owner of the
     * namespace, if the owner of the namespce has the same name as the namescpe
     */
    bool isalter = false;
    bool hashbucket = false;

    bool relisshared = u_sess->attr.attr_common.IsInplaceUpgrade && u_sess->upg_cxt.new_catalog_isshared;
    errno_t rc;
    /*
     * Truncate relname to appropriate length (probably a waste of time, as
     * parser should have done this already).
     */
    rc = strncpy_s(relname, NAMEDATALEN, stmt->relation->relname, NAMEDATALEN - 1);
    securec_check(rc, "", "");
    
    if (stmt->relation->relpersistence == RELPERSISTENCE_UNLOGGED && STMT_RETRY_ENABLED)
        stmt->relation->relpersistence = RELPERSISTENCE_PERMANENT;

    /* During grayscale upgrade, forbid creating LIST/RANGE tables if workingVersionNum is too low. */
    if (stmt->distributeby != NULL) {
        distType = stmt->distributeby->disttype;
        if ((distType == DISTTYPE_RANGE || distType == DISTTYPE_LIST) && 
            t_thrd.proc->workingVersionNum < RANGE_LIST_DISTRIBUTION_VERSION_NUM) {
            ereport(ERROR,
                (errcode(ERRCODE_FEATURE_NOT_SUPPORTED),
                    errmsg(
                        "Working Version Num less than %u does not support LIST/RANGE distributed tables.", 
                        RANGE_LIST_DISTRIBUTION_VERSION_NUM)));
        }
    }

    /*
     * Check consistency of arguments
     */
    if (stmt->oncommit != ONCOMMIT_NOOP
        && !(stmt->relation->relpersistence == RELPERSISTENCE_TEMP
        || stmt->relation->relpersistence == RELPERSISTENCE_GLOBAL_TEMP)) {
        ereport(ERROR,
               (errcode(ERRCODE_INVALID_TABLE_DEFINITION),
               errmsg("ON COMMIT can only be used on temporary tables")));
    }

    //@Temp Table. We do not support on commit drop right now.
    if ((stmt->relation->relpersistence == RELPERSISTENCE_TEMP
        || stmt->relation->relpersistence == RELPERSISTENCE_GLOBAL_TEMP)
        && stmt->oncommit == ONCOMMIT_DROP) {
        ereport(
            ERROR,
            (errcode(ERRCODE_INVALID_TABLE_DEFINITION),
                errmsg(
               "ON COMMIT only support PRESERVE ROWS or DELETE ROWS option")));
    }

    if (stmt->constraints != NIL && relkind == RELKIND_FOREIGN_TABLE) {
        ereport(ERROR, (errcode(ERRCODE_WRONG_OBJECT_TYPE), errmsg("constraints on foreign tables are not supported")));
    }

    if (stmt->constraints != NIL && relkind == RELKIND_STREAM) {
        ereport(ERROR, (errcode(ERRCODE_WRONG_OBJECT_TYPE), errmsg("constraints on streams are not supported")));
    }
    /*
     * For foreign table ROUNDROBIN distribution is a built-in support.
     */
    if (IsA(stmt, CreateForeignTableStmt) &&
        (IsSpecifiedFDW(((CreateForeignTableStmt*)stmt)->servername, DIST_FDW) ||
            IsSpecifiedFDW(((CreateForeignTableStmt*)stmt)->servername, LOG_FDW) ||
            IsSpecifiedFDW(((CreateForeignTableStmt*)stmt)->servername, GC_FDW)) &&
        (IS_PGXC_COORDINATOR || (isRestoreMode && stmt->subcluster)) && !stmt->distributeby) {
        stmt->distributeby = makeNode(DistributeBy);
        stmt->distributeby->disttype = DISTTYPE_ROUNDROBIN;
        stmt->distributeby->colname = NULL;
    }
    /*
     * Look up the namespace in which we are supposed to create the relation,
     * check we have permission to create there, lock it against concurrent
     * drop, and mark stmt->relation as RELPERSISTENCE_TEMP if a temporary
     * namespace is selected.
     */
    namespaceId = RangeVarGetAndCheckCreationNamespace(stmt->relation, NoLock, NULL);

    if (u_sess->attr.attr_sql.enforce_a_behavior) {
        /* Identify user ID that will own the table
         *
         * change the owner of the objects as the owner of the namespace
         * if the owner of the namespce has the same name as the namescpe
         * note: the object must be of the ordinary table, sequence, view or
         *		composite type
         */
        if (!OidIsValid(ownerId) && (relkind == RELKIND_RELATION || relkind == RELKIND_SEQUENCE ||
            relkind == RELKIND_VIEW || relkind == RELKIND_COMPOSITE_TYPE
            || relkind == RELKIND_CONTQUERY))
            ownerId = GetUserIdFromNspId(namespaceId);

        if (!OidIsValid(ownerId))
            ownerId = GetUserId();
        else if (ownerId != GetUserId())
            isalter = true;

        if (isalter) {
            /* Check namespace permissions. */
            AclResult aclresult;

            aclresult = pg_namespace_aclcheck(namespaceId, ownerId, ACL_CREATE);
            if (aclresult != ACLCHECK_OK)
                aclcheck_error(aclresult, ACL_KIND_NAMESPACE, get_namespace_name(namespaceId));
        }
    }
    /*
     * Security check: disallow creating temp tables from security-restricted
     * code.  This is needed because calling code might not expect untrusted
     * tables to appear in pg_temp at the front of its search path.
     */
    if ((stmt->relation->relpersistence == RELPERSISTENCE_TEMP
        || stmt->relation->relpersistence == RELPERSISTENCE_GLOBAL_TEMP)
        && InSecurityRestrictedOperation()) {
        ereport(ERROR,
            (errcode(ERRCODE_INSUFFICIENT_PRIVILEGE),
                errmsg("cannot create temporary table within security-restricted operation")));
    }

    /*
     * Select tablespace to use.  If not specified, use default tablespace
     * (which may in turn default to database's default).
     */
    if (stmt->tablespacename) {
        tablespaceId = get_tablespace_oid(stmt->tablespacename, false);
    } else {
        tablespaceId = GetDefaultTablespace(stmt->relation->relpersistence);
        /* note InvalidOid is OK in this case */
    }

    dfsTablespace = IsSpecifiedTblspc(tablespaceId, FILESYSTEM_HDFS);

    if (dfsTablespace) {
        FEATURE_NOT_PUBLIC_ERROR("HDFS is not yet supported.");
    }

    if (dfsTablespace && is_feature_disabled(DATA_STORAGE_FORMAT)) {
        ereport(ERROR, (errcode(ERRCODE_FEATURE_NOT_SUPPORTED), errmsg("Unsupport the dfs table in this version.")));
    }

    PreCheckCreatedObj(stmt, dfsTablespace, relkind);

    /* Check permissions except when using database's default */
    if (OidIsValid(tablespaceId) && tablespaceId != u_sess->proc_cxt.MyDatabaseTableSpace) {
        AclResult aclresult;

        aclresult = pg_tablespace_aclcheck(tablespaceId, GetUserId(), ACL_CREATE);
        if (aclresult != ACLCHECK_OK)
            aclcheck_error(aclresult, ACL_KIND_TABLESPACE, get_tablespace_name(tablespaceId));
        // view is not related to tablespace, so no need to check permissions
        if (isalter && relkind != RELKIND_VIEW &&  relkind != RELKIND_CONTQUERY) {
            aclresult = pg_tablespace_aclcheck(tablespaceId, ownerId, ACL_CREATE);
            if (aclresult != ACLCHECK_OK)
                aclcheck_error(aclresult, ACL_KIND_TABLESPACE, get_tablespace_name(tablespaceId));
        }
    }

    /* In all cases disallow placing user relations in pg_global */
    if (!relisshared && tablespaceId == GLOBALTABLESPACE_OID)
        ereport(ERROR,
            (errcode(ERRCODE_INVALID_PARAMETER_VALUE),
                errmsg("only shared relations can be placed in pg_global tablespace")));

    /* Identify user ID that will own the table */
    if (!OidIsValid(ownerId))
        ownerId = GetUserId();

    /* Add default options for relation if need. */
    if (!dfsTablespace) {
        if (!u_sess->attr.attr_common.IsInplaceUpgrade) {
            stmt->options = AddDefaultOptionsIfNeed(stmt->options, relkind, stmt->row_compress);
        }
    } else {
        checkObjectCreatedinHDFSTblspc(stmt, relkind);
    }

    /* Only support one partial cluster key for dfs table. */
    if (stmt->clusterKeys && list_length(stmt->clusterKeys) > 1) {
        ereport(ERROR,
            (errcode(ERRCODE_INVALID_TABLE_DEFINITION),
                errmsg("Only support one partial cluster key for dfs/cstore table.")));
    }

    /* Check tablespace's permissions for partition */
    if (stmt->partTableState) {
        check_part_tbl_space(stmt, ownerId, dfsTablespace);
    }

    /*
     * Parse and validate reloptions, if any.
     */
    /* global temp table */
    OnCommitAction oncommitAction = GttOncommitOption(stmt->options);
    if (stmt->relation->relpersistence == RELPERSISTENCE_GLOBAL_TEMP
        && relkind == RELKIND_RELATION) {
        if (oncommitAction != ONCOMMIT_NOOP && stmt->oncommit == ONCOMMIT_NOOP) {
            stmt->oncommit = oncommitAction;
        } else {
            if (oncommitAction != ONCOMMIT_NOOP && stmt->oncommit != ONCOMMIT_NOOP) {
                stmt->options = RemoveRelOption(stmt->options, "on_commit_delete_rows", NULL);
            }
            DefElem *opt = makeNode(DefElem);

            opt->type = T_DefElem;
            opt->defnamespace = NULL;
            opt->defname = "on_commit_delete_rows";
            opt->defaction = DEFELEM_UNSPEC;

            /* use reloptions to remember on commit clause */
            if (stmt->oncommit == ONCOMMIT_DELETE_ROWS) {
                opt->arg = reinterpret_cast<Node *>(makeString("true"));
            } else if (stmt->oncommit == ONCOMMIT_PRESERVE_ROWS) {
                opt->arg = reinterpret_cast<Node *>(makeString("false"));
            } else if (stmt->oncommit == ONCOMMIT_NOOP) {
                opt->arg = reinterpret_cast<Node *>(makeString("false"));
            } else {
                elog(ERROR, "global temp table not support on commit drop clause");
            }
            stmt->options = lappend(stmt->options, opt);
        }
    } else if (oncommitAction != ONCOMMIT_NOOP) {
        elog(ERROR, "The parameter on_commit_delete_rows is exclusive to the global temp table, which cannot be "
                    "specified by a regular table");
    }

    reloptions = transformRelOptions((Datum)0, stmt->options, NULL, validnsps, true, false);

    orientedFrom = (Node*)makeString(ORIENTATION_ROW); /* default is ORIENTATION_ROW */
    StdRdOptions* std_opt = (StdRdOptions*)heap_reloptions(relkind, reloptions, true);
    if (std_opt != NULL) {
        hashbucket = std_opt->hashbucket;
        if (hashbucket == true && t_thrd.proc->workingVersionNum < 92063) {
            ereport(ERROR,
                (errcode(ERRCODE_FEATURE_NOT_SUPPORTED),
                    errmsg("hash bucket table not supported in current version!")));
        }
        if (pg_strcasecmp(ORIENTATION_COLUMN, StdRdOptionsGetStringData(std_opt, orientation, ORIENTATION_ROW)) == 0) {
            orientedFrom = (Node*)makeString(ORIENTATION_COLUMN);
            storeChar = ORIENTATION_COLUMN;
        } else if (pg_strcasecmp(ORIENTATION_ORC,
            StdRdOptionsGetStringData(std_opt, orientation, ORIENTATION_ROW)) == 0) {
            /*
             * Don't allow "create DFS table" to run inside a transaction block.
             *
             * "DfsDDLIsTopLevelXact" is set in "case T_CreateStmt" of
             * standard_ProcessUtility()
             *
             * exception: allow "CREATE DFS TABLE" operation in transaction block
             * during redis a table.
             */
            if (IS_PGXC_COORDINATOR && !IsConnFromCoord() && u_sess->attr.attr_sql.enable_cluster_resize == false)
                PreventTransactionChain(u_sess->exec_cxt.DfsDDLIsTopLevelXact, "CREATE DFS TABLE");

            orientedFrom = (Node*)makeString(ORIENTATION_ORC);
            storeChar = ORIENTATION_COLUMN;
        } else if(0 == pg_strcasecmp(ORIENTATION_TIMESERIES,
                    StdRdOptionsGetStringData(std_opt, orientation, ORIENTATION_ROW))) {
            orientedFrom = (Node *)makeString(ORIENTATION_TIMESERIES);
            storeChar = ORIENTATION_TIMESERIES;
            /* for ts table redistribute, timeseries table redis_ is reserved */
            if (!u_sess->attr.attr_sql.enable_cluster_resize) {
                if (strncmp(relname, "redis_", 6) == 0) {
                    ereport(ERROR,
                        (errcode(ERRCODE_INVALID_TABLE_DEFINITION),
                            errmsg("Invalid table name prefix redis_, reserved in redis mode.")));
                }
            }
            /*
             * Check the kvtype parameter legality for timeseries storage method.
             * If all the kvtype exclude tstime are same, change the orientation to row or column explicitly.
             */
            timeseries_checked = validate_timeseries(&stmt, &reloptions, &storeChar, &orientedFrom);
            std_opt = (StdRdOptions*)heap_reloptions(relkind, reloptions, true);
        }

        // Set kvtype to ATT_KV_UNDEFINED in row-oriented or column-oriented table.
        if (0 != pg_strcasecmp(storeChar, ORIENTATION_TIMESERIES)) {
            clear_kvtype_row_column(stmt);
        }

        /*
         * Because we also support create partition policy for non timeseries table, we should check parameter
         * ttl and period if it contains
         */
        if (timeseries_checked ||
            0 != pg_strcasecmp(TIME_UNDEFINED, StdRdOptionsGetStringData(std_opt, ttl, TIME_UNDEFINED)) ||
            0 != pg_strcasecmp(TIME_UNDEFINED, StdRdOptionsGetStringData(std_opt, period, TIME_UNDEFINED))) {
            partition_policy_check(stmt, std_opt, timeseries_checked);
            if (stmt->partTableState != NULL) {
                check_part_tbl_space(stmt, ownerId, dfsTablespace);
                checkPartitionSynax(stmt);
            }
        }

        if (IS_SINGLE_NODE && stmt->partTableState != NULL) {
            if (stmt->partTableState->rowMovement != ROWMOVEMENT_DISABLE)
                stmt->partTableState->rowMovement = ROWMOVEMENT_ENABLE;
        }

        if (0 == pg_strcasecmp(storeChar, ORIENTATION_COLUMN)) {
            CheckCStoreUnsupportedFeature(stmt);
            CheckCStoreRelOption(std_opt);
            ForbidToSetOptionsForColTbl(stmt->options);
            if (stmt->partTableState) {
                if (stmt->partTableState->rowMovement == ROWMOVEMENT_DISABLE) {
                    ereport(NOTICE,
                        (errmsg("disable row movement is invalid for column stored tables."
                                " They always enable row movement between partitions.")));
                }
                /* always enable rowmovement for column stored tables */
                stmt->partTableState->rowMovement = ROWMOVEMENT_ENABLE;
            }
        } else if (0 == pg_strcasecmp(storeChar, ORIENTATION_TIMESERIES)) {
            /* check both support coloumn store and row store */
            CheckCStoreUnsupportedFeature(stmt);
            CheckCStoreRelOption(std_opt);
            if (stmt->partTableState) {
                if (stmt->partTableState->rowMovement == ROWMOVEMENT_DISABLE)
                    ereport(NOTICE,
                        (errmsg("disable row movement is invalid for timeseries stored tables."
                                " They always enable row movement between partitions.")));
                /* always enable rowmovement for column stored tables */
                stmt->partTableState->rowMovement = ROWMOVEMENT_ENABLE;
            }
            if (relkind == RELKIND_RELATION) {
                /* only care heap relation. ignore foreign table and index relation */
                forbid_to_set_options_for_timeseries_tbl(stmt->options);
            }

            /* construct distribute keys using tstag if not specified */
            if (stmt->distributeby == NULL) {
                ListCell* cell = NULL;
                DistributeBy* newnode = makeNode(DistributeBy);
                List* colnames = NIL;
                newnode->disttype = DISTTYPE_HASH;

                foreach (cell, schema) {
                    ColumnDef* colDef = (ColumnDef*)lfirst(cell);
                    if (colDef->kvtype == ATT_KV_TAG && IsTypeDistributable(colDef->typname->typeOid)) {
                        colnames = lappend(colnames, makeString(colDef->colname));
                    }
                }
                if (list_length(colnames) == 0) {
                    ereport(ERROR,
                        (errcode(ERRCODE_INVALID_TABLE_DEFINITION),
                            errmsg("No column can be used as distribution column.")));
                }
                newnode->colname = colnames;
                stmt->distributeby = newnode;
            /* if specified hidetag, add a hidden column as distribution column */
            } else if (stmt->distributeby->disttype == DISTTYPE_HIDETAG &&
                       stmt->distributeby->colname == NULL) {
                bool has_distcol = false;
                ListCell* cell;
                foreach (cell, schema) {
                    ColumnDef* colDef = (ColumnDef*)lfirst(cell);
                    if (colDef->kvtype == ATT_KV_TAG && IsTypeDistributable(colDef->typname->typeOid)) {
                        has_distcol = true;
                    }
                }
                if (!has_distcol) {
                    ereport(ERROR,
                        (errcode(ERRCODE_INVALID_TABLE_DEFINITION),
                            errmsg("No column can be used as distribution column.")));
                }
                ColumnDef* colDef = makeColumnDef(TS_PSEUDO_DIST_COLUMN, "char");
                colDef->kvtype = ATT_KV_HIDE;
                stmt->tableElts = lappend(stmt->tableElts, colDef);
                /* still use hash logic later */
                DistributeBy* distnode = stmt->distributeby;
                distnode->disttype = DISTTYPE_HASH;

                distnode->colname = lappend(distnode->colname, makeString(colDef->colname));
                ereport(LOG, (errmodule(MOD_TIMESERIES), errmsg("use implicit distribution column method.")));
            }
        } else {
            if (relkind == RELKIND_RELATION) {
                /* only care heap relation. ignore foreign table and index relation */
                ForbidToSetOptionsForRowTbl(stmt->options);
            }
        }
        pfree_ext(std_opt);
    }

    if (pg_strcasecmp(storeChar, ORIENTATION_ROW) == 0) {
        RowTblCheckCompressionOption(stmt->options);
    }

    if (stmt->ofTypename) {
        AclResult aclresult;

        ofTypeId = typenameTypeId(NULL, stmt->ofTypename);

        aclresult = pg_type_aclcheck(ofTypeId, GetUserId(), ACL_USAGE);
        if (aclresult != ACLCHECK_OK)
            aclcheck_error_type(aclresult, ofTypeId);
        if (isalter) {
            ofTypeId = typenameTypeId(NULL, stmt->ofTypename);

            aclresult = pg_type_aclcheck(ofTypeId, ownerId, ACL_USAGE);
            if (aclresult != ACLCHECK_OK)
                aclcheck_error_type(aclresult, ofTypeId);
        }
    } else
        ofTypeId = InvalidOid;

    /*
     * Look up inheritance ancestors and generate relation schema, including
     * inherited attributes.
     */
    schema = MergeAttributes(
        schema, stmt->inhRelations, stmt->relation->relpersistence, &inheritOids, &old_constraints, &parentOidCount);

    /*
     * Create a tuple descriptor from the relation schema.	Note that this
     * deals with column names, types, and NOT NULL constraints, but not
     * default values or CHECK constraints; we handle those below.
     */
    if (relkind == RELKIND_COMPOSITE_TYPE)
        descriptor = BuildDescForRelation(schema, orientedFrom, relkind);
    else
        descriptor = BuildDescForRelation(schema, orientedFrom);

    /* Must specify at least one column when creating a table. */
    if (descriptor->natts == 0 && relkind != RELKIND_COMPOSITE_TYPE) {
        ereport(ERROR, (errcode(ERRCODE_FEATURE_NOT_SUPPORTED), errmsg("must have at least one column")));
    }

    if (stmt->partTableState) {
        List* pos = NIL;

        /* get partitionkey's position */
        pos = GetPartitionkeyPos(stmt->partTableState->partitionKey, schema);

        /* check partitionkey's datatype */
        if (stmt->partTableState->partitionStrategy == PART_STRATEGY_VALUE) {
            CheckValuePartitionKeyType(descriptor->attrs, pos);
        } else if (stmt->partTableState->partitionStrategy == PART_STRATEGY_INTERVAL) {
            CheckIntervalPartitionKeyType(descriptor->attrs, pos);
            CheckIntervalValue(descriptor->attrs, pos, stmt->partTableState->intervalPartDef);
        } else if (stmt->partTableState->partitionStrategy == PART_STRATEGY_RANGE) {
            CheckRangePartitionKeyType(descriptor->attrs, pos);
        } else if (stmt->partTableState->partitionStrategy == PART_STRATEGY_LIST) {
            CheckListPartitionKeyType(descriptor->attrs, pos);
        } else if (stmt->partTableState->partitionStrategy == PART_STRATEGY_HASH) {
            CheckHashPartitionKeyType(descriptor->attrs, pos);
        } else {
            list_free_ext(pos);
            ereport(ERROR,
                    (errcode(ERRCODE_FEATURE_NOT_SUPPORTED), errmsg("Unsupported partition table!")));
        }

        /*
         * Check partitionkey's value for none value-partition table as for value
         * partition table, partition value is known until data get loaded.
         */
        if (stmt->partTableState->partitionStrategy != PART_STRATEGY_VALUE && 
            stmt->partTableState->partitionStrategy != PART_STRATEGY_HASH &&
            stmt->partTableState->partitionStrategy != PART_STRATEGY_LIST)
            ComparePartitionValue(pos, descriptor->attrs, stmt->partTableState->partitionList);
        else if (stmt->partTableState->partitionStrategy == PART_STRATEGY_LIST)
            CompareListValue(pos, descriptor->attrs, stmt->partTableState);

        list_free_ext(pos);
    }

    localHasOids = interpretOidsOption(stmt->options);
    descriptor->tdhasoid = (localHasOids || parentOidCount > 0);

    if ((pg_strcasecmp(storeChar, ORIENTATION_COLUMN) == 0 || pg_strcasecmp(storeChar, ORIENTATION_TIMESERIES) == 0) &&
        localHasOids) {
        ereport(ERROR,
            (errcode(ERRCODE_FEATURE_NOT_SUPPORTED),
                errmsg("Local OID column not supported in column/timeseries store tables.")));
    }

    bool is_gc_fdw = false;
    if (!isRestoreMode && IsA(stmt, CreateForeignTableStmt) &&
        (IsSpecifiedFDW(((CreateForeignTableStmt*)stmt)->servername, GC_FDW))) {
        is_gc_fdw = true;
    }

    /*
     * Find columns with default values and prepare for insertion of the
     * defaults.  Pre-cooked (that is, inherited) defaults go into a list of
     * CookedConstraint structs that we'll pass to heap_create_with_catalog,
     * while raw defaults go into a list of RawColumnDefault structs that will
     * be processed by AddRelationNewConstraints.  (We can't deal with raw
     * expressions until we can do transformExpr.)
     *
     * We can set the atthasdef flags now in the tuple descriptor; this just
     * saves StoreAttrDefault from having to do an immediate update of the
     * pg_attribute rows.
     */
    rawDefaults = NIL;
    cookedDefaults = NIL;
    attnum = 0;

    foreach (listptr, schema) {
        ColumnDef* colDef = (ColumnDef*)lfirst(listptr);

        attnum++;

        if (is_gc_fdw) {
            if (colDef->constraints != NULL || colDef->is_not_null == true) {
                ereport(ERROR,
                    (errcode(ERRCODE_WRONG_OBJECT_TYPE),
                        errmsg("column constraint on postgres foreign tables are not supported")));
            }

            Type ctype = typenameType(NULL, colDef->typname, NULL);

            if (ctype) {
                Form_pg_type typtup = (Form_pg_type)GETSTRUCT(ctype);
                if (typtup->typrelid > 0) {
                    ereport(ERROR,
                        (errcode(ERRCODE_WRONG_OBJECT_TYPE),
                            errmsg("relation type column on postgres foreign tables are not supported")));
                }

                ReleaseSysCache(ctype);
            }
        }

        if (colDef->raw_default != NULL) {
            RawColumnDefault* rawEnt = NULL;

            if (relkind == RELKIND_FOREIGN_TABLE) {
                if (!(IsA(stmt, CreateForeignTableStmt) && (
#ifdef ENABLE_MOT
                        isMOTTableFromSrvName(((CreateForeignTableStmt*)stmt)->servername) ||
#endif
                        isPostgresFDWFromSrvName(((CreateForeignTableStmt*)stmt)->servername))))
                    ereport(ERROR,
                        (errcode(ERRCODE_WRONG_OBJECT_TYPE),
                            errmsg("default values on foreign tables are not supported")));
            }

            if (relkind == RELKIND_STREAM) {
                ereport(ERROR,
                    (errcode(ERRCODE_WRONG_OBJECT_TYPE), errmsg("default values on streams are not supported")));
            }

            Assert(colDef->cooked_default == NULL);

            rawEnt = (RawColumnDefault*)palloc(sizeof(RawColumnDefault));
            rawEnt->attnum = attnum;
            rawEnt->raw_default = colDef->raw_default;
            rawDefaults = lappend(rawDefaults, rawEnt);
            descriptor->attrs[attnum - 1]->atthasdef = true;
        } else if (colDef->cooked_default != NULL) {
            CookedConstraint* cooked = NULL;

            cooked = (CookedConstraint*)palloc(sizeof(CookedConstraint));
            cooked->contype = CONSTR_DEFAULT;
            cooked->name = NULL;
            cooked->attnum = attnum;
            cooked->expr = colDef->cooked_default;
            cooked->skip_validation = false;
            cooked->is_local = true; /* not used for defaults */
            cooked->inhcount = 0;    /* ditto */
            cooked->is_no_inherit = false;
            cookedDefaults = lappend(cookedDefaults, cooked);
            descriptor->attrs[attnum - 1]->atthasdef = true;
        }
        if (colDef->clientLogicColumnRef != NULL) {
            CeHeapInfo *ceHeapInfo = NULL;
            ceHeapInfo = (CeHeapInfo*) palloc(sizeof(CeHeapInfo));
            ceHeapInfo->attnum = attnum;
            set_column_encryption(colDef, ceHeapInfo);
            ceLst = lappend (ceLst, ceHeapInfo);
        }
    }


    /*Get hash partition key based on relation distribution info*/

    bool createbucket = false;
    /* restore mode */
    if (isRestoreMode) {
        /* table need hash partition */
        if (hashbucket == true) {
            /* here is dn */
            if (u_sess->storage_cxt.dumpHashbucketIds != NULL) {
                Assert(stmt->distributeby == NULL);
                createbucket = true;
            } else {
                 if (unlikely(stmt->distributeby == NULL)) {
                    ereport(ERROR,
                        (errcode(ERRCODE_UNEXPECTED_NULL_VALUE), errmsg("distributeby is NULL.")));
                }
            }

            bucketinfo = GetRelationBucketInfo(stmt->distributeby, descriptor, &createbucket, InvalidOid, true);

            Assert((createbucket == true && bucketinfo->bucketlist != NULL && bucketinfo->bucketcol != NULL) ||
                   (createbucket == false && bucketinfo->bucketlist == NULL && bucketinfo->bucketcol != NULL));
        }
    } else {
        /* here is normal mode */
        /* check if the table can be hash partition */
        if (!IS_SINGLE_NODE && !IsInitdb && (relkind == RELKIND_RELATION) && !IsSystemNamespace(namespaceId) &&
            !IsCStoreNamespace(namespaceId) && (0 == pg_strcasecmp(storeChar, ORIENTATION_ROW)) &&
            (stmt->relation->relpersistence == RELPERSISTENCE_PERMANENT)) {
            if (hashbucket == true || u_sess->attr.attr_storage.enable_hashbucket) {
                if (IS_PGXC_DATANODE) {
                    createbucket = true;
                }
                bucketinfo = GetRelationBucketInfo(stmt->distributeby, descriptor, 
                    &createbucket, stmt->oldBucket, hashbucket);

                Assert((bucketinfo == NULL && u_sess->attr.attr_storage.enable_hashbucket) ||
                       (createbucket == true && bucketinfo->bucketlist != NULL && bucketinfo->bucketcol != NULL) ||
                       (createbucket == false && bucketinfo->bucketlist == NULL && bucketinfo->bucketcol != NULL));
            }
        } else if (hashbucket == true) {
            ereport(ERROR,
                (errcode(ERRCODE_FEATURE_NOT_SUPPORTED),
                    errmsg("The table %s do not support hash bucket", stmt->relation->relname)));
        }
    }

    /*
     * Create the relation.  Inherited defaults and constraints are passed in
     * for immediate handling --- since they don't need parsing, they can be
     * stored immediately.
     */
    relationId = heap_create_with_catalog(relname,
        namespaceId,
        tablespaceId,
        InvalidOid,
        InvalidOid,
        ofTypeId,
        ownerId,
        descriptor,
        list_concat(cookedDefaults, old_constraints),
        relkind,
        stmt->relation->relpersistence,
        relisshared,
        relisshared,
        localHasOids,
        parentOidCount,
        stmt->oncommit,
        reloptions,
        true,
        (g_instance.attr.attr_common.allowSystemTableMods || u_sess->attr.attr_common.IsInplaceUpgrade),
        stmt->partTableState,
        stmt->row_compress,
        stmt->oldNode,
        bucketinfo,
        true,
        ceLst);
    if (bucketinfo != NULL) {
        pfree_ext(bucketinfo->bucketcol);
        pfree_ext(bucketinfo->bucketlist);
        pfree_ext(bucketinfo);
    }

    /* Store inheritance information for new rel. */
    StoreCatalogInheritance(relationId, inheritOids);

    /*
     * We must bump the command counter to make the newly-created relation
     * tuple visible for opening.
     */
    CommandCounterIncrement();

#ifdef PGXC
    /*
     * Add to pgxc_class.
     * we need to do this after CommandCounterIncrement
     * Distribution info is to be added under the following conditions:
     * 1. The create table command is being run on a coordinator
     * 2. The create table command is being run in restore mode and
     *    the statement contains distribute by clause.
     *    While adding a new datanode to the cluster an existing dump
     *    that was taken from a datanode is used, and
     *    While adding a new coordinator to the cluster an exiting dump
     *    that was taken from a coordinator is used.
     *    The dump taken from a datanode does NOT contain any DISTRIBUTE BY
     *    clause. This fact is used here to make sure that when the
     *    DISTRIBUTE BY clause is missing in the statemnet the system
     *    should not try to find out the node list itself.
     * 3. When the sum of shmemNumDataNodes and shmemNumCoords equals to one,
     *    the create table command is executed on datanode.In this case, we
     *    do not write created table info in pgxc_class.
     */
    if ((*t_thrd.pgxc_cxt.shmemNumDataNodes + *t_thrd.pgxc_cxt.shmemNumCoords) == 1)
        isInitdbOnDN = true;

    if ((!u_sess->attr.attr_common.IsInplaceUpgrade || !IsSystemNamespace(namespaceId)) &&
        (IS_PGXC_COORDINATOR || (isRestoreMode && stmt->distributeby != NULL && !isInitdbOnDN)) &&
        (relkind == RELKIND_RELATION || relkind == RELKIND_MATVIEW ||
            (relkind == RELKIND_STREAM && stmt->distributeby != NULL) ||
#ifdef ENABLE_MOT
            (relkind == RELKIND_FOREIGN_TABLE && (stmt->distributeby != NULL ||
                (IsA(stmt, CreateForeignTableStmt) &&
                    isMOTTableFromSrvName(((CreateForeignTableStmt*)stmt)->servername)))))) {
#else
            (relkind == RELKIND_FOREIGN_TABLE && stmt->distributeby != NULL))) {
#endif
        char* logic_cluster_name = NULL;
        PGXCSubCluster* subcluster = stmt->subcluster;
        bool isinstallationgroup = (dfsTablespace || relkind == RELKIND_FOREIGN_TABLE 
                                    || relkind == RELKIND_STREAM);
        if (in_logic_cluster()) {
            isinstallationgroup = false;
            if (subcluster == NULL) {
                logic_cluster_name = PgxcGroupGetCurrentLogicCluster();
                if (logic_cluster_name != NULL) {
                    subcluster = makeNode(PGXCSubCluster);
                    subcluster->clustertype = SUBCLUSTER_GROUP;
                    subcluster->members = list_make1(makeString(logic_cluster_name));
                }
            }
        }

        /* assemble referenceoid for slice reference table creation */
        FetchSliceReftableOid(stmt, namespaceId);

        AddRelationDistribution(
            relname, relationId, stmt->distributeby, subcluster, inheritOids, descriptor, isinstallationgroup);

        if (logic_cluster_name != NULL && subcluster != NULL) {
            list_free_deep(subcluster->members);
            pfree_ext(subcluster);
            pfree_ext(logic_cluster_name);
        }

        CommandCounterIncrement();
        /* Make sure locator info gets rebuilt */
        RelationCacheInvalidateEntry(relationId);
    }
    /* If no Datanodes defined, do not create foreign table  */
    if (IS_PGXC_COORDINATOR && (relkind == RELKIND_FOREIGN_TABLE || relkind == RELKIND_STREAM) 
        && u_sess->pgxc_cxt.NumDataNodes == 0) {
        ereport(ERROR, (errcode(ERRCODE_UNDEFINED_OBJECT), errmsg("No Datanode defined in cluster")));
    }
#endif
    /*
     * Open the new relation and acquire exclusive lock on it.	This isn't
     * really necessary for locking out other backends (since they can't see
     * the new rel anyway until we commit), but it keeps the lock manager from
     * complaining about deadlock risks.
     */
    rel = relation_open(relationId, AccessExclusiveLock);

    /*
     * Now add any newly specified column default values and CHECK constraints
     * to the new relation.  These are passed to us in the form of raw
     * parsetrees; we need to transform them to executable expression trees
     * before they can be added. The most convenient way to do that is to
     * apply the parser's transformExpr routine, but transformExpr doesn't
     * work unless we have a pre-existing relation. So, the transformation has
     * to be postponed to this final step of CREATE TABLE.
     */
    if (rawDefaults != NULL || stmt->constraints != NULL) {
        List *tmp = AddRelationNewConstraints(rel, rawDefaults, stmt->constraints, true, true);
        list_free_ext(tmp);
    }

    /*
     * Now add any cluter key constraint for relation if has.
     */
    if (stmt->clusterKeys)
        AddRelClusterConstraints(rel, stmt->clusterKeys);

    /*
     * Clean up.  We keep lock on new relation (although it shouldn't be
     * visible to anyone else anyway, until commit).
     */
    relation_close(rel, NoLock);
    list_free_ext(rawDefaults);
    list_free_ext(ceLst);

    return relationId;
}

可以看到 DefineRelation 函数非常的长，没关系，我们只看我们需要的部分就可以啦。
首先，来看一下 heap_reloptions 函数， heap_reloptions 函数用于获取表的存储选项，它需要传入表的类型 relkind（如 RELKIND_RELATION 表示普通关系表，RELKIND_FOREIGN_TABLE 表示外部表等）以及 reloptions，它是一个存储选项列表。这些选项可以包括各种关于表的存储细节的信息。
heap_reloptions 函数源码如下：（路径：src/gausskernel/storage/access/common/reloptions.cpp）

/*
 * 解析堆、视图和 TOAST 表的选项。
 */
bytea *heap_reloptions(char relkind, Datum reloptions, bool validate)
{
    StdRdOptions *rdopts = NULL;

    // 根据关系类型选择相应的选项解析
    switch (relkind) {
        case RELKIND_TOASTVALUE:
            // 对于 TOAST 表，使用默认选项解析，类型为 RELOPT_KIND_TOAST
            rdopts = (StdRdOptions *)default_reloptions(reloptions, validate, RELOPT_KIND_TOAST);
            if (rdopts != NULL) {
                /* 调整仅适用于 TOAST 关系的默认参数 */
                rdopts->fillfactor = 100;
                rdopts->autovacuum.analyze_threshold = -1;
                rdopts->autovacuum.analyze_scale_factor = -1;
            }
            return (bytea *)rdopts;
        case RELKIND_RELATION:
            // 对于堆关系，使用默认选项解析，类型为 RELOPT_KIND_HEAP
            return default_reloptions(reloptions, validate, RELOPT_KIND_HEAP);
        case RELKIND_VIEW:
        case RELKIND_CONTQUERY:
        case RELKIND_MATVIEW:
            // 对于视图、连续查询和物化视图，使用默认选项解析，类型为 RELOPT_KIND_VIEW
            return default_reloptions(reloptions, validate, RELOPT_KIND_VIEW);
        default:
            /* 不支持其他关系类型 */
            return NULL;
    }
}

其中，RELKIND_TOASTVALUE、RELKIND_RELATION、RELKIND_VIEW、RELKIND_CONTQUERY和RELKIND_MATVIEW分别代表不同类型的数据库关系，表示以下含义：

数据库关系类型	含义
RELKIND_TOASTVALUE	用于存储大对象（Large Object，如大文本或大二进制数据）的分片数据。这些分片数据通常是对原始数据进行分段存储，以便在需要时进行透明的读取和管理。
RELKIND_RELATION	这是普通的堆表（Heap Table），也就是一般的数据表。它用于存储实际的行数据，以及与之关联的各种列信息。
RELKIND_VIEW	这是一个视图（View），它是一个虚拟的表，由查询定义而来。视图不存储实际的数据，而是提供对其他关系数据的逻辑视图。
RELKIND_CONTQUERY	这是一种持续查询（Continuous Query），用于处理流数据（Stream Data）。持续查询关系允许用户定义一种查询，它可以随着新数据的到达而动态更新结果。
RELKIND_MATVIEW	这是物化视图（Materialized View），也是一种虚拟的表，但是与普通视图不同，物化视图会实际存储计算结果，以提高查询性能。

default_reloptions 函数的作用是获取一个指向表的默认关系选项的指针，以便后续的处理和使用。总而言之，heap_reloptions 函数的作用是提取存储信息，对表的 reloptions 进行提取，存储到 StdRdOptions 结构体中。
以案例中的 SQL 语句为例：

openGauss=# CREATE TABLE customer_test2
(
  state_ID   CHAR(2),
  state_NAME VARCHAR2(40),
  area_ID    NUMBER
)
WITH (ORIENTATION = COLUMN);

调试信息如下：

接着再来分析如下判断条件：

if (pg_strcasecmp(ORIENTATION_COLUMN, StdRdOptionsGetStringData(std_opt, orientation, ORIENTATION_ROW)) == 0) {
            orientedFrom = (Node*)makeString(ORIENTATION_COLUMN);
            storeChar = ORIENTATION_COLUMN;
        }

首先，它使用 StdRdOptionsGetStringData(std_opt, orientation, ORIENTATION_ROW) 从存储选项中获取方向信息，然后通过 pg_strcasecmp 函数将获取到的方向信息与字符串常量 ORIENTATION_COLUMN 进行不区分大小写的比较。
如果比较的结果为 0，表示存储选项中的方向信息与 ORIENTATION_COLUMN 相匹配，那么就会执行以下操作：

将变量 orientedFrom 设置为一个表示列存储方向的节点，使用 makeString(ORIENTATION_COLUMN) 创建这个节点。

将变量 storeChar 设置为字符串常量 ORIENTATION_COLUMN，以便后续的操作可以使用这个标识来表示方向信息。

换句话说，这段代码的作用是检查存储选项中的方向信息是否为列存储，如果是，则设置相应的变量来表示这个信息。

由实际案例的调试信息可以看到方向信息是列存储

接着再来分析如下判断条件：

        // Set kvtype to ATT_KV_UNDEFINED in row-oriented or column-oriented table.
        if (0 != pg_strcasecmp(storeChar, ORIENTATION_TIMESERIES)) {
            clear_kvtype_row_column(stmt);
        }

这个判断是在检查存储选项中的方向信息是否为 "TIMESERIES"，如果不是的话，就执行一个函数 clear_kvtype_row_column(stmt) 来设置表的 kvtype 属性为 ATT_KV_UNDEFINED。
换句话说，当存储选项中的方向信息不是 "TIMESERIES" 时，将执行一些操作来将表的 kvtype 设置为未定义状态。
最后，再来分析如下判断条件：

if (0 == pg_strcasecmp(storeChar, ORIENTATION_COLUMN)) {
            CheckCStoreUnsupportedFeature(stmt);
            CheckCStoreRelOption(std_opt);
            ForbidToSetOptionsForColTbl(stmt->options);
            if (stmt->partTableState) {
                if (stmt->partTableState->rowMovement == ROWMOVEMENT_DISABLE) {
                    ereport(NOTICE,
                        (errmsg("disable row movement is invalid for column stored tables."
                                " They always enable row movement between partitions.")));
                }
                /* always enable rowmovement for column stored tables */
                stmt->partTableState->rowMovement = ROWMOVEMENT_ENABLE;
            }
        } else if (0 == pg_strcasecmp(storeChar, ORIENTATION_TIMESERIES)) {
            /* check both support coloumn store and row store */
            CheckCStoreUnsupportedFeature(stmt);
            CheckCStoreRelOption(std_opt);
            if (stmt->partTableState) {
                if (stmt->partTableState->rowMovement == ROWMOVEMENT_DISABLE)
                    ereport(NOTICE,
                        (errmsg("disable row movement is invalid for timeseries stored tables."
                                " They always enable row movement between partitions.")));
                /* always enable rowmovement for column stored tables */
                stmt->partTableState->rowMovement = ROWMOVEMENT_ENABLE;
            }
            if (relkind == RELKIND_RELATION) {
                /* only care heap relation. ignore foreign table and index relation */
                forbid_to_set_options_for_timeseries_tbl(stmt->options);
            }

            /* construct distribute keys using tstag if not specified */
            if (stmt->distributeby == NULL) {
                ListCell* cell = NULL;
                DistributeBy* newnode = makeNode(DistributeBy);
                List* colnames = NIL;
                newnode->disttype = DISTTYPE_HASH;

                foreach (cell, schema) {
                    ColumnDef* colDef = (ColumnDef*)lfirst(cell);
                    if (colDef->kvtype == ATT_KV_TAG && IsTypeDistributable(colDef->typname->typeOid)) {
                        colnames = lappend(colnames, makeString(colDef->colname));
                    }
                }
                if (list_length(colnames) == 0) {
                    ereport(ERROR,
                        (errcode(ERRCODE_INVALID_TABLE_DEFINITION),
                            errmsg("No column can be used as distribution column.")));
                }
                newnode->colname = colnames;
                stmt->distributeby = newnode;
            /* if specified hidetag, add a hidden column as distribution column */
            } else if (stmt->distributeby->disttype == DISTTYPE_HIDETAG &&
                       stmt->distributeby->colname == NULL) {
                bool has_distcol = false;
                ListCell* cell;
                foreach (cell, schema) {
                    ColumnDef* colDef = (ColumnDef*)lfirst(cell);
                    if (colDef->kvtype == ATT_KV_TAG && IsTypeDistributable(colDef->typname->typeOid)) {
                        has_distcol = true;
                    }
                }
                if (!has_distcol) {
                    ereport(ERROR,
                        (errcode(ERRCODE_INVALID_TABLE_DEFINITION),
                            errmsg("No column can be used as distribution column.")));
                }
                ColumnDef* colDef = makeColumnDef(TS_PSEUDO_DIST_COLUMN, "char");
                colDef->kvtype = ATT_KV_HIDE;
                stmt->tableElts = lappend(stmt->tableElts, colDef);
                /* still use hash logic later */
                DistributeBy* distnode = stmt->distributeby;
                distnode->disttype = DISTTYPE_HASH;

                distnode->colname = lappend(distnode->colname, makeString(colDef->colname));
                ereport(LOG, (errmodule(MOD_TIMESERIES), errmsg("use implicit distribution column method.")));
            }
        } else {
            if (relkind == RELKIND_RELATION) {
                /* only care heap relation. ignore foreign table and index relation */
                ForbidToSetOptionsForRowTbl(stmt->options);
            }
        }

这段代码根据存储选项中的方向信息（storeChar）执行一系列操作。

如果存储选项的方向是 "COLUMN"，则执行以下操作：

调用 CheckCStoreUnsupportedFeature(stmt)，检查是否支持列存储的特性。

调用 CheckCStoreRelOption(std_opt)，检查列存储的关系选项。

调用 ForbidToSetOptionsForColTbl(stmt->options)，禁止为列存储表设置特定的选项。

如果存在分区表状态（stmt->partTableState），则根据分区表状态设置行移动属性为 "ROWMOVEMENT_ENABLE"，因为列存储表总是启用分区间的行移动。

如果存储选项的方向是 "TIMESERIES"，则执行以下操作：

调用 CheckCStoreUnsupportedFeature(stmt)，检查是否支持列存储的特性。

调用 CheckCStoreRelOption(std_opt)，检查列存储的关系选项。

如果存在分区表状态（stmt->partTableState），则根据分区表状态设置行移动属性为 "ROWMOVEMENT_ENABLE"。

如果表的类型是普通表（relkind == RELKIND_RELATION），则禁止为时序存储表设置特定的选项。

构建分布键使用时间戳标签列作为分布列，如果未指定分布键的话。

如果指定了隐藏标签（"HIDETAG"）的分布方式，且未指定分布列，则添加一个隐藏列作为分布列。

如果存储选项的方向不是 "COLUMN" 或 "TIMESERIES"，则执行以下操作：

如果表的类型是普通表（relkind == RELKIND_RELATION），则禁止为行存储表设置特定的选项。

其次，我们进入到 CheckCStoreUnsupportedFeature 函数来看看吧，这个函数用于检查列存储表是否支持指定的特性，如果不支持则报告错误。
CheckCStoreUnsupportedFeature 函数源码如下：（路径：src/gausskernel/optimizer/commands/tablecmds.cpp）

// all unsupported features are checked and error reported here for cstore table
static void CheckCStoreUnsupportedFeature(CreateStmt* stmt)
{
    Assert(stmt);

    if (stmt->relation->relpersistence == RELPERSISTENCE_GLOBAL_TEMP) {
        ereport(ERROR,
            (errcode(ERRCODE_INVALID_TABLE_DEFINITION),
                errmsg("global temporary table can only support heap table")));
    }

    if (stmt->ofTypename)
        ereport(ERROR,
            (errcode(ERRCODE_FEATURE_NOT_SUPPORTED),
                errmsg("Unsupport feature"),
                errdetail("cstore/timeseries don't support relation defination "
                          "with composite type using CREATE TABLE OF TYPENAME.")));

    if (stmt->inhRelations) {
        ereport(ERROR,
            (errcode(ERRCODE_FEATURE_NOT_SUPPORTED),
                errmsg("Unsupport feature"),
                errdetail("cstore/timeseries don't support relation defination with inheritance.")));
    }
    
    if (stmt->relation->schemaname != NULL &&
        IsSystemNamespace(get_namespace_oid(stmt->relation->schemaname, false))) {
        ereport(ERROR,
            (errcode(ERRCODE_FEATURE_NOT_SUPPORTED),
                errmsg("Unsupport feature"),
                errdetail("cstore/timeseries don't support relation defination with System namespace.")));
    }
    CheckPartitionUnsupported(stmt);
    // Check constraints
    ListCell* lc = NULL;
    foreach (lc, stmt->tableEltsDup) {
        Node* element = (Node*)lfirst(lc);
        /* check table-level constraints */
        if (IsA(element, Constraint) && !CSTORE_SUPPORT_CONSTRAINT(((Constraint*)element)->contype)) {
            ereport(ERROR,
                (errcode(ERRCODE_FEATURE_NOT_SUPPORTED),
                    errmsg("column/timeseries store unsupport constraint \"%s\"",
                        GetConstraintType(((Constraint*)element)->contype))));
        } else if (IsA(element, ColumnDef)) {
            List* colConsList = ((ColumnDef*)element)->constraints;
            ListCell* lc2 = NULL;
            /* check column-level constraints */
            foreach (lc2, colConsList) {
                Constraint* colCons = (Constraint*)lfirst(lc2);
                if (!CSTORE_SUPPORT_CONSTRAINT(colCons->contype)) {
                    ereport(ERROR,
                        (errcode(ERRCODE_FEATURE_NOT_SUPPORTED),
                            errmsg("column/timeseries store unsupport constraint \"%s\"",
                                GetConstraintType(colCons->contype))));
                }
            }
        }
    }
}

下面是函数中每个部分的解释：

首先，函数使用 Assert(stmt) 确保传入的 CreateStmt 结构体非空。

如果要创建的表是全局临时表（stmt->relation->relpersistence == RELPERSISTENCE_GLOBAL_TEMP），则报告错误，因为列存储表不支持全局临时表。

如果表的定义中使用了 CREATE TABLE OF TYPENAME，报告错误，因为列存储表不支持使用复合类型定义。

如果表的定义使用了继承（stmt->inhRelations），报告错误，因为列存储表不支持继承。

如果表的模式名不为空且属于系统命名空间，报告错误，因为列存储表不支持在系统命名空间中定义。

调用 CheckPartitionUnsupported(stmt) 检查分区相关的不支持特性。

遍历 stmt->tableEltsDup 中的每个元素（表元素，如列定义、约束等），检查是否存在不受支持的约束类型。如果存在不受支持的约束，报告错误。

针对表级约束，检查约束类型是否受支持。

针对列级约束，检查每个列的约束列表中的约束类型是否受支持。

其次，我们再来看看 CheckCStoreRelOption 函数，该函数主要检查 PARTIAL_CLUSTER_ROWS 是否小于 MAX_BATCHROW 的值。StdRdOptions 是一个用于存储关系选项的数据结构，它在代码中用于表示存储引擎的特定选项。
其源码如下：（路径：src/gausskernel/optimizer/commands/tablecmds.cpp）

void CheckCStoreRelOption(StdRdOptions* std_opt)
{
    Assert(std_opt);
    if (std_opt->partial_cluster_rows < std_opt->max_batch_rows && std_opt->partial_cluster_rows >= 0) {
        ereport(ERROR,
            (errcode(ERRCODE_INVALID_TABLE_DEFINITION),
                errmsg("PARTIAL_CLUSTER_ROWS cannot be less than MAX_BATCHROW."),
                errdetail("PARTIAL_CLUSTER_ROWS must be greater than or equal to MAX_BATCHROW."),
                errhint("PARTIAL_CLUSTER_ROWS is MAX_BATCHROW multiplied by an integer.")));
    }
}

以下是函数的解释：

首先，函数使用 Assert(std_opt) 确保传入的 StdRdOptions 结构体非空。

如果 PARTIAL_CLUSTER_ROWS 的值小于 MAX_BATCHROW 并且大于等于0，报告错误。这是因为在列存储表中，PARTIAL_CLUSTER_ROWS 表示部分数据块的行数，而 MAX_BATCHROW 表示每个批处理的最大行数。这两个参数应该满足 PARTIAL_CLUSTER_ROWS >= MAX_BATCHROW 的关系。
报告的错误信息包括：

"PARTIAL_CLUSTER_ROWS" 不能小于 "MAX_BATCHROW"。

"PARTIAL_CLUSTER_ROWS" 必须大于或等于 "MAX_BATCHROW"。

提示说明 "PARTIAL_CLUSTER_ROWS" 是 "MAX_BATCHROW" 乘以一个整数。

了解完了函数，我们再分别来看一下函数中的以下两个函数

AlterTableCreateToastTable(rel_oid, toast_options, ((CreateStmt *)stmt)->oldToastNode);
AlterCStoreCreateTables(rel_oid, toast_options, (CreateStmt*)stmt);

其中，AlterTableCreateToastTable 函数的作用是为表创建 TOAST（The Oversized-Attribute Storage Technique）表，用于存储那些超过一定大小的大型列数据。TOAST 表存储的是被压缩和分割成块的列值，以优化数据库性能和存储空间的使用。

参数解释：

rel_oid：要创建 TOAST 表的主表的对象标识符（OID）。

toast_options：创建 TOAST 表的选项，包括压缩、存储引擎等设置。

((CreateStmt *)stmt)->oldToastNode ：源表的 TOAST 表节点（如果存在的话），用于在执行 ALTER TABLE 操作时将现有的 TOAST 表与新创建的 TOAST 表进行合并。

AlterTableCreateToastTable 函数源码如下：（路径：src/common/backend/catalog/toasting.cpp）

/*
 * AlterTableCreateToastTable
 *		If the table needs a toast table, and doesn't already have one,
 *		then create a toast table for it.
 *
 * reloptions for the toast table can be passed, too.  Pass (Datum) 0
 * for default reloptions.
 *
 * We expect the caller to have verified that the relation is a table and have
 * already done any necessary permission checks.  Callers expect this function
 * to end with CommandCounterIncrement if it makes any changes.
 */
void AlterTableCreateToastTable(Oid relOid, Datum reloptions, List *filenodelist)
{
    Relation rel;
    bool rel_is_partitioned = check_rel_is_partitioned(relOid);
    if (!rel_is_partitioned) {
        /*
         * Grab an exclusive lock on the target table, since we'll update its
         * pg_class tuple. This is redundant for all present uses, since caller
         * will have such a lock already.  But the lock is needed to ensure that
         * concurrent readers of the pg_class tuple won't have visibility issues,
         * so let's be safe.
         */
        rel = heap_open(relOid, AccessExclusiveLock);
        if (needs_toast_table(rel))
            (void)create_toast_table(rel, InvalidOid, InvalidOid, reloptions, false, filenodelist);
    } else {
        rel = heap_open(relOid, AccessShareLock);
        if (needs_toast_table(rel))
            (void)createToastTableForPartitionedTable(rel, reloptions, filenodelist);
    }

    heap_close(rel, NoLock);
}

在 AlterTableCreateToastTable 函数中， if (needs_toast_table(rel)) 判断的是是否需要为某个表创建 TOAST 表。其中，needs_toast_table 函数中有如下代码段：

    // column-store relations don't need any toast tables.
    if (RelationIsColStore(rel))
        return false;

因为 TOAST 表的创建和维护会增加一些开销，而对于列存储表来说，通常已经具备了高效存储和压缩的特性，所以不像行存储表那样需要单独的 TOAST 表来处理大型列数据。

AlterCStoreCreateTables 函数的作用是为一个列存储表执行一些列存储特定的操作，主要包括以下几个方面：

创建 CStore 子表（Delta 表） ：对于列存储表，通常会有一个主表和一个或多个子表（如 Delta 表）。Delta 表用于存储新增和修改的数据，以便在之后的时间点将这些变更合并到主表中。这个函数可能会创建或配置 Delta 表。

配置存储选项：列存储表可能有一些特定的存储选项，这些选项可能会影响数据的存储、压缩、索引等方面。函数可能会根据提供的参数进行相应的存储选项配置。

处理 TOAST 表：尽管列存储表不需要创建 TOAST 表，但在某些情况下可能需要处理 TOAST 相关的选项，例如对于那些不同存储方式混合的列存储表

AlterCStoreCreateTables 函数源码如下所示：（路径：src/common/backend/catalog/cstore_ctlg.cpp）

/*
 * AlterTableCreateDeltaTable
 * 如果是一个 ColStore 表，就应该调用这个函数。
 * 这个函数用于创建一个 Delta 表。
 */
void AlterCStoreCreateTables(Oid relOid, Datum reloptions, CreateStmt* mainTblStmt)
{
    Relation rel;

    /*
     * 获取目标表的排它锁，因为我们将会更新它的 pg_class 元组。
     * 这对于目前的所有使用情况来说都是多余的，因为调用者已经有了这样的锁。
     * 但是为了确保并发读取 pg_class 元组的其他进程不会出现可见性问题，我们保险起见加上这个锁。
     */
    rel = heap_open(relOid, AccessExclusiveLock);
    
    /*
     * Dfs 表将会使用 AlterDfsCreateTables 函数处理。
     */
    if (!RelationIsCUFormat(rel)) {
        heap_close(rel, NoLock);
        return;
    }

    if (!RELATION_IS_PARTITIONED(rel)) {
        /* create_delta_table 函数完成所有工作 */
        // 用于创建 Delta 表的，Delta 表存储了列存储表中发生的数据变更（如插入、更新、删除操作）的信息，以便后续进行数据恢复或查询。
        (void)CreateDeltaTable(rel, reloptions, false, mainTblStmt);
        // 用于创建 CUDesc 表，也就是变更描述表,CUDesc 表用于记录列存储表中数据变更的信息，如插入、更新、删除的数据。
        (void)CreateCUDescTable(rel, reloptions, false);
        // 通过静态方法调用来创建列存储表的存储空间
        CStore::CreateStorage(rel, InvalidOid);
    } else {
        createCUDescTableForPartitionedTable(rel, reloptions);
        createDeltaTableForPartitionedTable(rel, reloptions, mainTblStmt);
        CreateStorageForPartition(rel);
    }

    heap_close(rel, NoLock);
}

这里重点看一下 CStore::CreateStorage ，CreateStorage 为 CStore 类中的静态方法，用来创建列存储表的存储空间，源码如下所示：（路径：src/gausskernel/storage/cstore/cstore_am.cpp）

/* DONT call in redo */
// 提醒不要在恢复（redo）过程中调用这个函数
void CStore::CreateStorage(Relation rel, Oid newRelFileNode)
{
	// 获取表的元组描述（Tuple Descriptor）。
    TupleDesc desc = RelationGetDescr(rel);
    // 获取表的属性数量。
    int nattrs = desc->natts;
    // 获取表的属性信息数组。
    Form_pg_attribute* attrs = desc->attrs;
    // 获取表的持久性信息，即表是持久性表还是临时表。
    char relpersistence = rel->rd_rel->relpersistence;

	// 获取表的关系文件节点信息。
    RelFileNode rd_node = rel->rd_node;
    // 如果 newRelFileNode 是有效的（即指定了新的关系文件节点），则将当前表的关系文件节点更新为新的关系文件节点。
    if (OidIsValid(newRelFileNode)) {
        // use the new filenode if *newRelFileNode* is valid.
        rd_node.relNode = newRelFileNode;
    }

    for (int i = 0; i < nattrs; i++) {
    	// 如果当前属性已被标记为删除（attisdropped 为 true），则跳过此属性。
        if (attrs[i]->attisdropped)
            continue;
        // 获取当前属性的属性编号。
        int attrid = attrs[i]->attnum;
		
		// 创建一个 CFileNode 实例，用于表示关系文件节点和属性编号。
        CFileNode cnode(rd_node, attrid, MAIN_FORKNUM);

        // create cu file in disk.
        // 创建一个 CUStorage 实例，表示列存储单元（Column Unit）的存储。
        CUStorage* custorage = New(CurrentMemoryContext) CUStorage(cnode);
        Assert(custorage);
        // 调用 custorage 的 CreateStorage 方法来创建存储空间。它会在磁盘上创建相应的 CU 文件。
        custorage->CreateStorage(0, false);
        // 删除之前创建的 custorage 实例。
        DELETE_EX(custorage);

        // log and insert into the pending delete list.
        // 将关系文件节点、属性编号、持久性信息和表的拥有者信息传递给它，以记录创建存储空间的操作。
        CStoreRelCreateStorage(&rd_node, attrid, relpersistence, rel->rd_rel->relowner);
    }
}

调试信息如下所示：

这里我们对 Form_pg_attribute* attrs = desc->attrs; 稍作解析：

{attrelid = 24646, attname = {data = "state_id", '\000' <repeats 55 times>}, atttypid = 1042, attstattarget = -1, attlen = -1, attnum = 1, attndims = 0,
  attcacheoff = -1, atttypmod = 6, attbyval = false, attstorage = 120 'x', attalign = 105 'i', attnotnull = false, atthasdef = false, attisdropped = false,
  attislocal = true, attcmprmode = 127 '\177', attinhcount = 0, attcollation = 100, attkvtype = 0 '\000'}

参数	含义
attrelid = 24646	表示这个属性所属的表的关系 ID。
attname = {data = “state_id”, ‘\000’ }	表示属性的名称，这里是 “state_id”。
atttypid = 1042	表示属性的数据类型的 OID。在这个例子中，OID 为 1042，对应的数据类型是字符类型。
attstattarget = -1	表示在自动统计分析期间收集统计信息的目标值。在这里是 -1，表示未指定。
attlen = -1	表示属性的长度（字节数）。在这里是 -1，表示长度是可变的。
attnum = 1	表示属性的编号（从 1 开始）。在这里是 1。
attndims = 0	表示属性的维度数目。在这里是 0，表示这是一个标量属性。
attcacheoff = -1	表示属性在元组中的偏移量。在这里是 -1，表示未指定。
atttypmod = 6	表示属性的类型修饰符。在这里是 6，具体含义取决于属性的数据类型。
attbyval = false	表示属性是否按值传递。在这里是 false，表示不是按值传递。
attstorage = 120 ‘x’	表示属性的存储方式。在这里是 ‘x’，表示外部存储。
attalign = 105 ‘i’	表示属性的对齐方式。在这里是 ‘i’，表示按照 int 类型的对齐方式。
attnotnull = false	表示属性是否可以为 NULL。在这里是 false，表示可以为 NULL。
atthasdef = false	表示属性是否有默认值。在这里是 false，表示没有默认值。
attisdropped = false	表示属性是否被标记为已删除。在这里是 false，表示没有被标记为删除。
attislocal = true	表示属性是否是本地属性。在这里是 true，表示是本表的属性。
attcmprmode = 127 ‘\177’	表示属性的压缩模式。在这里是 127，具体含义取决于属性的数据类型和存储方式。
attinhcount = 0	表示从父表继承的次数。在这里是 0，表示没有从父表继承。
attcollation = 100	表示属性的排序规则的 OID。在这里是 100，对应的排序规则。
attkvtype = 0 ‘\000’	表示属性的键值类型。在这里是 0，表示不是键值属性。

总结

到此，本文初步介绍了列存储创建表的大致流程，其中很多的细节可能并没有详细展开。此外，列存储所涉及的模块和相关知识也非常多，在后续的学习中会不断的跟进。

你可能感兴趣的:(OpenGauss,gaussdb,postgresql,数据库)

分布式系统ID生成方案深度解析：雪花算法 vs UUID vs 其他主流方案可曾去过倒悬山算法后端
分布式系统ID生成方案深度解析：雪花算法vsUUIDvs其他主流方案在分布式系统中，如何高效生成全局唯一ID是一个关键挑战。本文将深入剖析雪花算法、UUID及多种主流ID生成方案，帮助开发者根据业务场景选择最佳方案。一、为什么需要分布式ID？在分布式系统中，传统数据库自增ID存在明显瓶颈：单点故障：依赖单数据库实例扩展困难：分库分表时ID冲突安全风险：连续ID暴露业务量性能瓶颈：高并发下成为系统瓶
基于MySQL的分布式锁实现（Spring Boot + MyBatis） weixin_43833540 mysql 分布式 spring boot
基于MySQL的分布式锁实现（SpringBoot+MyBatis）实现原理基于数据库的唯一索引特性实现分布式锁，通过插入唯一索引记录表示获取锁，删除记录表示释放锁。1.创建锁表首先需要在MySQL中创建一个锁表，用于存储锁信息：CREATETABLE`distributed_lock`(`id`bigint(20)NOTNULLAUTO_INCREMENT,`lock_key`varchar(6
Python HTTP服务监控：Prometheus与自定义Exporter开发指南
在微服务架构中，HTTP服务的高效监控对保障系统稳定性至关重要。Prometheus作为云原生监控标杆，通过其Pull模型与灵活的指标体系，结合Python开发的自定义Exporter，可实现HTTP服务性能、可用性及业务指标的全面观测。Prometheus监控核心机制Prometheus采用时间序列数据库存储指标数据，每条数据由指标名称（如http_requests_total）、标签（如met
解决报错:错误1130- Host xxx is not allowed to connect to this MariaDb server phymat.nico 系统内核
这个问题是因为在数据库服务器中的mysql数据库中的user的表中没有权限(也可以说没有用户)，下面将记录我遇到问题的过程及解决的方法。在搭建完LNMP环境后用Navicate连接出错遇到这个问题首先到mysql所在的服务器上用连接进行处理1、连接服务器:mysql-uroot-p2、看当前所有数据库：showdatabases;3、进入mysql数据库：usemysql;4、查看mysql数据库
C#使用ExcelDataReader高效读取excel文件写入数据库香煎三文鱼 .net core .Net6 C#C#读取excel
分享一个库ExcelDataReader，它专注读取、支持.xls/.xlsx、内存优化。首先安装NuGet包dotnetaddpackageExcelDataReaderdotnetaddpackageSystem.Text.Encoding.CodePages编码内存优化：每次仅读取一行，适合处理百万级数据。类型安全方法：可用GetString(0)、GetDouble(1)等强类型方法（需确
工业控制系统安全综述罗思付之技术屋物联网及AI前沿技术专栏安全网络 web安全
摘要工业控制系统除了应用于生产制造行业外，还广泛应用于交通、水利和电力等关键基础设施.随着工业数字化、网络化、智能化的推进，许多新技术应用于工业控制系统，提高了工业控制系统的智能化水平，但其也给工业控制系统的安全带来严峻的挑战.因此，工业控制系统的安全倍受研究人员的关注.为了让研究人员系统化地了解目前的研究进展，调研了近3年WebofScience核心数据库、EI数据库和CCF推荐网络与信息安全国
嵌入式linux下基于boa cgic sqlite3的ajax web服务器搭建モザイクカケラ嵌入式linux-web 嵌入式系统开发 boa cgic sqlite3 嵌入式linux ajax
先上大家的资源全部亲测可用sqlite3数据库c语言常用接口应用实例sqlite3数据库交叉编译并移植到嵌入式开发环境步骤fprintf与stderr、stdout的使用Windows中IIS服务器被防火墙阻止导致外网无法访问sqlite3.OperationalError:unabletoopendatabasefileSQLiteDelete语句SQLite数据库中rowid使用基本操作交叉编
MyBatis逆向工程生成 (生成pojo、mapper.xml、mapper.java) weixin_30701521 java 数据库
MyBatis逆向工程生成mybatis需要程序员自己编写sql语句，mybatis官方提供逆向工程，可以针对单表自动生成mybatis执行所需要的代码（mapper.java、mapper.xml、pojo…），可以让程序员将更多的精力放在繁杂的业务逻辑上。企业实际开发中，常用的逆向工程方式：由数据库的表生成java代码。之所以强调单表两个字，是因为Mybatis逆向工程生成的Mapper所进行
Django ORM 1. 创建模型（Model）博观而约取 Python django 数据库 python
1.ORM介绍什么是ORM？ORM，全称Object-RelationalMapping（对象关系映射），一种通过对象操作数据库的技术。它的核心思想是：我们不直接写SQL，而是用Python对象（类/实例）来操作数据库表和记录。ORM就像一个“翻译官”，帮我们把Python代码翻译成数据库能听懂的SQL命令。为什么使用ORM?Django中的ORM提供了一个高层次、抽象化的接口来操作数据库，它的优
探秘SQLite：打造高效嵌入式数据库应用的实用指南 dfvcbipanjr 数据库 sqlite oracle python
探秘SQLite：打造高效嵌入式数据库应用的实用指南SQLite是一种广泛应用的嵌入式数据库引擎，因其不依赖于独立的服务器进程，且在各大操作系统、浏览器、手机等设备中都能找到它的身影，成为开发者的首选。这篇文章旨在介绍SQLite的基本概念、使用方法以及一些实用的编程示例，帮助您更好地在应用中嵌入SQLite数据库。主要内容1.SQLite简介SQLite是用C语言编写的一个轻量级数据库引擎，被设
SQLite 数据库在大数据分析中的应用潜力数据库管理艺术数据库 sqlite 数据分析 ai
SQLite数据库在大数据分析中的应用潜力关键词：SQLite、大数据分析、轻量级数据库、嵌入式数据库、数据仓库、OLAP、性能优化摘要：本文深入探讨了SQLite这一轻量级嵌入式数据库在大数据分析领域的应用潜力。我们将从SQLite的核心架构出发，分析其在大数据场景下的优势和限制，并通过实际案例展示如何通过优化策略和扩展技术使SQLite能够处理大规模数据集。文章包含性能对比测试、优化技巧和实际
实体，dto，vo三种pojo的区别和联系不爱吃大饼 java
在软件开发，特别是Java应用程序中，实体（Entity）、数据传输对象（DTO，DataTransferObject）和视图对象（VO，ViewObject）是三种常见的对象类型。它们各自有不同的责任和用途。下面是对它们的定义、区别和联系的详细解释。1.实体（Entity）定义：实体是与数据库表直接对应的对象，通常用于持久化层。它映射到数据库中的一行记录，每个实体对象的属性对应数据库表中的字段。
SQLite3 在嵌入式系统中的应用指南指令集诗人 sqlite3 sqlite 数据库嵌入式实时数据库
SQLite3在嵌入式系统中的应用指南一、嵌入式系统中SQLite3的优势SQLite3是嵌入式系统的理想数据库解决方案，具有以下核心优势：特性嵌入式系统价值典型指标轻量级适合资源受限环境库大小：500-700KB零配置无需数据库管理员开箱即用无服务器减少系统复杂性无后台进程低功耗延长电池寿命读操作：~0.001mAh高可靠性应对意外断电ACID事务保证单文件存储简化数据管理单个.db文件二、嵌入
DTO、VO、POJO与实体类使用方案（结合Mapper.xml） csdn_HPL xml windows
结合MyBatis的Mapper.xml文件，展示完整的层级数据流转和数据库操作。1.实体类优化（Entity）//User.java@Data@NoArgsConstructor@AllArgsConstructor@TableName("sys_user")publicclassUser{@TableId(type=IdType.AUTO)privateLonguserId;@NotBlank
鸿蒙线程池全揭秘：让你的应用快、稳、省资源 harmonyos
摘要在现代应用开发中，多线程已经成为提升程序性能、优化用户体验的关键手段。尤其是在HarmonyOS（鸿蒙系统）这种强调分布式、并发处理的系统架构中，合理使用多线程不仅可以让程序运行更高效，还能帮助我们处理复杂的后台任务，比如文件下载、数据库操作、网络请求等。引言鸿蒙系统作为面向多设备融合的新一代操作系统，其支持的多线程模型与传统Android十分类似。很多Java的线程操作方法在鸿蒙中依然适用。
鸿蒙关系型数据库实战：高效数据存储与管理数据库harmonyos
在鸿蒙应用开发中，关系型数据库（RDB）是结构化数据存储的核心方案。通过深度实践，其基于SQLite的轻量级实现不仅性能出色，更提供了强大的事务支持和类型安全。以下是关键经验总结：三大核心优势：SQL兼容：完整支持SQL92标准语法线程安全：内置多线程读写锁机制加密存储：支持AES-256加密敏感数据关系型数据库实战封装及使用：在Utils目录下新建一个RdbUtils文件//./src/main
【Golang】用gorm实现分页的功能在成都搬砖的鸭鸭 Golang golang 开发语言后端 1024程序员节
目录1、背景2、go库下载3、初始化数据【1】建表【2】插入数据【3】查看数据4、代码示例【1】gorm结构体定义【2】分页结构体定义【3】封装分页方法【4】封装获取数据库连接方法【5】查询列表接口【6】启动http服务【7】调用获取列表接口5、总结1、背景在提供列表接口时一般要用到分页，对于存储在某些数据库中的数据进行分页起来非常的方便，下文给出一个通过gorm进行分页并通过http返回数据的例
后端技术：利用 MySQL 实现数据加密大厂资深架构师 Spring Boot 开发实战 mysql 数据库 ai
后端技术：利用MySQL实现数据加密关键词：MySQL数据加密、AES加密、数据库安全、数据保护、加密算法、密钥管理、SQL注入防御摘要：本文深入探讨如何在MySQL数据库中实现数据加密，保护敏感信息免受未授权访问。我们将从加密的基本原理出发，详细讲解MySQL支持的多种加密方式，包括AES、SHA等算法的实现方法。文章包含完整的代码示例和最佳实践，帮助开发者在实际项目中应用数据加密技术，同时讨论
DAO模式红中马喽 java 数据库开发语言笔记学习后端设计模式
前言DAO（DataAccessObject）模式是一种常用的设计模式，主要用于将数据访问逻辑与业务逻辑分离。它提供了一种抽象层，使得应用程序可以与不同的数据源（如数据库、文件系统等）进行交互，而无需了解底层数据存储的细节。DAO模式的核心思想是将数据访问操作封装在独立的类中，从而提高代码的可维护性、可扩展性和可重用性。如何使用DAO模式1.首先导入这个包（有需要的可以私聊我）然后添加配置文件，为
番外：MySQL的一些事务处理红中马喽 mysql 数据库学习笔记开发语言后端
前言因为前天没更新，多补一更，简单介绍一下后端数据库MySQL的事务处理什么是事务处理事务（Transaction）：事务是一组SQL语句的执行单元，这些语句被视为一个单独的工作单元。事务的主要目的是保证数据库操作的原子性，即这些操作要么全部执行，要么全部不执行简单来说，事务是用来保证数据库的一致性，完整性的，关于事务处理我们需要提到ACID性A.原子性（Atomicity）：事务中的所有操作要么
安装mysql数据库的一系列心得
以下是详细的MySQL数据库安装教程：Windows系统一、下载安装包1.打开浏览器，访问MySQL官方网站（https://dev.mysql.com/downloads/mysql/）。2.在下载页面，根据你的Windows操作系统版本（32位或64位）选择合适的MySQLCommunityServer安装包。一般推荐下载最新的稳定版本。3.下载完成后，找到安装文件（.msi格式）。二、安装过
GORM深度解析：模型定义与数据库迁移最佳实践 Golang编程笔记数据库 oracle ai
GORM深度解析：模型定义与数据库迁移最佳实践关键词：GORM、模型定义、数据库迁移、最佳实践、Go语言摘要：本文深入探讨了GORM这一强大的Go语言ORM库，详细介绍了模型定义的方法和技巧，以及数据库迁移的最佳实践。通过通俗易懂的语言和丰富的实例，帮助读者理解GORM的核心概念，掌握如何利用GORM高效地进行数据库操作。背景介绍目的和范围在Go语言开发中，与数据库进行交互是一项常见的任务。GOR
鸿蒙线程池全揭秘：让你的应用快、稳、省资源前端世界 harmonyos harmonyos 华为
摘要在现代应用开发中，多线程已经成为提升程序性能、优化用户体验的关键手段。尤其是在HarmonyOS（鸿蒙系统）这种强调分布式、并发处理的系统架构中，合理使用多线程不仅可以让程序运行更高效，还能帮助我们处理复杂的后台任务，比如文件下载、数据库操作、网络请求等。引言鸿蒙系统作为面向多设备融合的新一代操作系统，其支持的多线程模型与传统Android十分类似。很多Java的线程操作方法在鸿蒙中依然适用。
MySQL 中的锁机制详解：原理、实现方式与实战解析！程序猿Mr.wu MySQL mysql 数据库
MySQL中的锁机制详解：原理、实现方式与实战解析！锁的世界，比你想象得更精彩！一、为什么要有锁？在并发环境下，多线程操作数据库的同一份数据时，如果没有锁机制，可能会出现以下问题：脏读：读取了另一个事务未提交的数据。不可重复读：同一事务中多次读取结果不一致。幻读：读取时发现记录“凭空”出现或消失。锁的存在，就是为了保证并发情况下的数据一致性与隔离性。二、MySQL中锁的分类1.按作用范围分类分类说
Mysql回表查询：深入解析与实战应用需要重新演唱 mysql mysql 数据库
Mysql回表查询：深入解析与实战应用今天，我们将深入探讨Mysql中的回表查询。回表查询是Mysql索引机制中的一个重要概念，理解它的工作原理和优化方法，对于提升数据库查询性能至关重要。让我们一起揭开回表查询的神秘面纱。1.什么是回表查询？回表查询（LookupQuery）是指在使用非聚集索引（Non-ClusteredIndex）进行查询时，如果需要获取的数据不在索引页中，就需要根据索引页中的
SQL Server的个人学习笔记萌尛喵 sql 学习数据库
1.基础SQLServer是由Microsoft开发和销售的关系数据库管理系统或RDBMS。SQLServer建立于SOL之上，是一种用于关系数据交互的标准编程语言。2.组件SQLServer主要由数据库引擎和SQLOS两个组件组成。①数据库引擎SQLServer的核心组件是数据库引擎。数据库引擎由处理查询的关系引擎和管理数据库文件、页面、索引等的存储组成。数据库引擎也创建并执行数据库对象，如存储
什么是 Paxos和Raft MonkeyKing.sun paxos raft
Raft和Paxos是两种经典的分布式一致性算法（ConsensusAlgorithms），广泛应用于数据库、分布式系统、微服务架构中，用来确保在多个节点中即使有部分节点故障，系统仍然可以就“某一值”达成一致（即：分布式共识）。它们不是区块链专属，但在联盟链、私有链或数据库复制系统中常被用来替代PoW、PBFT等共识机制。一、什么是Paxos？定义：Paxos是一种保证在部分节点失效或网络延迟时，
SQLserver数据库学习笔记溪衡学习
小记1：1.newid()我觉得是一个生成唯一键的好方法，不用自增控制主键，可以用这个试试，注意不做处理的话，需要36位。例如：在数据库中直接使用语句selectnewid()2.nolock按我的理解是“不上锁的”，所谓的脏读，大多用的都是这个东西，据说可以提高查询速度。3.go批处理语句，将前面的代码作为一批处理。4.内连接与简单多表在数据量少的时候查询速度差距并不明显。5.删除和更新数据时，
SQL学习笔记1
1.数据库1、什么是数据库数据库（DB）即用于存放数据的服务器，如MySQL等软件是数据库管理系统（DBMS），用于管理存放在数据库中的数据，SQL是用于操作DBMS的标准语言。2、数据库的类型数据库分为关系型数据库和非关系型数据库；关系型数据库是指用建立在关系模型上互相关联的二维表组成的数据库，MySQL是用于管理关系型数据库的数据库管理系统2.MySQL启动与连接1、MySQL启动安装好MyS
Centos7.9+mysql8.0开启指定IP远程连接数据库洋滔服务器数据库 tcp/ip mysql
公司服务器换了，需要重新搭建下web环境，在配置mysql远程连接的时候碰到了几个坑，之前也配置过，但这次由于mysql版本的不同，配置方法稍微不同，这里做个记录。首先是，创建mysql用户，命令如下CREATEUSER'jkxtc178'@'215.55.284.149';@‘IP’，如果你不想指定ip访问，使用%即可，下边的命令出现@'IP’的都是这样。然后是设置用户登陆密码：ALTERUSE
[黑洞与暗粒子]没有光的世界 comsci
无论是相对论还是其它现代物理学,都显然有个缺陷,那就是必须有光才能够计算但是,我相信,在我们的世界和宇宙平面中,肯定存在没有光的世界.... 那么,在没有光的世界,光子和其它粒子的规律无法被应用和考察,那么以光速为核心的 &nbs
jQuery Lazy Load 图片延迟加载 aijuans jquery
基于 jQuery 的图片延迟加载插件，在用户滚动页面到图片之后才进行加载。对于有较多的图片的网页，使用图片延迟加载，能有效的提高页面加载速度。版本： jQuery v1.4.4+ jQuery Lazy Load v1.7.2 注意事项：需要真正实现图片延迟加载，必须将真实图片地址写在 data-original 属性中。若 src
使用Jodd的优点 Kai_Ge jodd
1. 简化和统一 controller ，抛弃 extends SimpleFormController ，统一使用 implements Controller 的方式。 2. 简化 JSP 页面的 bind, 不需要一个字段一个字段的绑定。 3. 对 bean 没有任何要求，可以使用任意的 bean 做为 formBean。使用方法简介
jpa Query转hibernate Query 120153216 Hibernate
public List<Map> getMapList(String hql, Map map) { org.hibernate.Query jpaQuery = entityManager.createQuery(hql); if (null != map) { for (String parameter : map.keySet()) { jp
Django_Python3添加MySQL/MariaDB支持 2002wmj mariaDB
现状首先，[email protected] 中默认的引擎为 django.db.backends.mysql 。但是在Python3中如果这样写的话，会发现 django.db.backends.mysql 依赖 MySQLdb[5] ，而 MySQLdb 又不兼容 Python3 于是要找一种新的方式来继续使用MySQL。 MySQL官方的方案首先据MySQL文档[3]说，自从MySQL
在SQLSERVER中查找消耗IO最多的SQL 357029540 SQL Server
返回做IO数目最多的50条语句以及它们的执行计划。 select top 50 (total_logical_reads/execution_count) as avg_logical_reads, (total_logical_writes/execution_count) as avg_logical_writes, (tot
spring UnChecked 异常官方定义！ 7454103 spring
如果你接触过spring的事物管理！那么你必须明白 spring的非捕获异常！即 unchecked 异常！因为 spring 默认这类异常事物自动回滚！！ public static boolean isCheckedException(Throwable ex) { return !(ex instanceof RuntimeExcep
mongoDB 入门指南、示例 adminjun java mongodb 操作
一、准备工作 1、下载mongoDB 下载地址：http://www.mongodb.org/downloads 选择合适你的版本相关文档：http://www.mongodb.org/display/DOCS/Tutorial 2、安装mongoDB A、不解压模式：将下载下来的mongoDB-xxx.zip打开，找到bin目录，运行mongod.exe就可以启动服务，默
CUDA 5 Release Candidate Now Available aijuans CUDA
The CUDA 5 Release Candidate is now available at http://developer.nvidia.com/<wbr></wbr>cuda/cuda-pre-production. Now applicable to a broader set of algorithms, CUDA 5 has advanced fe
Essential Studio for WinRT网格控件测评 Axiba JavaScript html5
Essential Studio for WinRT界面控件包含了商业平板应用程序开发中所需的所有控件，如市场上运行速度最快的grid 和chart、地图、RDL报表查看器、丰富的文本查看器及图表等等。同时，该控件还包含了一组独特的库，用于从WinRT应用程序中生成Excel、Word以及PDF格式的文件。此文将对其另外一个强大的控件——网格控件进行专门的测评详述。网格控件功能 1、
java 获取windows系统安装的证书或证书链 bewithme windows
有时需要获取windows系统安装的证书或证书链，比如说你要通过证书来创建java的密钥库。有关证书链的解释可以查看此处。 public static void main(String[] args) { SunMSCAPI providerMSCAPI = new SunMSCAPI(); S
NoSQL数据库之Redis数据库管理(set类型和zset类型) bijian1013 redis 数据库 NoSQL
4.sets类型 Set是集合，它是string类型的无序集合。set是通过hash table实现的，添加、删除和查找的复杂度都是O(1)。对集合我们可以取并集、交集、差集。通过这些操作我们可以实现sns中的好友推荐和blog的tag功能。 sadd：向名称为key的set中添加元
异常捕获何时用Exception，何时用Throwable bingyingao
用Exception的情况 try { //可能发生空指针、数组溢出等异常 } catch (Exception e) {
【Kafka四】Kakfa伪分布式安装 bit1129 kafka
在http://bit1129.iteye.com/blog/2174791一文中，实现了单Kafka服务器的安装，在Kafka中，每个Kafka服务器称为一个broker。本文简单介绍下，在单机环境下Kafka的伪分布式安装和测试验证 1. 安装步骤 Kafka伪分布式安装的思路跟Zookeeper的伪分布式安装思路完全一样，不过比Zookeeper稍微简单些(不
Project Euler bookjovi haskell
Project Euler是个数学问题求解网站，网站设计的很有意思，有很多problem，在未提交正确答案前不能查看problem的overview，也不能查看关于problem的discussion thread，只能看到现在problem已经被多少人解决了，人数越多往往代表问题越容易。看看problem 1吧： Add all the natural num
Java-Collections Framework学习与总结-ArrayDeque BrokenDreams Collections
表、栈和队列是三种基本的数据结构，前面总结的ArrayList和LinkedList可以作为任意一种数据结构来使用，当然由于实现方式的不同，操作的效率也会不同。这篇要看一下java.util.ArrayDeque。从命名上看
读《研磨设计模式》-代码笔记-装饰模式-Decorator bylijinnan java 设计模式
声明：本文只为方便我个人查阅和理解，详细的分析以及源代码请移步原作者的博客http://chjavach.iteye.com/ import java.io.BufferedOutputStream; import java.io.DataOutputStream; import java.io.FileOutputStream; import java.io.Fi
Maven学习(一) chenyu19891124 Maven私服
学习一门技术和工具总得花费一段时间，5月底6月初自己学习了一些工具，maven+Hudson+nexus的搭建，对于maven以前只是听说，顺便再自己的电脑上搭建了一个maven环境，但是完全不了解maven这一强大的构建工具，还有ant也是一个构建工具，但ant就没有maven那么的简单方便，其实简单点说maven是一个运用命令行就能完成构建，测试，打包，发布一系列功
[原创]JWFD工作流引擎设计----节点匹配搜索算法(用于初步解决条件异步汇聚问题) 补充 comsci 算法工作 PHP 搜索引擎嵌入式
本文主要介绍在JWFD工作流引擎设计中遇到的一个实际问题的解决方案，请参考我的博文"带条件选择的并行汇聚路由问题"中图例A2描述的情况(http://comsci.iteye.com/blog/339756),我现在把我对图例A2的一个解决方案公布出来，请大家多指点节点匹配搜索算法(用于解决标准对称流程图条件汇聚点运行控制参数的算法) 需要解决的问题：已知分支
Linux中用shell获取昨天、明天或多天前的日期 daizj linux shell 上几年昨天获取上几个月
在Linux中可以通过date命令获取昨天、明天、上个月、下个月、上一年和下一年 # 获取昨天 date -d 'yesterday' # 或 date -d 'last day' # 获取明天 date -d 'tomorrow' # 或 date -d 'next day' # 获取上个月 date -d 'last month' #
我所理解的云计算 dongwei_6688 云计算
在刚开始接触到一个概念时，人们往往都会去探寻这个概念的含义，以达到对其有一个感性的认知，在Wikipedia上关于“云计算”是这么定义的，它说： Cloud computing is a phrase used to describe a variety of computing co
YII CMenu配置 dcj3sjt126com yii
Adding id and class names to CMenu We use the id and htmlOptions to accomplish this. Watch. //in your view $this->widget('zii.widgets.CMenu', array( 'id'=>'myMenu', 'items'=>$this-&g
设计模式之静态代理与动态代理 come_for_dream 设计模式
静态代理与动态代理代理模式是java开发中用到的相对比较多的设计模式，其中的思想就是主业务和相关业务分离。所谓的代理设计就是指由一个代理主题来操作真实主题，真实主题执行具体的业务操作，而代理主题负责其他相关业务的处理。比如我们在进行删除操作的时候需要检验一下用户是否登陆，我们可以删除看成主业务，而把检验用户是否登陆看成其相关业务
【转】理解Javascript 系列 gcc2ge JavaScript
理解Javascript_13_执行模型详解摘要: 在《理解Javascript_12_执行模型浅析》一文中,我们初步的了解了执行上下文与作用域的概念，那么这一篇将深入分析执行上下文的构建过程，了解执行上下文、函数对象、作用域三者之间的关系。函数执行环境简单的代码:当调用say方法时，第一步是创建其执行环境，在创建执行环境的过程中，会按照定义的先后顺序完成一系列操作:1.首先会创建一个
Subsets II hcx2013 set
Given a collection of integers that might contain duplicates, nums, return all possible subsets. Note: Elements in a subset must be in non-descending order. The solution set must not conta
Spring4.1新特性——Spring缓存框架增强 jinnianshilongnian spring4
目录 Spring4.1新特性——综述 Spring4.1新特性——Spring核心部分及其他 Spring4.1新特性——Spring缓存框架增强 Spring4.1新特性——异步调用和事件机制的异常处理 Spring4.1新特性——数据库集成测试脚本初始化 Spring4.1新特性——Spring MVC增强 Spring4.1新特性——页面自动化测试框架Spring MVC T
shell嵌套expect执行命令 liyonghui160com
一直都想把expect的操作写到bash脚本里,这样就不用我再写两个脚本来执行了,搞了一下午终于有点小成就,给大家看看吧. 系统:centos 5.x 1.先安装expect yum -y install expect 2.脚本内容: cat auto_svn.sh #!/bin/bash
Linux实用命令整理 pda158 linux
0. 基本命令　　linux 基本命令整理　　1. 压缩解压　　tar -zcvf a.tar.gz a #把a压缩成a.tar.gz 　　tar -zxvf a.tar.gz #把a.tar.gz解压成a 　　2. vim小结　　2.1 vim替换　　:m,ns/word_1/word_2/gc
独立开发人员通向成功的29个小贴士 shoothao 独立开发
概述：本文收集了关于独立开发人员通向成功需要注意的一些东西,对于具体的每个贴士的注解有兴趣的朋友可以查看下面标注的原文地址。明白你从事独立开发的原因和目的。保持坚持制定计划的好习惯。万事开头难，第一份订单是关键。培养多元化业务技能。提供卓越的服务和品质。谨小慎微。营销是必备技能。学会组织，有条理的工作才是最有效率的。 “独立
JAVA中堆栈和内存分配原理 uule java
1、栈、堆 1.寄存器：最快的存储区, 由编译器根据需求进行分配,我们在程序中无法控制.2. 栈：存放基本类型的变量数据和对象的引用，但对象本身不存放在栈中，而是存放在堆（new 出来的对象）或者常量池中（字符串常量对象存放在常量池中。）3. 堆：存放所有new出来的对象。4. 静态域：存放静态成员（static定义的）5. 常量池：存放字符串常量和基本类型常量（public static f