qxy0503

[2021-06-15] The Internals of PostgreSQL (Part 1)

Chapter 1

Database Cluster, Databases, and Tables

This chapter and the next summarize the basic knowledge of PostgreSQL needed to read the subsequent chapters. This chapter covers the following topics:

The logical structure of a database cluster
The physical structure of a database cluster
The internal layout of a heap table file
The methods of writing and reading data to a table

If you are already familiar with them, you may skip over this chapter.

1.1. Logical Structure of Database Cluster

A database cluster is a collection of databases managed by a PostgreSQL server. If this is the first time you have heard this definition, it may seem surprising: the term ‘database cluster’ in PostgreSQL does not mean ‘a group of database servers’. A PostgreSQL server runs on a single host and manages a single database cluster.

Figure 1.1 shows the logical structure of a database cluster. A database is a collection of database objects. In relational database theory, a database object is a data structure used either to store or to reference data. A (heap) table is a typical example, and there are many others, such as indexes, sequences, views, and functions. In PostgreSQL, databases themselves are also database objects and are logically separated from each other. All other database objects (e.g., tables, indexes, etc.) belong to their respective databases.

Fig. 1.1. Logical structure of a database cluster.

All database objects in PostgreSQL are internally managed by their respective object identifiers (OIDs), which are unsigned 4-byte integers. The relations between database objects and their OIDs are stored in the appropriate system catalogs, depending on the type of object. For example, the OIDs of databases and heap tables are stored in pg_database and pg_class respectively, so you can look up an OID with queries such as the following:

sampledb=# SELECT datname, oid FROM pg_database WHERE datname = 'sampledb';
 datname  |  oid  
----------+-------
 sampledb | 16384
(1 row)

sampledb=# SELECT relname, oid FROM pg_class WHERE relname = 'sampletbl';
  relname  |  oid  
-----------+-------
 sampletbl | 18740 
(1 row)

1.2. Physical Structure of Database Cluster

A database cluster is essentially one directory, referred to as the base directory, which contains some subdirectories and many files. When you execute the initdb utility to initialize a new database cluster, a base directory is created under the specified directory. Though it is not compulsory, the path of the base directory is usually set in the environment variable PGDATA.

Figure 1.2 shows an example of a database cluster in PostgreSQL. A database is a subdirectory under the base subdirectory, and each table and index is (at least) one file stored under the subdirectory of the database to which it belongs. There are also several subdirectories containing particular data, and configuration files. While PostgreSQL supports tablespaces, the meaning of the term differs from that in other RDBMSs. A tablespace in PostgreSQL is one directory that contains some data outside of the base directory.

Fig. 1.2. An example of database cluster.

In the following subsections, the layout of a database cluster, databases, files associated with tables and indexes, and the tablespace in PostgreSQL are described.

1.2.1. Layout of a Database Cluster

The layout of a database cluster is described in the official documentation. The main files and subdirectories from that documentation are listed in Table 1.1:

Table 1.1: Layout of files and subdirectories under the base directory (from the official documentation)

files	description
PG_VERSION	A file containing the major version number of PostgreSQL
pg_hba.conf	A file to control PostgreSQL's client authentication
pg_ident.conf	A file to control PostgreSQL's user name mapping
postgresql.conf	A file to set configuration parameters
postgresql.auto.conf	A file used for storing configuration parameters that are set in ALTER SYSTEM (version 9.4 or later)
postmaster.opts	A file recording the command line options the server was last started with
subdirectories	description
base/	Subdirectory containing per-database subdirectories.
global/	Subdirectory containing cluster-wide tables, such as pg_database and pg_control.
pg_commit_ts/	Subdirectory containing transaction commit timestamp data. Version 9.5 or later.
pg_clog/ (Version 9.6 or earlier)	Subdirectory containing transaction commit state data. It is renamed to pg_xact in Version 10. CLOG will be described in Section 5.4.
pg_dynshmem/	Subdirectory containing files used by the dynamic shared memory subsystem. Version 9.4 or later.
pg_logical/	Subdirectory containing status data for logical decoding. Version 9.4 or later.
pg_multixact/	Subdirectory containing multitransaction status data (used for shared row locks)
pg_notify/	Subdirectory containing LISTEN/NOTIFY status data
pg_repslot/	Subdirectory containing replication slot data. Version 9.4 or later.
pg_serial/	Subdirectory containing information about committed serializable transactions (version 9.1 or later)
pg_snapshots/	Subdirectory containing exported snapshots (version 9.2 or later). PostgreSQL's pg_export_snapshot function creates a snapshot information file in this subdirectory.
pg_stat/	Subdirectory containing permanent files for the statistics subsystem.
pg_stat_tmp/	Subdirectory containing temporary files for the statistics subsystem.
pg_subtrans/	Subdirectory containing subtransaction status data
pg_tblspc/	Subdirectory containing symbolic links to tablespaces
pg_twophase/	Subdirectory containing state files for prepared transactions
pg_wal/ (Version 10 or later)	Subdirectory containing WAL (Write Ahead Logging) segment files. It is renamed from pg_xlog in Version 10.
pg_xact/ (Version 10 or later)	Subdirectory containing transaction commit state data. It is renamed from pg_clog in Version 10. CLOG will be described in Section 5.4.
pg_xlog/ (Version 9.6 or earlier)	Subdirectory containing WAL (Write Ahead Logging) segment files. It is renamed to pg_wal in Version 10.

1.2.2. Layout of Databases

A database is a subdirectory under the base subdirectory, and each database directory name is identical to the OID of the database. For example, when the OID of the database sampledb is 16384, its subdirectory name is 16384.

$ cd $PGDATA
$ ls -ld base/16384
drwx------  213 postgres postgres  7242  8 26 16:33 16384

1.2.3. Layout of Files Associated with Tables and Indexes

Each table or index whose size is less than 1GB is a single file stored under the directory of the database it belongs to. Tables and indexes as database objects are internally managed by individual OIDs, while their data files are managed by the variable relfilenode. The relfilenode values of tables and indexes usually, but not always, match the respective OIDs; the details are described below.

Let's look at the OID and relfilenode of the table sampletbl:

sampledb=# SELECT relname, oid, relfilenode FROM pg_class WHERE relname = 'sampletbl';
  relname  |  oid  | relfilenode
-----------+-------+-------------
 sampletbl | 18740 |       18740 
(1 row)

From the result above, you can see that both oid and relfilenode values are equal. You can also see that the data file path of the table sampletbl is 'base/16384/18740'.

$ cd $PGDATA
$ ls -la base/16384/18740
-rw------- 1 postgres postgres 8192 Apr 21 10:21 base/16384/18740
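The mapping from catalog values to a file path can be sketched in a few lines of Python. This is only an illustration under the assumption that the relation lives in the default tablespace and is smaller than 1GB; the function name heap_file_path is hypothetical and is not part of PostgreSQL.

```python
import posixpath


def heap_file_path(db_oid: int, relfilenode: int) -> str:
    """Return the data file path relative to the base directory (PGDATA).

    Hypothetical helper: assumes the default tablespace and a relation
    that fits in a single file.
    """
    return posixpath.join("base", str(db_oid), str(relfilenode))


# Values taken from the sampledb / sampletbl example above.
print(heap_file_path(16384, 18740))  # base/16384/18740
```

Note that the second argument is the relfilenode, not the OID, because the two values can diverge.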

The relfilenode values of tables and indexes are changed by issuing some commands (e.g., TRUNCATE, REINDEX, CLUSTER). For example, if we truncate the table sampletbl, PostgreSQL assigns a new relfilenode (18812) to the table, removes the old data file (18740), and creates a new one (18812).

sampledb=# TRUNCATE sampletbl;
TRUNCATE TABLE

sampledb=# SELECT relname, oid, relfilenode FROM pg_class WHERE relname = 'sampletbl';
  relname  |  oid  | relfilenode
-----------+-------+-------------
 sampletbl | 18740 |       18812 
(1 row)

In version 9.0 or later, the built-in function pg_relation_filepath is useful as this function returns the file path name of the relation with the specified OID or name.

sampledb=# SELECT pg_relation_filepath('sampletbl');
 pg_relation_filepath 
----------------------
 base/16384/18812
(1 row)

When the file size of a table or index exceeds 1GB, PostgreSQL creates a new file named relfilenode.1 and uses it. When that file fills up, the next file, named relfilenode.2, is created, and so on.

$ cd $PGDATA
$ ls -la -h base/16384/19427*
-rw------- 1 postgres postgres 1.0G  Apr  21 11:16 base/16384/19427
-rw------- 1 postgres postgres  45M  Apr  21 11:20 base/16384/19427.1
...

The maximum file size of tables and indexes can be changed with the configure option --with-segsize when building PostgreSQL.
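The segment naming rule can be illustrated with a little arithmetic. The sketch below assumes the default build settings (1GB segments and the 8192-byte pages described in Section 1.3); the function name locate_block is hypothetical and only mirrors the naming scheme shown above.

```python
from typing import Tuple

BLOCK_SIZE = 8192                                  # default page size in bytes (assumed)
SEGMENT_SIZE = 1024 ** 3                           # default segment size, 1GB (assumed)
BLOCKS_PER_SEGMENT = SEGMENT_SIZE // BLOCK_SIZE    # 131072 pages per segment file


def locate_block(relfilenode: int, block_number: int) -> Tuple[str, int]:
    """Return (segment file name, byte offset inside that segment file)."""
    segment_no, block_in_segment = divmod(block_number, BLOCKS_PER_SEGMENT)
    # The first segment has no suffix; later ones are relfilenode.1, relfilenode.2, ...
    name = str(relfilenode) if segment_no == 0 else f"{relfilenode}.{segment_no}"
    return name, block_in_segment * BLOCK_SIZE


print(locate_block(19427, 0))        # ('19427', 0)
print(locate_block(19427, 131072))   # ('19427.1', 0)
print(locate_block(19427, 131073))   # ('19427.1', 8192)
```

With these defaults, the first page that no longer fits in the first file is page number 131072, which becomes the first page of relfilenode.1.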

If you look carefully at the database subdirectories, you will find that each table has two associated files, suffixed with '_fsm' and '_vm' respectively. These are the free space map and the visibility map, which store information about the free space capacity and the visibility of each page within the table file, respectively (see Section 5.3.4 and Section 6.2 for details). Indexes have only a free space map and do not have a visibility map.

A specific example is shown below:

$ cd $PGDATA
$ ls -la base/16384/18751*
-rw------- 1 postgres postgres  8192 Apr 21 10:21 base/16384/18751
-rw------- 1 postgres postgres 24576 Apr 21 10:18 base/16384/18751_fsm
-rw------- 1 postgres postgres  8192 Apr 21 10:18 base/16384/18751_vm

They may also be internally referred to as the forks of each relation; the free space map is the first fork of the table/index data file (the fork number is 1), the visibility map the second fork of the table's data file (the fork number is 2). The fork number of the data file is 0.
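As a small illustration of the fork numbering, the following Python sketch maps the three forks mentioned above to their file name suffixes. The names Fork and fork_file_name are hypothetical; only the fork numbers and suffixes come from the text, and no claim is made that these are the only forks PostgreSQL defines.

```python
from enum import IntEnum


class Fork(IntEnum):
    MAIN = 0   # the table/index data file itself
    FSM = 1    # free space map
    VM = 2     # visibility map (tables only)


# File name suffix used for each fork; the data file has no suffix.
FORK_SUFFIX = {Fork.MAIN: "", Fork.FSM: "_fsm", Fork.VM: "_vm"}


def fork_file_name(relfilenode: int, fork: Fork) -> str:
    """Return the file name of the given fork of a relation."""
    return f"{relfilenode}{FORK_SUFFIX[fork]}"


for fork in Fork:
    print(int(fork), fork_file_name(18751, fork))
```

For relfilenode 18751 this yields the three file names shown in the listing above.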

1.2.4. Tablespaces

A tablespace in PostgreSQL is an additional data area outside the base directory. This feature was implemented in version 8.0.

Figure 1.3 shows the internal layout of a tablespace, and the relationship with the main data area.

Fig. 1.3. A Tablespace in the Database Cluster.

A tablespace is created under the directory specified when you issue the CREATE TABLESPACE statement, and a version-specific subdirectory (e.g., PG_14_202011044) is created under that directory. The naming convention for the version-specific subdirectory is shown below.

PG _ 'Major version' _ 'Catalogue version number'

For example, if you create a tablespace 'new_tblspc' at '/home/postgres/tblspc', whose oid is 16386, a subdirectory such as 'PG_14_202011044' would be created under the tablespace.

$ ls -l /home/postgres/tblspc/
total 4
drwx------ 2 postgres postgres 4096 Apr 21 10:08 PG_14_202011044

The tablespace directory is addressed by a symbolic link from the pg_tblspc subdirectory, and the link name is the same as the OID value of tablespace.

$ ls -l $PGDATA/pg_tblspc/
total 0
lrwxrwxrwx 1 postgres postgres 21 Apr 21 10:08 16386 -> /home/postgres/tblspc

If you create a new database (OID is 16387) under the tablespace, its directory is created under the version-specific subdirectory.

$ ls -l /home/postgres/tblspc/PG_14_202011044/
total 4
drwx------ 2 postgres postgres 4096 Apr 21 10:10 16387

If you create a new table in the tablespace, but the table belongs to a database created under the base directory, PostgreSQL first creates a new directory under the version-specific subdirectory, named after the OID of the existing database, and then places the new table file in that directory.

sampledb=# CREATE TABLE newtbl (.....) TABLESPACE new_tblspc;

sampledb=# SELECT pg_relation_filepath('newtbl');
             pg_relation_filepath             
---------------------------------------------
 pg_tblspc/16386/PG_14_202011044/16384/18894
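The path components above can be assembled as in the following Python sketch. Both function names are hypothetical helpers, and the version and catalogue numbers are passed as strings copied from the example rather than looked up from a server.

```python
import posixpath


def tablespace_version_dir(major_version: str, catalog_version: str) -> str:
    """Build the version-specific subdirectory name: PG_<major>_<catalogue version>."""
    return f"PG_{major_version}_{catalog_version}"


def tablespace_file_path(spc_oid: int, version_dir: str,
                         db_oid: int, relfilenode: int) -> str:
    """Return the file path, relative to the base directory, of a relation in a tablespace.

    The path goes through the symbolic link pg_tblspc/<tablespace OID>.
    """
    return posixpath.join("pg_tblspc", str(spc_oid), version_dir,
                          str(db_oid), str(relfilenode))


version_dir = tablespace_version_dir("14", "202011044")
print(tablespace_file_path(16386, version_dir, 16384, 18894))
# pg_tblspc/16386/PG_14_202011044/16384/18894
```

The database OID in the path is that of sampledb (16384), even though the file physically resides under the tablespace directory.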

1.3. Internal Layout of a Heap Table File

A data file (heap table or index, as well as the free space map and visibility map) is divided into pages (or blocks) of fixed length; the default is 8192 bytes (8 KB). The pages within each file are numbered sequentially from 0, and these numbers are called block numbers. When the file is full, PostgreSQL adds a new empty page to the end of the file to increase the file size.

The internal layout of a page depends on the type of data file. This section describes the table layout, because this information is required in the following chapters.

Fig. 1.4. Page layout of a heap table file.

A page within a table contains three kinds of data described as follows:

heap tuple(s) – A heap tuple is the record data itself. Heap tuples are stacked in order from the bottom of the page. The internal structure of a tuple is described in Section 5.2 and Chapter 9, because knowledge of both Concurrency Control (CC) and WAL in PostgreSQL is required.
line pointer(s) – A line pointer is 4 bytes long and holds a pointer to a heap tuple. It is also called an item pointer.
Line pointers form a simple array that serves as an index to the tuples. The entries are numbered sequentially from 1, and each number is called an offset number. When a new tuple is added to the page, a new line pointer is also appended to the array to point to the new tuple.
header data – Header data defined by the structure PageHeaderData is allocated at the beginning of the page. It is 24 bytes long and contains general information about the page. The major variables of the structure are described below.
- pd_lsn – This variable stores the LSN of XLOG record written by the last change of this page. It is an 8-byte unsigned integer, related to the WAL (Write-Ahead Logging) mechanism. The details are described in Chapter 9.
- pd_checksum – This variable stores the checksum value of this page. (Note that this variable is supported in version 9.3 or later; in earlier versions, this part had stored the timelineId of the page.)
- pd_lower, pd_upper – pd_lower points to the end of line pointers, and pd_upper to the beginning of the newest heap tuple.
- pd_special – This variable is for indexes. In a table page, it points to the end of the page. (In an index page, it points to the beginning of the special space, a data area held only by indexes that contains data specific to the index type, such as B-tree, GiST, GIN, etc.)

The empty space between the end of the line pointers and the beginning of the newest tuple is referred to as free space or a hole.

To identify a tuple within a table, a tuple identifier (TID) is used internally. A TID comprises a pair of values: the block number of the page that contains the tuple, and the offset number of the line pointer that points to the tuple. A typical use of TIDs is in indexes. See Section 1.4.2 for details.

The structure PageHeaderData is defined in src/include/storage/bufpage.h.
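To make the 24-byte header concrete, here is a minimal Python sketch that unpacks it with the struct module. The field order follows bufpage.h in recent PostgreSQL versions; pd_flags, pd_pagesize_version and pd_prune_xid are fields of that structure not discussed above. The sketch assumes a little-endian machine, and the names parse_page_header and summarize are hypothetical. The page used here is synthetic, not read from a real data file.

```python
import struct
from collections import namedtuple

PAGE_SIZE = 8192
# pd_lsn (two uint32 halves), pd_checksum, pd_flags, pd_lower, pd_upper,
# pd_special, pd_pagesize_version (uint16 each), pd_prune_xid (uint32).
HEADER_FORMAT = "<IIHHHHHHI"          # little-endian layout is an assumption
HEADER_SIZE = struct.calcsize(HEADER_FORMAT)
LINE_POINTER_SIZE = 4

PageHeader = namedtuple(
    "PageHeader",
    "lsn_hi lsn_lo pd_checksum pd_flags pd_lower pd_upper "
    "pd_special pd_pagesize_version pd_prune_xid",
)


def parse_page_header(page: bytes) -> PageHeader:
    """Unpack the first 24 bytes of a page into named fields."""
    return PageHeader._make(struct.unpack_from(HEADER_FORMAT, page, 0))


def summarize(header: PageHeader) -> dict:
    """Derive the values discussed in the text from pd_lower and pd_upper."""
    return {
        "lsn": (header.lsn_hi << 32) | header.lsn_lo,
        "line_pointers": (header.pd_lower - HEADER_SIZE) // LINE_POINTER_SIZE,
        "free_space": header.pd_upper - header.pd_lower,
    }


# Synthetic page with two line pointers and 64 bytes of tuple data.
demo = bytearray(PAGE_SIZE)
struct.pack_into(HEADER_FORMAT, demo, 0,
                 0, 1000, 0, 0, 32, 8128, 8192, 8192 | 4, 0)
print(summarize(parse_page_header(bytes(demo))))
```

To try this on a real table, one could read the first 8192 bytes of a data file from a stopped test cluster and pass them to parse_page_header, keeping in mind that the byte order depends on the platform.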

In addition, a heap tuple whose size is greater than about 2 KB (about 1/4 of 8 KB) is stored and managed using a method called TOAST (The Oversized-Attribute Storage Technique). Refer to the PostgreSQL documentation for details.

1.4. The Methods of Writing and Reading Tuples

At the end of this chapter, the methods of writing and reading heap tuples are described.

1.4.1. Writing Heap Tuples

Suppose a table composed of one page which contains just one heap tuple. The pd_lower of this page points to the first line pointer, and both the line pointer and the pd_upper point to the first heap tuple. See Fig. 1.5(a).

When the second tuple is inserted, it is placed after the first one. The second line pointer is appended after the first one, and it points to the second tuple. The pd_lower changes to point to the second line pointer, and the pd_upper to the second heap tuple. See Fig. 1.5(b). Other header data within this page (e.g., pd_lsn, pd_checksum, pd_flags) are also rewritten to appropriate values; more details are described in Section 5.3 and Chapter 9.

Fig. 1.5. Writing of a heap tuple.
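The movement of pd_lower and pd_upper during inserts can be simulated with a toy model. The sketch below is a simplification rather than PostgreSQL's actual code: the class HeapPage is hypothetical, it tracks only offsets and lengths instead of real tuple bytes, and it ignores the alignment padding a real page applies.

```python
HEADER_SIZE = 24         # size of PageHeaderData
LINE_POINTER_SIZE = 4
PAGE_SIZE = 8192


class HeapPage:
    """Toy model of a heap page that tracks only pd_lower and pd_upper."""

    def __init__(self) -> None:
        self.pd_lower = HEADER_SIZE     # end of the line pointer array
        self.pd_upper = PAGE_SIZE       # start of the newest tuple
        self.line_pointers = []         # (tuple offset, tuple length) pairs

    def free_space(self) -> int:
        return self.pd_upper - self.pd_lower

    def insert(self, tuple_length: int) -> int:
        """Place a tuple at the bottom of the free space; return its offset number."""
        if tuple_length + LINE_POINTER_SIZE > self.free_space():
            raise ValueError("page is full")
        self.pd_upper -= tuple_length                    # tuples grow downward
        self.line_pointers.append((self.pd_upper, tuple_length))
        self.pd_lower += LINE_POINTER_SIZE               # line pointers grow upward
        return len(self.line_pointers)                   # offset numbers start at 1


page = HeapPage()
print(page.insert(100), page.pd_lower, page.pd_upper)   # 1 28 8092
print(page.insert(60), page.pd_lower, page.pd_upper)    # 2 32 8032
```

Each insert shrinks the free space from both ends: by the tuple length at the top and by 4 bytes for the new line pointer at the bottom.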

1.4.2. Reading Heap Tuples

Two typical access methods, sequential scan and B-tree index scan, are outlined here:

Sequential scan – All tuples in all pages are sequentially read by scanning all line pointers in each page. See Fig. 1.6(a).
B-tree index scan – An index file contains index tuples, each of which is composed of an index key and a TID pointing to the target heap tuple. When the index tuple with the key you are looking for has been found, PostgreSQL reads the desired heap tuple using the obtained TID value. (How index tuples are found in a B-tree index is not explained here, as it is widely documented and space is limited. See the relevant materials.) For example, in Fig. 1.6(b), the TID value of the obtained index tuple is ‘(block = 7, Offset = 2)’. This means that the target heap tuple is the one referenced by the 2nd line pointer in the page with block number 7, so PostgreSQL can read the desired heap tuple without unnecessary scanning of the pages.

Fig. 1.6. Sequential scan and index scan.
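The difference between the two access methods can be sketched with plain Python data structures. This is a conceptual model only: the table is a list of pages, each page a list of rows, a dict with unique keys stands in for the B-tree, and all function names are hypothetical.

```python
# Each page is a list of rows; list index i corresponds to offset number i + 1.
table = [
    [("apple", 1), ("pear", 2)],   # block 0
    [("plum", 3)],                 # block 1
]


def sequential_scan(table, predicate):
    """Visit every line pointer of every page, yielding (TID, row) for matches."""
    for block_no, page in enumerate(table):
        for offset_no, row in enumerate(page, start=1):
            if predicate(row):
                yield (block_no, offset_no), row


def build_index(table, key):
    """Map each (assumed unique) key to the TID of its heap tuple."""
    return {key(row): tid for tid, row in sequential_scan(table, lambda row: True)}


def fetch_by_tid(table, tid):
    """Read one tuple directly from its block number and offset number."""
    block_no, offset_no = tid
    return table[block_no][offset_no - 1]


def index_scan(table, index, wanted):
    """Look up the TID in the index, then fetch only that tuple."""
    tid = index.get(wanted)
    return None if tid is None else (tid, fetch_by_tid(table, tid))


index = build_index(table, key=lambda row: row[0])
print(index_scan(table, index, "plum"))                      # ((1, 1), ('plum', 3))
print(list(sequential_scan(table, lambda row: row[1] >= 2)))
```

The index scan touches a single page once the TID is known, whereas the sequential scan inspects every tuple in every page regardless of how many rows match.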

Indexes Internals

This document does not explain indexes in detail. To understand them, I recommend reading the valuable posts listed below:

Indexes in PostgreSQL — 1
Indexes in PostgreSQL — 2
Indexes in PostgreSQL — 3 (Hash)
Indexes in PostgreSQL — 4 (Btree)
Indexes in PostgreSQL — 5 (GiST)
Indexes in PostgreSQL — 6 (SP-GiST)
Indexes in PostgreSQL — 7 (GIN)
Indexes in PostgreSQL — 9 (BRIN)

PostgreSQL also supports TID-Scan, Bitmap-Scan, and Index-Only-Scan.

TID-Scan is a method that accesses a tuple directly by using the TID of the desired tuple. For example, to find the 1st tuple in the 0-th page within the table, issue the following query:

sampledb=# SELECT ctid, data FROM sampletbl WHERE ctid = '(0,1)';
 ctid  |   data    
-------+-----------
 (0,1) | AAAAAAAAA
(1 row)

Index-Only-Scan will be described in detail in Chapter 7.
