weixin_30608131

MySQL学习笔记——字符集

字符值包含字母、数字和特殊符号。在字符值可以存储之前，字母、数字和字符必须转换为数值代码。所以必须建立一个转换表，其中包含了每个相关字符的数值代码。这样的转换表就称为字符集，有时也称为代码字符集（code character set）和字符编码（character encoding）。

要想让计算机处理字符，不仅需要字符到数值的映射，还要考虑如果存储这些数值，所以便诞生了编码方案的概念。是定长存储呢，还是变长存储？是用一个字节还是用多个字节？仁者见仁，智者见智。依据需要的不同，诞生了很多的编码方案。对于Unicode，就存在UTF-8、UTF-16、UTF-32。

而在MySQL中，字符集的概念和编码方案的概念被看作是同义词。一个字符集(character set)是一个转换表和一个编码方案的组合。校对（collation）的概念是为了解决排序的顺序或字符的分组问题。因为字符的排序和分组需要字符之间的比较，校对就定义了这些比较的大小关系。

显示可用的字符集

SHOW CHARACTER SET
或者
SELECT CHARACTER_SET_NAME,DESCRIPTION,DEFAULT_COLLATE_NAME,MAXLEN
FROM INFORMATION_SCHEMA.CHARACTER_SETS

显示字符集utf8可用的校对

SHOW COLLATION LIKE 'utf8%'
或者
SELECT *
FROM INFOMATION_SCHEMA.COLLATIONS
WHERE COLLATION_NAME LIKE 'utf8%'

很多时候，数据库中或客户端显示乱码是由于字符集没有设置正确，用latin1字符集显示utf8字符集的数据当然会出现问题。这时需要查看数据库、表和列的字符集是否是你想要的；客户端的字符集是否的当。

如下是字符集和校对的系统变量

系统变量	说明
CHARACTER_SET_CLIENT	从客户机发送给服务器的语句的字符集
CHARACTER_SET_CONNECTION	客户机和服务器连接的字符集
CHARACTER_SET_DATABASE	当前数据库的默认字符集。每次使用USE语句来“跳转”到另一个数据库时，这个变量就会改变。如果没有当前数据库，其值为CHARACTER_SET_SERVER
CHARACTER_SET_RESULTS	从服务器发送到客户机的SELECT语句的最终结果的字符集，包括列的值，列的元数据——列名，错误信息
CHARACTER_SET_SERVER	服务器的默认字符集
CHARACTER_SET_SYSTEM	系统字符集。用于数据库中对象（如表和列）的名字，也用于存储在目录表中函数的名字。其值总是等于utf8
CHARACTER_SET_DIR	注册的所有字符的文件都在这个目录中
COLLATION_CONNECTION	当前连接的校对
COLLATION_DATABASE	当前日期的默认校对。每次使用USE语句来“跳转”到另一个数据库时，这个变量就会改变。
COLLATION_SERVER	服务器默认校对

数据库对象的字符集的指定有如下继承关系：

Server -> Database -> Table -> Column

也就是说，如果后者没有显示指定字符集，那么将采用前者的字符集。

Server Character Set and Collation

MySQL Server has a server character set and a server collation. These can be set at server startup on the command line or in an option file and changed at runtime.

Initially, the server character set and collation depend on the options that you use when you start mysqld. You can use --character-set-server for the character set. Along with it, you can add --collation-server for the collation. If you don't specify a character set, that is the same as saying --character-set-server=latin1. If you specify only a character set (for example, latin1) but not a collation, that is the same as saying --character-set-server=latin1 --collation-server=latin1_swedish_ci because latin1_swedish_ci is the default collation for latin1. Therefore, the following three commands all have the same effect:

shell> mysqld
shell> mysqld --character-set-server=latin1
shell> mysqld --character-set-server=latin1 \
           --collation-server=latin1_swedish_ci

The server character set and collation are used as default values if the database character set and collation are not specified in CREATE DATABASE statements. They have no other purpose.

The current server character set and collation can be determined from the values of the character_set_serverand collation_server system variables. These variables can be changed at runtime.

Database Character Set and Collation

Every database has a database character set and a database collation. The CREATE DATABASE and ALTER DATABASE statements have optional clauses for specifying the database character set and collation:

CREATE DATABASE db_name
    [[DEFAULT] CHARACTER SET charset_name]
    [[DEFAULT] COLLATE collation_name]

ALTER DATABASE db_name
    [[DEFAULT] CHARACTER SET charset_name]
    [[DEFAULT] COLLATE collation_name]

The keyword SCHEMA can be used instead of DATABASE.

The database character set and collation are used as default values for table definitions if the table character set and collation are not specified in CREATE TABLE statements. The database character set also is used by LOAD DATA INFILE. The character set and collation have no other purposes.

The character set and collation for the default database can be determined from the values of thecharacter_set_database and collation_database system variables. The server sets these variables whenever the default database changes. If there is no default database, the variables have the same value as the corresponding server-level system variables, character_set_server and collation_server.

Table Character Set and Collation

Every table has a table character set and a table collation. The CREATE TABLE and ALTER TABLE statements have optional clauses for specifying the table character set and collation:

CREATE TABLE tbl_name (column_list)
    [[DEFAULT] CHARACTER SET charset_name]
    [COLLATE collation_name]]

ALTER TABLE tbl_name
    [[DEFAULT] CHARACTER SET charset_name]
    [COLLATE collation_name]
The table character set and collation are used as default values for column definitions if the column character set and collation are not specified in individual column definitions. The table character set and collation are MySQL extensions; there are no such things in standard SQL.

Column Character Set and Collation

Every “character” column (that is, a column of type CHAR, VARCHAR, or TEXT) has a column character set and a column collation. Column definition syntax for CREATE TABLE and ALTER TABLE has optional clauses for specifying the column character set and collation:

col_name {CHAR | VARCHAR | TEXT} (col_length)
    [CHARACTER SET charset_name]
    [COLLATE collation_name]

These clauses can also be used for ENUM and SET columns:

col_name {ENUM | SET} (val_list)
    [CHARACTER SET charset_name]
    [COLLATE collation_name]

Examples:

CREATE TABLE t1
(
    col1 VARCHAR(5)
      CHARACTER SET latin1
      COLLATE latin1_german1_ci
);

ALTER TABLE t1 MODIFY
    col1 VARCHAR(5)
      CHARACTER SET latin1
      COLLATE latin1_swedish_ci;

If you use ALTER TABLE to convert a column from one character set to another, MySQL attempts to map the data values, but if the character sets are incompatible, there may be data loss.

转换字符集注意事项：

ALTER [IGNORE] TABLE table

CONVERT TO CHARACTER SET charset [COLLATE collation] | [DEFAULT]CHARACTER SET charset [COLLATE collation]

CONVERT子句可能带来数据上的问题。因此，在使用该子句前，请确保做过备份并再完成前检查转换的数据。如果你有字符集列，在转换过程中数据有可能丢失，首先应该把该列转换为二进制大对象（BLOB）数据类型，接着转换成想要的数据类型和字符集。通常情况下，这种做法极好，因为BLOB数据不能转换字符集。

Character String Literal Character Set and Collation

Every character string literal has a character set and a collation.

A character string literal may have an optional character set introducer and COLLATE clause:

[_charset_name]'string' [COLLATE collation_name]

Examples:

SELECT 'string';
SELECT _latin1'string';
SELECT _latin1'string' COLLATE latin1_danish_ci;

For the simple statement SELECT 'string', the string has the character set and collation defined by thecharacter_set_connection and collation_connection system variables.

The _charset_name expression is formally called an introducer. It tells the parser, “the string that is about to follow uses character set X.” Because this has confused people in the past, we emphasize that an introducer does not change the string to the introducer character set like CONVERT() would do. It does not change the string's value, although padding may occur. The introducer is just a signal. An introducer is also legal before standard hex literal and numeric hex literal notation (x'literal' and 0xnnnn), or before bit-field literal notation (b'literal'and 0bnnnn).

National Character Set

标准的SQL中使用NCHAR，NVARCHAR等表示国际字符集。但是MySQL不是，它只有CHAR和VARCHAR。需要通过设置字符集来达到存储存储其他字符的目的。

For example, these data type declarations are equivalent:

CHAR(10) CHARACTER SET utf8
NATIONAL CHARACTER(10)
NCHAR(10)

As are these:

VARCHAR(10) CHARACTER SET utf8
NATIONAL VARCHAR(10)
NCHAR VARCHAR(10)
NATIONAL CHARACTER VARYING(10)
NATIONAL CHAR VARYING(10)

You can use N'literal' (or n'literal') to create a string in the national character set. These statements are equivalent:

SELECT N'some text';
SELECT n'some text';
SELECT _utf8'some text';

Connection Character Sets and Collations

Two statements affect the connection-related character set variables as a group:

SET NAMES 'charset_name' [COLLATE 'collation_name']

SET NAMES indicates what character set the client will use to send SQL statements to the server. Thus, SET NAMES 'cp1251' tells the server, “future incoming messages from this client are in character set cp1251.” It also specifies the character set that the server should use for sending results back to the client. (For example, it indicates what character set to use for column values if you use a SELECT statement.)

A SET NAMES 'x' statement is equivalent to these three statements:
```
SET character_set_client = x;
SET character_set_results = x;
SET character_set_connection = x;
```
Setting character_set_connection to x also implicitly sets collation_connection to the default collation forx. It is unnecessary to set that collation explicitly. To specify a particular collation, use the optional COLLATE clause:
```
SET NAMES 'charset_name' COLLATE 'collation_name'
```
SET CHARACTER SET charset_name

SET CHARACTER SET is similar to SET NAMES but sets character_set_connection andcollation_connection to character_set_database and collation_database. A SET CHARACTER SETx statement is equivalent to these three statements:
```
SET character_set_client = x;
SET character_set_results = x;
SET collation_connection = @@collation_database;
```
Setting collation_connection also implicitly sets character_set_connection to the character set associated with the collation (equivalent to executing SET character_set_connection = @@character_set_database). It is unnecessary to set character_set_connection explicitly.

Note

ucs2, utf16, and utf32 cannot be used as a client character set, which means that they do not work for SET NAMES or SET CHARACTER SET.

The MySQL client programs mysql, mysqladmin, mysqlcheck, mysqlimport, and mysqlshow determine the default character set to use as follows:

In the absence of other information, the programs use the compiled-in default character set, usually latin1.
The programs can autodetect which character set to use based on the operating system setting, such as the value of the LANG or LC_ALL locale environment variable on Unix systems or the code page setting on Windows systems. For systems on which the locale is available from the OS, the client uses it to set the default character set rather than using the compiled-in default. For example, setting LANG to ru_RU.KOI8-R causes the koi8r character set to be used. Thus, users can configure the locale in their environment for use by MySQL clients.

The OS character set is mapped to the closest MySQL character set if there is no exact match. If the client does not support the matching character set, it uses the compiled-in default. For example, ucs2 is not supported as a connection character set.

C applications that wish to use character set autodetection based on the OS setting can invoke the followingmysql_options() call before connecting to the server:
```
mysql_options(mysql,
              MYSQL_SET_CHARSET_NAME,
              MYSQL_AUTODETECT_CHARSET_NAME);
```
The programs support a --default-character-set option, which enables users to specify the character set explicitly to override whatever default the client otherwise determines.

Note

Before MySQL 5.5, in the absence of other information, the MySQL client programs used the compiled-in default character set, usually latin1. An implication of this difference is that if your environment is configured to use a non-latin1 locale, MySQL client programs will use a different connection character set than previously, as though you had issued an implicit SET NAMES statement. If the previous behavior is required, start the client with the --default-character-set=latin1 option.

When a client connects to the server, it sends the name of the character set that it wants to use. The server uses the name to set the character_set_client, character_set_results, and character_set_connectionsystem variables. In effect, the server performs a SET NAMES operation using the character set name.

With the mysql client, if you want to use a character set different from the default, you could explicitly execute SET NAMES every time you start up. However, to accomplish the same result more easily, you can add the --default-character-set option setting to your mysql command line or in your option file. For example, the following option file setting changes the three connection-related character set variables set to koi8r each time you invoke mysql:

[mysql]
default-character-set=koi8r

To see the values of the character set and collation system variables that apply to your connection, use these statements:

SHOW VARIABLES LIKE 'character_set%';
SHOW VARIABLES LIKE 'collation%';

If you change the default character set or collation for a database, stored routines that use the database defaults must be dropped and recreated so that they use the new defaults. (In a stored routine, variables with character data types use the database defaults if the character set or collation are not specified explicitly.

校对命名规则

Collation Names

MySQL collation names follow these rules:

A name ending in _ci indicates a case-insensitive collation.
A name ending in _cs indicates a case-sensitive collation.
A name ending in _bin indicates a binary collation. Character comparisons are based on character binary code values.

Nonbinary strings have PADSPACE behavior for all collations, including_bin collations. Trailing spaces are insignificant in comparisons:（也就是说，字符串中末尾的空格不起作用）

mysql> SET NAMES utf8 COLLATE utf8_bin;
Query OK, 0 rows affected (0.00 sec)

mysql> SELECT 'a ' = 'a';
+------------+
| 'a ' = 'a' |
+------------+
|          1 |
+------------+
1 row in set (0.00 sec)

For binary strings, all characters are significant in comparisons, including trailing spaces:

mysql> SET NAMES binary;
Query OK, 0 rows affected (0.00 sec)

mysql> SELECT 'a ' = 'a';
+------------+
| 'a ' = 'a' |
+------------+
|          0 |
+------------+
1 row in set (0.00 sec)

The `BINARY` Operator

The BINARY operator casts the string following it to a binary string. This is an easy way to force a comparison to be done byte by byte rather than character by character. BINARY also causes trailing spaces to be significant.

mysql> SELECT 'a' = 'A';
        -> 1
mysql> SELECT BINARY 'a' = 'A';
        -> 0
mysql> SELECT 'a' = 'a ';
        -> 1
mysql> SELECT BINARY 'a' = 'a ';
        -> 0

BINARY str is shorthand for CAST(str AS BINARY).

The BINARY attribute in character column definitions has a different effect. A character column defined with theBINARY attribute is assigned the binary collation of the column character set. Every character set has a binary collation. For example, the binary collation for the latin1 character set is latin1_bin, so if the table default character set is latin1, these two column definitions are equivalent:

CHAR(10) BINARY
CHAR(10) CHARACTER SET latin1 COLLATE latin1_bin

Collation and `INFORMATION_SCHEMA` Searches

String columns in INFORMATION_SCHEMA tables have a collation of utf8_general_ci, which is case insensitive. However, searches in INFORMATION_SCHEMA string columns are also affected by file system case sensitivity. For values that correspond to objects that are represented in the file system, such as names of databases and tables, searches may be case sensitive if the file system is case sensitive. This section describes how to work around this issue if necessary; see also Bug #34921.

Suppose that a query searches the SCHEMATA.SCHEMA_NAME column for the test database. On Linux, file systems are case sensitive, so comparisons of SCHEMATA.SCHEMA_NAME with 'test' match, but comparisons with 'TEST'do not:

mysql> SELECT SCHEMA_NAME FROM INFORMATION_SCHEMA.SCHEMATA
    -> WHERE SCHEMA_NAME = 'test';
+-------------+
| SCHEMA_NAME |
+-------------+
| test        |
+-------------+
1 row in set (0.01 sec)

mysql> SELECT SCHEMA_NAME FROM INFORMATION_SCHEMA.SCHEMATA
    -> WHERE SCHEMA_NAME = 'TEST';
Empty set (0.00 sec)

On Windows or Mac OS X where file systems are not case sensitive, comparisons match both 'test' and 'TEST':

mysql> SELECT SCHEMA_NAME FROM INFORMATION_SCHEMA.SCHEMATA
    -> WHERE SCHEMA_NAME = 'test';
+-------------+
| SCHEMA_NAME |
+-------------+
| test        |
+-------------+
1 row in set (0.00 sec)

mysql> SELECT SCHEMA_NAME FROM INFORMATION_SCHEMA.SCHEMATA
    -> WHERE SCHEMA_NAME = 'TEST';
+-------------+
| SCHEMA_NAME |
+-------------+
| TEST        |
+-------------+
1 row in set (0.00 sec)

The value of the lower_case_table_names system variable makes no difference in this context.

This behavior occurs because the utf8_general_ci collation is not used for INFORMATION_SCHEMA queries when searching the file system for database objects. It is a result of optimizations implemented for INFORMATION_SCHEMAsearches in MySQL. For information about these optimizations, see Section 7.2.4, “OptimizingINFORMATION_SCHEMA Queries”.

Searches in INFORMATION_SCHEMA string columns for values that refer to INFORMATION_SCHEMA itself do use theutf8_general_ci collation because INFORMATION_SCHEMA is a “virtual” database and is not represented in the file system. For example, comparisons with SCHEMATA.SCHEMA_NAME match 'information_schema' or'INFORMATION_SCHEMA' regardless of platform:

mysql> SELECT SCHEMA_NAME FROM INFORMATION_SCHEMA.SCHEMATA
    -> WHERE SCHEMA_NAME = 'information_schema';
+--------------------+
| SCHEMA_NAME        |
+--------------------+
| information_schema |
+--------------------+
1 row in set (0.00 sec)

mysql> SELECT SCHEMA_NAME FROM INFORMATION_SCHEMA.SCHEMATA
    -> WHERE SCHEMA_NAME = 'INFORMATION_SCHEMA';
+--------------------+
| SCHEMA_NAME        |
+--------------------+
| information_schema |
+--------------------+
1 row in set (0.00 sec)

If the result of a string operation on an INFORMATION_SCHEMA column differs from expectations, a workaround is to use an explicit COLLATE clause to force a suitable collation (Section 9.1.7.2, “Using COLLATE in SQL Statements”). For example, to perform a case-insensitive search, use COLLATE with the INFORMATION_SCHEMA column name:

mysql> SELECT SCHEMA_NAME FROM INFORMATION_SCHEMA.SCHEMATA
    -> WHERE SCHEMA_NAME COLLATE utf8_general_ci = 'test';
+-------------+
| SCHEMA_NAME |
+-------------+
| test        |
+-------------+
1 row in set (0.00 sec)

mysql> SELECT SCHEMA_NAME FROM INFORMATION_SCHEMA.SCHEMATA
    -> WHERE SCHEMA_NAME COLLATE utf8_general_ci = 'TEST';
| SCHEMA_NAME |
+-------------+
| test        |
+-------------+
1 row in set (0.00 sec)

You can also use the UPPER() or LOWER() function:

WHERE UPPER(SCHEMA_NAME) = 'TEST'
WHERE LOWER(SCHEMA_NAME) = 'test'

详细MySQL字符集参考帮助手册：http://dev.mysql.com/doc/refman/5.5/en/globalization.html

转载于:https://www.cnblogs.com/freewater/archive/2011/12/17/2289431.html

你可能感兴趣的:(MySQL学习笔记——字符集)

Regular Expression 正则表达式 Aimyon_36 Data Development 正则表达式 redis 数据库
RegularExpression前言1.基本匹配2.元字符2.1点运算符.2.2字符集2.2.1否定字符集2.3重复次数2.3.1*号2.3.2+号2.3.3?号2.4{}号2.5(...)特征标群2.6|或运算符2.7转码特殊字符2.8锚点2.8.1^号2.8.2$号3.简写字符集4.零宽度断言（前后预查）4.1?=...正先行断言4.2?!...负先行断言4.3?Thefatcatsaton
String方法(JDK9) 凯哥学堂
声明：本栏目所使用的素材都是凯哥学堂VIP学员所写，学员有权匿名，对文章有最终解释权；凯哥学堂旨在促进VIP学员互相学习的基础上公开笔记。String方法(JDK9)构造器：String#String()无参数构造器，默认给的是一个””空字符串String#String(java.lang.String)给你一个char数组，它就帮你进行ABCD输出GBK中文简体+繁体字符集GB2312中文简体字
【JAVA入门】Day42 - 转换流 Clown Piece JAVA入门 java python 开发语言
【JAVA入门】Day42-转换流文章目录【JAVA入门】Day42-转换流转换流是字符流和字节流之间的桥梁。转换流中的输入流叫做InputStreamReader，它可以把字节流转换为字符流。转换流的输出流叫做OutputStreamWriter，它可以把字符流转换成字节流。【使用例1】把一个GBK的文件中的中文读取到内存中，不能出现乱码。（作用1：按照指定的字符集读取数据）packageCon
Python——破解rar压缩包密码星和月 python 算法
破解RAR压缩包密码一般是通过穷举法来实现的，即尝试所有可能的密码组合，直到找到正确的密码为止。以下是使用Python编写的一个简单的RAR密码破解程序：importitertoolsimportrarfiledefcrack_rar_password(rar_file,password_length):#创建RAR文件对象rf=rarfile.RarFile(rar_file)#定义密码字符集合
【C语言】词法陷阱与缺陷之二：字符和字符串表示详解 byte轻骑兵编程语言精要 #C语言深度解析坊 c语言开发语言
在C语言中，字符和字符串的表示是编程基础中的关键部分，但同时也是容易引发词法陷阱和缺陷的地方。以下是对字符和字符串表示的详细解析。一、字符的表示1.1.基本概念在C语言中，字符被视为整数，其值对应于字符集中的位置。对于采用ASCII字符集的编译器而言，字符'a'的整数值为97（十进制）或0141（八进制）。字符用单引号'括起来，如'a'、'1'、'\n'等。1.2.多字符常量某些C编译器允许在一个
MySQL 数据库：原理、应用与发展专家大圣数据库数据库 mysql
摘要：本文深入探讨了MySQL数据库相关内容。首先介绍了MySQL作为开源关系型数据库管理系统的显著特点，包括易用性、跨平台性、高性能、可扩展性、开源免费以及数据安全性等方面。接着详细阐述了其安装与配置过程，涵盖在不同操作系统上的安装方式、配置文件参数的含义与设置，以及字符集和校对规则的设定。文中进一步讲解了MySQL的基本概念，如数据库与表的构成、多种数据类型、不同索引类型的特点与应用场景。并对
SQL server 日常运维命令一心只为学 sqlserver 数据库运维
一、基础命令查看当前数据库的版本SELECT@@VERSION;查看服务器部分特殊信息selectSERVERPROPERTY(N'edition')asEdition--数据版本，如企业版、开发版等,SERVERPROPERTY(N'collation')asCollation--数据库字符集,SERVERPROPERTY(N'servername')asServerName--服务名,@@VE
修改Mysql默认字符集 LeslieLiang
使用SHOWVARIABLESLIKE'character%'查看当前字符集Snipaste_2018-10-09_14-21-34.jpg1.进入Mysql的目录下，将my-default.txt复制为my.ini(影响不大)2.修改my.ini，在对应字段下添加以下内容[mysqld]character-set-server=utf8[client]default-character-set=
MySQL学习笔记2—基础+条件+排序+分组查询 Jake_SunJG MySQL学习 mysql
DQL语言学习—数据查询语言仅作为学习笔记，学习资源来源于B站视频：BV1xW411u7ax1.基础查询语法：select查询列表from表名特点：查询列表可以是：表中的字段、常量值、表达式、函数查询的结果是一个虚拟的表格USEmyemployees;#1.查询表中的单个字段SELECTlast_nameFROMemployees;#2.查询表中的多个字段，逗号分隔SELECTlast_name,
javase笔记3----正则表达式芝奥小婷笔记
正则表达式简介正则表达式（RegularExpressions），是一个特殊的字符串，可以对普通的字符串进行校验检测等工作，校验一个字符串是否满足预设的规则。基本语法字符集合[]:表示匹配括号里的任意一个字符。[abc]:匹配a或者b或者c[^abc]:匹配任意一个字符，只要不是a,或b,或c就表示匹配成功[a-z]:表示匹配所有的小写字母的任意一个。[A-Za-z]:表示匹配所有的小写字母和大写
MySQL 大小写问题天珩今日所得
场景在做mysql查询的时候，注意到一个问题，mysql默认是不区分大小写的通过简单的查询，发现通过关键字binary可以强制区分大小写参考每日所得--分页查询优化和mysql区分大小写问题那为什么MySQL不区分大小写呢参考文档mysql不区分大小写技术原理文章总结1、是否区分是取决于字符集和校对(Collation)部分所做的工作2、取决于字符集中是否声明了大小写敏感声明之后，开销增加参考ht
浅谈gbase与oracle 字符集差异 gbase_lmax java 前端开发语言
字符集字符集（CharacterSet）：按照一定的字符编码方案，将特定的符号集编码为计算机能够处理的数值的集合。常见字符集名称：ASCII字符集、Unicode字符集、GB2312字符集、BIG5字符集、GB18030字符集等。字符编码字符编码（CharacterEncoding）：是一套规则，对字符集进行编码的方案。如，Unicode是字符集，UTF-8、UTF-16、UTF-32是三种字符编
mysql字符集utf8 unicode_MySQL 编码utf8 与 utf8mb4 utf8mb4_unicode_ci 与 utf8mb4_general_ci weixin_39830175 mysql字符集utf8 unicode
参考：mysql字符集小结utf8mb4已成为MySQL8.0的默认字符集，在MySQL8.0.1及更高版本中将utf8mb4_0900_ai_ci作为默认排序规则。新项目只考虑utf8mb4UTF-8编码是一种变长的编码机制，可以用1~4个字节存储字符。因为历史遗留问题，MySQL中的utf8编码并不是真正的UTF-8，而是阉割版的，最长只有3个字节。当遇到占4个字节的UTF-8编码，例如emo
mysql指定字符集utf8mb4_MySQL字符集utf8修改为utf8mb4的方法步骤 weixin_39774219
对于mysql5.5而言，如果不设定字符集，mysql默认的字符集是latin1拉丁文字符集；但随着各种业务的进一步发展，除了各个国家的本身语言字符，经常也会有一些表情符号出现在应用程序中，而在mysql5.5之前，UTF-8编码只支持1-3个字节，支持BMP这部分的Unicode编码区；从MySQL5.5开始，可以支持4个字节UTF编码utf8mb4，一个字符能够支持更多的字符集，也能够支持更多
mysql怎么把utf8mb4_unicode_ci转为utf8mb4_general_ci 我是杨天 mysql ci/cd oracle 数据库
数据库相关学习资料：https://edu.51cto.com/video/655.htmlMySQL字符集转换方案：从utf8mb4_unicode_ci到utf8mb4_general_ci在MySQL数据库中，字符集和排序规则对于数据的存储和检索具有重要影响。utf8mb4_unicode_ci和utf8mb4_general_ci是两种常见的utf8mb4字符集的排序规则。其中，utf8m
mysql utf8mb4_general_ci_MySQL编码utf8与utf8mb4 utf8mb4_unicode_ci与utf8mb4_general_ci字符集小结... 程涛-supertim mysql
本篇文章小编给大家分享一下MySQL编码utf8与utf8mb4utf8mb4_unicode_ci与utf8mb4_general_ci字符集小结，小编觉得挺不错的，现在分享给大家供大家参考，有需要的小伙伴们可以来看看。utf8mb4已成为MySQL8.0的默认字符集，在MySQL8.0.1及更高版本中将utf8mb4_0900_ai_ci作为默认排序规则。新项目只考虑utf8mb4UTF-8编
PHP批量修改MySQL数据表字符集为utf8mb4/utf8mb4_unicode_ci 小松聊PHP进阶 MySQL PHP php mysql 数据库后端服务器 sql
编码大全可参考我之前的文章：快速理解ASCII、GBK、Unicode、UTF-8、ANSI批量修改注意这是DDL操作，操作过程会锁表（元数据锁），平均1秒能够转码3张表（数据量不大）。亲测操作过后没有数据异常，推荐执行前备份。//接手一些老项目，需要修改编码。$host='';$db='';$user='';$pass='';$charset='utf8mb4';$collate='utf8mb
python 实现第k个字典排列算法 luthane 算法 python 数据结构
第k个字典排列算法介绍"第k个字典排列"算法通常指的是在给定的字符集合（例如，字符串中的字符）中，找到所有可能排列的第k个排列。这个问题可以通过多种方法解决，但一个常见且高效的方法是使用“下一个排列”算法的变种，或称为“第k个排列”的直接算法。方法一：使用“下一个排列”的变种生成所有排列：首先生成所有排列，但显然这种方法对于较大的输入集合是不切实际的，因为它涉及到大量的计算和存储。排序并使用“下一
Mysql学习笔记凉风有信2020
第一次亲密接触一、数据库相关概念：①、数据库的好处：1、持久化数据到本地2、使用数据库管理软件进行结构化查询②、数据库常见概念1、DB：数据库，存储数据的容器2、DBMS：数据库管理系统（数据库软件、数据库产品）3、SQL：结构化数据查询语言，不是某个数据库特有的查询语言，而是几乎所有的结构化数据库通用的语言。③、数据库存储数据的特点1、数据存放到表中，表再放到库中2、一个库可以有多张表。每个表拥
LeetCode学习之路（C++）——字符串（3） Alex_SCY Leetcode leetcode
Leetcode题解-字符串目录Leetcode题解-字符串242.两个字符串包含的字符是否完全相同409.计算一组字符集合可以组成的回文字符串的最大长度205.字符串同构647.回文子字符串个数9.判断一个整数是否是回文数696.统计二进制字符串中连续1和连续0数量相同的子字符串个数242.两个字符串包含的字符是否完全相同242.ValidAnagram(Easy)Leetcode/力扣思路：可
MySQL库表设计规范 zhangkaixuan456 mysql 设计规范数据库
MySQL库表设计规范本文仅针对MySQL、Oracle表设计1)表必须定义主键，默认为ID，整型自增，如果不采用默认设计必须咨询DBA进行设计评估2)ID字段作为自增主键，禁止在非事务内作为上下文作为条件进行数据传递，禁止非自增非数字类型主键设计出现3)禁止使用外键,触发器,存储过程4)多表中的相同列，必须保证列定义一致5)表默认使用InnoDB，国内表字符集默认使用utf8mb4，国际默认使用
开发新系统时,数据库字符集怎么选择对中文的支持最好? New小青龙数据库 mysql 字符集
在新开发的系统时，如果你希望确保中文按拼音顺序正确排序，同时支持更多的特殊字符与符号，下面是对utf8mb4_zh_cn_ci、utf8mb4_unicode_ci和utf8mb4_unicode_520_ci这几种字符集和校对规则的分析以及推荐方案：校对规则分析utf8mb4_zh_cn_ci：特点：这是专为简体中文设计的校对规则，主要考虑了中文拼音的排序需求。它可以在一定程度上支持中文拼音排序
C语言从头学53——字符集 LaoWaiHang C语言从头学 c语言
在使用VS编程时，在项目设置中有一个关于字符集的选项。一是Unicode字符集（VS默认的字符集），二是多字节字符集。本文围绕这两个字符集做一简单介绍。一、先说一下多字节字符集最早的字符集是ANSI的ASCII字符集，它开始使用7位后来使用8位表示包括英文字母、数字、标点符号、制表符、控制符等共计256个字符。后来，随着各国在ASCII的基础上制定本国的字符集，这些从ANSI标准派生的字符集被习惯
正则表达式详解朱什么凡正则表达式 mysql 数据库
正则表达式（RegularExpression）1.定义与用途正则表达式是一种描述字符串匹配模式的工具，它可以用来检查一个字符串是否含有某种子串、将匹配的子串做替换或者从某个字符串中取出符合某个条件的子串等。正则表达式由普通字符（如a到z）和特殊字符（称为“元字符”）组成，用于定义搜索文本时要匹配的一个或多个字符串的模式。2.基本语法与规则2.1字符类备选字符集：用[]表示，匹配方括号中的任意字符
Python爬虫01 阿汤哥的程序之路 python python 爬虫 javascript
requests模块文档安装pip/pip3installrequestsresponse.text和response.content的区别1.response.text等价于response.content.decode("推测出的编码字符集")response.text类型：str编码类型：requests模块自动根据Http头部对响应的编码（response.encoding）作出有根据的推
BaseCTF 高校联合新生赛Week1(web) pink鱼 web安全安全 php
目录HTTP是什么呀喵喵喵´•ﻌ•`编辑md5绕过欸ADarkRoomuploadAura酱的礼物HTTP是什么呀url转义：是将URL中的特殊字符转换为有效的ASCII字符格式的过程，以确保URL的正确解析和传输。这个过程涉及到将非ASCII字符替换为“%hh”格式，其中hh为两位十六进制数，对应于该字符在‌ISO-8859-1字符集里的编码值。URL转义的主要目的是为了确保URL中的特殊字符不
Hive3：列注释、表注释等乱码解决方案生产队队长 HIVE hive
--在Hive的MySQL元数据库中执行usehive;1).修改字段注释字符集altertableCOLUMNS_V2modifycolumnCOMMENTvarchar(256)charactersetutf8;2).修改表注释字符集altertableTABLE_PARAMSmodifycolumnPARAM_VALUEvarchar(4000)charactersetutf8;3).修改分
php连接mysql数据库 Daly罗笔记心得 mysql php 数据库 query sql border
php和mysql，比较容易出现的中文乱码，没有办法详说各种编码的异同，简单而实用的处理办法是：在查询之前插入：mysql_query("setnamesgbk");其中gbk也可以改成其他中文字符集。（似乎比较难以在同一的类中调用，大概是和具体的查询前有时候涉及数据库的选择有关？）而且，在数据库导出，导入之前也最好插入这条语句，保持字符的一致性（否则，在数据库中也可能出现乱码）。04级新生名单I
对于IDEA中default encoding for properties file和transparent native-to-ascii conversion的理解不想做实验了 intellij-idea java ide
关于defaultencodingforpropertiesfile对于properties文件有两个设置，一个是左边下拉框选encoding字符集，默认的是iso8859-1编码和解码，先不勾选右边的transparentnative-to-ascii-conversion。如果这时选了别的encoding编码集，那么读取的时候解码仍然是按照iso8859-1（只变编码不管解码，很坑！），就肯定
protobuf cmakelist，msvc utf-8设置 yayapoi~ KBEngine 服务器
源字符集和执行字符集源字符集指的是cpp文件中字符串的编码方式执行字符集指的是exe文件中字符串的编码方式msvc编译器设置的命令行参数/source-charset:utf-8/execution-charset:utf-8cmake中设置add_compile_options(“:/source-charset:utf-8>”)add_compile_options(“:/execution-
JAVA基础灵静志远位运算加载 Date 字符串池覆盖
一、类的初始化顺序 1 （静态变量，静态代码块）-->（变量，初始化块）--> 构造器同一括号里的，根据它们在程序中的顺序来决定。上面所述是同一类中。如果是继承的情况，那就在父类到子类交替初始化。二、String 1 String a = "abc"; JAVA虚拟机首先在字符串池中查找是否已经存在了值为"abc"的对象，根
keepalived实现redis主从高可用 bylijinnan redis
方案说明两台机器（称为A和B），以统一的VIP对外提供服务 1.正常情况下，A和B都启动，B会把A的数据同步过来（B is slave of A） 2.当A挂了后，VIP漂移到B；B的keepalived 通知redis 执行：slaveof no one，由B提供服务 3.当A起来后，VIP不切换，仍在B上面；而A的keepalived 通知redis 执行slaveof B，开始
java文件操作大全 0624chenhong java
最近在博客园看到一篇比较全面的文件操作文章，转过来留着。 http://www.cnblogs.com/zhuocheng/archive/2011/12/12/2285290.html 转自http://blog.sina.com.cn/s/blog_4a9f789a0100ik3p.html 一.获得控制台用户输入的信息 &nbs
android学习任务不懂事的小屁孩工作
任务完成情况搞清楚带箭头的pupupwindows和不带的使用已完成熟练使用pupupwindows和alertdialog，并搞清楚两者的区别已完成熟练使用android的线程handler,并敲示例代码进行中了解游戏2048的流程，并完成其代码工作进行中-差几个actionbar 研究一下android的动画效果，写一个实例已完成复习fragem
zoom.js 换个号韩国红果果 oom
它的基于bootstrap 的 https://raw.github.com/twbs/bootstrap/master/js/transition.js transition.js模块引用顺序 <link rel="stylesheet" href="style/zoom.css"> <script src=&q
详解Oracle云操作系统Solaris 11.2 蓝儿唯美 Solaris
当Oracle发布Solaris 11时，它将自己的操作系统称为第一个面向云的操作系统。Oracle在发布Solaris 11.2时继续它以云为中心的基调。但是，这些说法没有告诉我们为什么Solaris是配得上云的。幸好，我们不需要等太久。Solaris11.2有4个重要的技术可以在一个有效的云实现中发挥重要作用：OpenStack、内核域、统一存档（UA）和弹性虚拟交换（EVS）。
spring学习——springmvc（一） a-john springMVC
Spring MVC基于模型-视图-控制器（Model-View-Controller，MVC）实现，能够帮助我们构建像Spring框架那样灵活和松耦合的Web应用程序。 1，跟踪Spring MVC的请求请求的第一站是Spring的DispatcherServlet。与大多数基于Java的Web框架一样，Spring MVC所有的请求都会通过一个前端控制器Servlet。前
hdu4342 History repeat itself-------多校联合五 aijuans 数论
水题就不多说什么了。 #include<iostream>#include<cstdlib>#include<stdio.h>#define ll __int64using namespace std;int main(){ int t; ll n; scanf("%d",&t); while(t--)
EJB和javabean的区别 asia007 bean ejb
EJB不是一般的JavaBean,EJB是企业级JavaBean,EJB一共分为3种,实体Bean,消息Bean,会话Bean,书写EJB是需要遵循一定的规范的,具体规范你可以参考相关的资料.另外,要运行EJB,你需要相应的EJB容器,比如Weblogic,Jboss等,而JavaBean不需要,只需要安装Tomcat就可以了 1.EJB用于服务端应用开发, 而JavaBeans
Struts的action和Result总结百合不是茶 struts Action配置 Result配置
一:Action的配置详解: 下面是一个Struts中一个空的Struts.xml的配置文件 <?xml version="1.0" encoding="UTF-8" ?> <!DOCTYPE struts PUBLIC &quo
如何带好自已的团队 bijian1013 项目管理团队管理团队
在网上看到博客" 怎么才能让团队成员好好干活"的评论，觉得写的比较好。原文如下：我做团队管理有几年了吧，我和你分享一下我认为带好团队的几点： 1.诚信对团队内成员，无论是技术研究、交流、问题探讨，要尽可能的保持一种诚信的态度，用心去做好，你的团队会感觉得到。 2.努力提
Java代码混淆工具 sunjing ProGuard
Open Source Obfuscators ProGuard http://java-source.net/open-source/obfuscators/proguardProGuard is a free Java class file shrinker and obfuscator. It can detect and remove unused classes, fields, m
【Redis三】基于Redis sentinel的自动failover主从复制 bit1129 redis
在第二篇中使用2.8.17搭建了主从复制，但是它存在Master单点问题，为了解决这个问题，Redis从2.6开始引入sentinel，用于监控和管理Redis的主从复制环境，进行自动failover，即Master挂了后，sentinel自动从从服务器选出一个Master使主从复制集群仍然可以工作，如果Master醒来再次加入集群，只能以从服务器的形式工作。什么是Sentine
使用代理实现Hibernate Dao层自动事务白糖_ DAO spring AOP 框架 Hibernate
都说spring利用AOP实现自动事务处理机制非常好，但在只有hibernate这个框架情况下，我们开启session、管理事务就往往很麻烦。 public void save(Object obj){ Session session = this.getSession(); Transaction tran = session.beginTransaction(); try
maven3实战读书笔记 braveCS maven3
Maven简介是什么？ Is a software project management and comprehension tool.项目管理工具是基于POM概念(工程对象模型) [设计重复、编码重复、文档重复、构建重复，maven最大化消除了构建的重复] [与XP：简单、交流与反馈；测试驱动开发、十分钟构建、持续集成、富有信息的工作区] 功能：
编程之美-子数组的最大乘积 bylijinnan 编程之美
public class MaxProduct { /** * 编程之美子数组的最大乘积 * 题目: 给定一个长度为N的整数数组，只允许使用乘法，不能用除法，计算任意N-1个数的组合中乘积中最大的一组，并写出算法的时间复杂度。 * 以下程序对应书上两种方法，求得“乘积中最大的一组”的乘积——都是有溢出的可能的。 * 但按题目的意思，是要求得这个子数组，而不
读书笔记-2 chengxuyuancsdn 读书笔记
1、反射 2、oracle年-月-日时-分-秒 3、oracle创建有参、无参函数 4、oracle行转列 5、Struts2拦截器 6、Filter过滤器(web.xml) 1、反射 (1)检查类的结构在java.lang.reflect包里有3个类Field,Method,Constructor分别用于描述类的域、方法和构造器。 2、oracle年月日时分秒 s
[求学与房地产]慎重选择IT培训学校 comsci it
关于培训学校的教学和教师的问题,我们就不讨论了,我主要关心的是这个问题培训学校的教学楼和宿舍的环境和稳定性问题我们大家都知道，房子是一个比较昂贵的东西，特别是那种能够当教室的房子... &nb
RMAN配置中通道(CHANNEL)相关参数 PARALLELISM 、FILESPERSET的关系 daizj oracle rman filesperset PARALLELISM
RMAN配置中通道(CHANNEL)相关参数 PARALLELISM 、FILESPERSET的关系转 PARALLELISM --- 我们还可以通过parallelism参数来指定同时"自动"创建多少个通道： RMAN > configure device type disk parallelism 3 ; 表示启动三个通道，可以加快备份恢复的速度。
简单排序:冒泡排序 dieslrae 冒泡排序
public void bubbleSort(int[] array){ for(int i=1;i<array.length;i++){ for(int k=0;k<array.length-i;k++){ if(array[k] > array[k+1]){
初二上学期难记单词三 dcj3sjt126com sciet
concert 音乐会 tonight 今晚 famous 有名的；著名的 song 歌曲 thousand 千 accident 事故；灾难 careless 粗心的，大意的 break 折断；断裂；破碎 heart 心（脏） happen 偶尔发生，碰巧 tourist 旅游者；观光者 science （自然）科学 marry 结婚 subject 题目；
I.安装Memcahce 1. 安装依赖包libevent Memcache需要安装libevent,所以安装前可能需要执行 Shell代码收藏代码 dcj3sjt126com redis
wget http://download.redis.io/redis-stable.tar.gz tar xvzf redis-stable.tar.gz cd redis-stable make 前面3步应该没有问题，主要的问题是执行make的时候，出现了异常。异常一： make[2]: cc: Command not found 异常原因：没有安装g
并发容器 shuizhaosi888 并发容器
通过并发容器来改善同步容器的性能，同步容器将所有对容器状态的访问都串行化，来实现线程安全，这种方式严重降低并发性，当多个线程访问时，吞吐量严重降低。并发容器ConcurrentHashMap 替代同步基于散列的Map，通过Lock控制。 &nb
Spring Security（12）——Remember-Me功能 234390216 Spring Security Remember Me 记住我
Remember-Me功能目录 1.1 概述 1.2 基于简单加密token的方法 1.3 基于持久化token的方法 1.4 Remember-Me相关接口和实现
位运算焦志广位运算
一、位运算符Ｃ语言提供了六种位运算符： & 按位与 | 按位或 ^ 按位异或 ~ 取反 << 左移 >> 右移 1. 按位与运算按位与运算符"&"是双目运算符。其功能是参与运算的两数各对应的二进位相与。只有对应的两个二进位均为1时，结果位才为1 ，否则为0。参与运算的数以补码方式出现。例如：9&am
nodejs 数据库连接 mongodb mysql liguangsong mongodb mysql node 数据库连接
1.mysql 连接 package.json中dependencies加入 "mysql":"~2.7.0" 执行 npm install 在config 下创建文件 database.js
java动态编译 olive6615 java HotSpot jvm 动态编译
在HotSpot虚拟机中，有两个技术是至关重要的，即动态编译(Dynamic compilation)和Profiling。 HotSpot是如何动态编译Javad的bytecode呢？Java bytecode是以解释方式被load到虚拟机的。HotSpot里有一个运行监视器，即Profile Monitor,专门监视
Storm0.9.5的集群部署配置优化 roadrunners 优化 storm.yaml
nimbus结点配置（storm.yaml）信息： # Licensed to the Apache Software Foundation (ASF) under one # or more contributor license agreements. See the NOTICE file # distributed with this work for additional inf
101个MySQL 的调节和优化的提示 tomcat_oracle mysql
　1. 拥有足够的物理内存来把整个InnoDB文件加载到内存中——在内存中访问文件时的速度要比在硬盘中访问时快的多。　　2. 不惜一切代价避免使用Swap交换分区 – 交换时是从硬盘读取的，它的速度很慢。　　3. 使用电池供电的RAM（注：RAM即随机存储器）。　　4. 使用高级的RAID（注：Redundant Arrays of Inexpensive Disks，即磁盘阵列
zoj 3829 Known Notation(贪心) 阿尔萨斯 ZOJ
题目链接：zoj 3829 Known Notation 题目大意：给定一个不完整的后缀表达式，要求有2种不同操作，用尽量少的操作使得表达式完整。解题思路：贪心，数字的个数要要保证比∗的个数多1，不够的话优先补在开头是最优的。然后遍历一遍字符串，碰到数字+1，碰到∗-1,保证数字的个数大于等1，如果不够减的话，可以和最后面的一个数字交换位置（用栈维护十分方便），因为添加和交换代价都是1