最近在做TDS协议解析,但国内很少有TDS的资料,特此转载从国外一个网站弄来的TDS资料,不是特别全,可能也有些乱(比如今天做的RPC包的解析,看了好久才看明白,有机会的话我把RPC解析贴出来,RPC还是很重要的,参数替换的语句都是在RPC包发送的。)
文章来自:http://freetds.cvs.sourceforge.net/*checkout*/freetds/freetds/doc/tds.html
该网站是免费的专门介绍TDS协议的,网址是:http://www.freetds.org/
页面是我贴过来的,部分跳转有些问题,有兴趣的可以直接浏览原页面。
This document attempts to cover the TDS protocol for:
TDS Version | Supported Products |
4.2 | Sybase SQL Server < 10 and Microsoft SQL Server 6.5 |
5.0 | Sybase SQL Server >= 10 |
7.0 | Microsoft SQL Server 7.0 |
7.1 | Microsoft SQL Server 2000 |
7.2 | Microsoft SQL Server 2005 |
TDS protocol versions TDS 5.0 tds version 5.0 TDS 7.0 tds version 7.0 TDS 7.0+ tds version 7.0, 7.1 and 7.2 TDS 5.0- tds version 5.0 and previous Variable types used in this document: CHAR 8-bit char CHAR[6] string of 6 chars CHAR[n] variable length string XCHAR single byte (TDS 5.0-) or ucs2le (TDS 7.0+) characters INT8 8-bit int INT16 16-bit int INT32 32-bit int UCS2LE Unicode in UCS2LE format
Note: FreeTDS uses TDS_TINYINT for INT8 and TDS_SMALLINT for INT16.
These are TDS 4.2 and not meant to be 100% correct, but I thought they might be helpful to get an overall view of what goes on.
--> Login <-- Login acknowledgement --> INSERT SQL statement <-- Result Set Done --> SELECT SQL statement <-- Column Names <-- Column Info <-- Row Result <-- Row Result <-- Result Set Done --> call stored procedure <-- Column Names <-- Column Info <-- Row Result <-- Row Result <-- Done Inside Process <-- Column Names <-- Column Info <-- Row Result <-- Row Result <-- Done Inside Process <-- Return Status <-- Process Done
Every informations in TDS protocol (query, RPCs, responses and so on) is splitted in packets.
All packets start with the following 8 byte header.
INT8 INT8 INT16 4 bytes +----------+-------------+----------+--------------------+ | packet | last packet | packet | unknown | | type | indicator | size | | +----------+-------------+----------+--------------------+ Fields: packet type 0x01 TDS 4.2 or 7.0 query 0x02 TDS 4.2 or 5.0 login packet 0x03 RPC 0x04 responses from server 0x06 cancels 0x07 Used in Bulk Copy 0x0F TDS 5.0 query 0x10 TDS 7.0 login packet 0x11 TDS 7.0 authentication packet 0x12 TDS 8 prelogin packet last packet indicator 0x00 if more packets 0x01 if last packet packet size (in network byte order) unknown? always 0x00 this has something to do with server to server communication/rpc stuff
The remainder of the packet depends on the type of information it is providing. As noted above, packets break down into the types query, login, response, and cancels. Response packets are further split into multiple sub-types denoted by the first byte (a.k.a. the token) following the above header.
Note: A TDS packet that is longer than 512 bytes is split on the 512 byte boundary and the "more packets" bit is set. The full TDS packet is reassembled from its component 512 byte packets with the 8-byte headers stripped out. 512 is the block_size in the login packet, so it could be set to a different values. In Sybase you can configure a range of valid block sizes. TDS 7.0+ use a default of 4096 as block size.
Packet type (first byte) is 2. The numbers on the left are decimal offsets including the 8 byte packet header.
byte var type description ------------------------------ 8 CHAR[30] host_name 38 INT8 host_name_length 39 CHAR[30] user_name 69 INT8 user_name_length 70 CHAR[30] password 100 INT8 password_length 101 CHAR[30] host_process 131 INT8 host_process_length 132 ? magic1[6] /* mystery stuff */ 138 INT8 bulk_copy 139 ? magic2[9] /* mystery stuff */ 148 CHAR[30] app_name 178 INT8 app_name_length 179 CHAR[30] server_name 209 INT8 server_name_length 210 ? magic3[1] /* 0, don't know this one either */ 211 INT8 password2_length 212 CHAR[30] password2 242 CHAR[223] magic4 465 INT8 password2_length_plus2 466 INT16 major_version /* TDS version */ 468 INT16 minor_version /* TDS version */ 470 CHAR library_name[10] /* "Ct-Library" or "DB-Library" */ 480 INT8 library_length 481 INT16 major_version2 /* program version */ 483 INT16 minor_version2 /* program version */ 485 ? magic6[3] /* ? last two octets are 13 and 17 */ /* bdw reports last two as 12 and 16 here */ /* possibly a bitset flag */ 488 CHAR[30] language /* e.g. "us-english" */ 518 INT8 language_length 519 ? magic7[1] /* mystery stuff */ 520 INT16 old_secure /* explanation? */ 522 INT8 encrypted /* 1 means encrypted all password fields blank */ 523 ? magic8[1] /* no clue... zeros */ 524 CHAR sec_spare[9] /* explanation? */ 533 CHAR[30] char_set /* e.g. "iso_1" */ 563 INT8 char_set_length 564 INT8 magic9[1] /* 1 */ 565 CHAR[6] block_size /* in text */ 571 INT8 block_size_length 572 ? magic10[25] /* lots of stuff here...no clue */
Any help with the magic numbers would be most appreciated.
byte var type description --------------------------- 0 INT32 total packet size 4 INT8[4] TDS Version 0x00000070 7.0 0x01000071 7.1 0x02000972 7.2 (7.2.9?) 8 INT32 packet size (default 4096) 12 INT8[4] client program version 16 INT32 PID of client 20 INT32 connection id (usually 0) 24 INT8 option flags 1 0x80 enable warning messages if SET LANGUAGE issued 0x40 change to initial database must succeed 0x20 enable warning messages if USE <database> issued 0x10 enable BCP 0x08 use ND5000 floating point format (untested) 0x04 use VAX floating point format (untested) 0x02 use EBCDIC encoding (untested) 0x01 use big-endian byte order (untested) 25 INT8 option flags 2 0x80 enable domain login security 0x40 "USER_SERVER - reserved" 0x20 user type is "DQ login" 0x10 user type is "replication login" 0x08 "fCacheConnect" 0x04 "fTranBoundary" 0x02 client is an ODBC driver 0x01 change to initial language must succeed 26 INT8 0x04 spawn user instance (TDS 7.2) 0x02 XML data type instances are returned as binary XML (TDS 7.2) 0x01 password change requested (TDS 7.2) 27 INT8 0x01 SQL Type: 0 = use default, 1 = use T-SQL (TDS 7.2) 28 INT8[4] time zone (0x88ffffff ???) 32 INT8[4] collation information 36 INT16 position of client hostname (86) 38 INT16 hostname length 40 INT16 position of username 42 INT16 username length 44 INT16 position of password 46 INT16 password length 48 INT16 position of app name 50 INT16 app name length 52 INT16 position of server name 54 INT16 server name length 56 INT16 position of remote server/password pairs 58 INT16 remote server/password pairs length 60 INT16 position of library name 62 INT16 library name length 64 INT16 position of language 66 INT16 language name (for italian "Italiano", coded UCS2) 68 INT16 position of database name 70 INT16 database name length 72 INT8[6] MAC address of client 78 INT16 position of auth portion 80 INT16 NT authentication length 82 INT16 next position (same as total packet size) 84 INT16 0 86 UCS2LE[n] hostname UCS2LE[n] username UCS2LE[n] encrypted password UCS2LE[n] app name UCS2LE[n] server name UCS2LE[n] library name UCS2LE[n] language name UCS2LE[n] database name NT Authentication packet NT Authentication packet 0 CHAR[8] authentication id "NTLMSSP\0" 8 INT32 1 message type 12 INT32 0xb201 flags 16 INT16 domain length 18 INT16 domain length 20 INT32 domain offset 24 INT16 hostname length 26 INT16 hostname length 28 INT32 hostname offset 32 CHAR[n] hostname CHAR[n] domain See documentation on Samba for detail (or search ntlm authentication for IIS) For mssql 2005 before hostname (byte 86) you have 86 INT16 next position, or position of file name for a database to be attached during the connection process 88 INT16 database filename length 90 INT16 new password position 92 INT16 new password length 94 UCS2LE[n] hostname ... (as above)
"current pos" is the starting byte address for a Unicode string within the packet. The length of that Unicode string immediately follows. That implies there are at least 2 more strings that could be defined. (character set??)
Username and password are empty if domain authentication is used.
If the client uses an authentication packet, the server replies with an Authentication token followed by an Authentication packet.
varies +------+ | auth | +------+ auth authentication data for NTLM this message 3
This packet usually follows Authentication token.
HEX | DEC | type | protocol | nullable | size | collate |
---|---|---|---|---|---|---|
0x1F | 31 | SYBVOID | 7+ | no | 0 | |
0x22 | 34 | SYBIMAGE | yes | 4 | ||
0x23 | 35 | SYBTEXT | yes | 4 | yes | |
0x24 | 36 | SYBUNIQUE | 7+ | yes | 1 | |
0x25 | 37 | SYBVARBINARY | yes | 1 | ||
0x26 | 38 | SYBINTN | yes | 1 | ||
0x27 | 39 | SYBVARCHAR | yes | 1 | ||
0x2D | 45 | SYBBINARY | yes | 1 | ||
0x2F | 47 | SYBCHAR | yes | 1 | ||
0x30 | 48 | SYBINT1 | no | 0 | ||
0x32 | 50 | SYBBIT | no | 0 | ||
0x34 | 52 | SYBINT2 | no | 0 | ||
0x38 | 56 | SYBINT4 | no | 0 | ||
0x3A | 58 | SYBDATETIME4 | no | 0 | ||
0x3B | 59 | SYBREAL | no | 0 | ||
0x3C | 60 | SYBMONEY | no | 0 | ||
0x3D | 61 | SYBDATETIME | no | 0 | ||
0x3E | 62 | SYBFLT8 | no | 0 | ||
0x40 | 64 | SYBSINT1 | 5 | no | 0 | |
0x41 | 65 | SYBUINT2 | 5 | no | 0 | |
0x42 | 66 | SYBUINT4 | 5 | no | 0 | |
0x43 | 67 | SYBUINT8 | 5 | no | 0 | |
0x62 | 98 | SYBVARIANT | 7+ | yes | 4 | |
0x63 | 99 | SYBNTEXT | 7+ | yes | 4 | yes |
0x67 | 103 | SYBNVARCHAR | 7+ | yes | 1 | |
0x68 | 104 | SYBBITN | yes | 1 | ||
0x6A | 106 | SYBDECIMAL | yes | 1 | ||
0x6C | 108 | SYBNUMERIC | yes | 1 | ||
0x6D | 109 | SYBFLTN | yes | 1 | ||
0x6E | 110 | SYBMONEYN | yes | 1 | ||
0x6F | 111 | SYBDATETIMN | yes | 1 | ||
0x7A | 122 | SYBMONEY4 | no | 0 | ||
0x7F | 127 | SYBINT8 | no | 0 | ||
0xA5 | 165 | XSYBVARBINARY | 7+ | yes | 2 * | |
0xA7 | 167 | XSYBVARCHAR | 7+ | yes | 2 * | yes |
0xAD | 173 | XSYBBINARY | 7+ | yes | 2 | |
0xAF | 175 | XSYBCHAR | 7+ | yes | 2 | yes |
0xE1 | 225 | SYBLONGBINARY | 5 | yes | 4 | |
0xE7 | 231 | XSYBNVARCHAR | 7+ | yes | 2 * | yes |
0xEF | 239 | XSYBNCHAR | 7+ | yes | 2 | yes |
* Under TDS 7.2+ these types allow size to be -1, representing varchar(max), varbinary(max) and nvarchar(max). Data representation for them changes:
The collation structure contains information about the character set encoding and comparison method.
INT16 INT16 INT8 +----------+--------+------------+ | codepage | flags | charset_id | +----------+--------+------------+ codepage windows codepage (see http://www.microsoft.com/globaldev/nlsweb/) also specified in lcid column of master..syslanguages flags sort flags 0x100 binary compare 0x080 width insensitive 0x040 Katatype insensitive 0x020 accent insensitive 0x010 case insensitive If binary flag is specified other flags are not present Low nibble of flags is a charset specifier (like chinese dialect) charset_id charset id in master..syscharsets table or zero for no SQL collations
Collations names can be obtained from select name from ::fn_helpcollations()
query
INT8 XCHAR[n] INT8 INT32 INT8 +-------------+--------------+-------+-------+---------+ | column name | column name | flags | user | column | | length | | | type | type | +-------------+--------------+-------+-------+---------+ varies INT8 INT8 INT16 XCHAR[n] INT8 varies +-------------+----------+----------+----------+------------+--------+--------+ | column size |precision | scale | t length | table name | locale | locale | | | | | | | length | info | | (optional) |(optional)|(optional)|(optional)| (optional) | (opt) | (opt) | +-------------+----------+----------+----------+------------+--------+--------+ column name length column name column name in result set, not necessarily db column name flags bit flags 0x1 hidden (TDS 5.0) 0x2 key 0x10 writable 0x20 can be NULL 0x40 identity user type usertype column from syscolumns column type column type column size not present for fixed size columns precision present only for SYBDECIMAL and SYBNUMERIC scale present only for SYBDECIMAL and SYBNUMERIC t length present only for SYBTEXT and SYBIMAGE, length of table name table name present only for SYBTEXT and SYBIMAGE locale length length of locale info (in bytes) only for TDS 5.0 results (not for parameters) locale info unknown only for TDS 5.0 results (not for parameters)
Normal tokens (contained in packets 0xF)
TODO
Special packets
This sample packet contain just SQL commands. It's supported by all TDS version (although TDS 5.0 have others token with similar use)
XCHAR[n] +---------+ | string | +---------+ string SQL text
Do not confuse an RPC packet with an RPC token. The RPC packet is supported by all version of TDS; the RPC token is supported only by TDS 5.0 (and has different format). This is the oldest (and the only one in mssql) way to call directly an RPC. Sybase also documents it, but as 0xE.
INT16 XCHAR[n] INT16 +-------------+----------+-------+----------+ | name length | rpc name | flags | params | +-------------+----------+-------+----------+ name length length of RPC name in characters. mssql2k+ support some core RPC using numbers If a number is used instead of name name length is marked as -1 (null) and a INT16 is used for the name. 0x1 1 sp_cursor 0x2 2 sp_cursoropen 0x3 3 sp_cursorprepare 0x4 4 sp_cursorexecute 0x5 5 sp_cursorprepexec 0x6 6 sp_cursorunprepare 0x7 7 sp_cursorfetch 0x8 8 sp_cursoroption 0x9 9 sp_cursorclose 0xA 10 sp_executesql 0xB 11 sp_prepare 0xC 12 sp_execute ??? 0xD 13 sp_prepexec 0xE 14 sp_prepexecrpc 0xF 15 sp_unprepare sp_execute seems to have some problems, even MS ODBC use name version instead of number. rpc name name of RPC. flags bit flags. 0x1 1 recompile procedure (TDS 7.0+/TDS 5.0) 0x2 2 no metadata (TDS 7.0+) (I don't know meaning of "no metadata" -- freddy77) params parameters. See below
Every parameter has the following structure
+-----------+------+ | data info | data | +-----------+------+ data info data information. See below data data. See results for detail
INT8 XCHAR[n] INT8 INT32 +-------------+------------+-------+--------------------+ | name length | param name | flags | usertype (TDS 5.0) | +-------------+------------+-------+--------------------+ INT8 varies varies INT8[5] INT8 +------+-------+----------+---------------+------------------+ | type | size | optional | collate | locale | | | (opt) | (opt) | info(TDS 7.1) | length (TDS 5.0) | +------+-------+----------+---------------+------------------+ name length parameter name length (0 if unused) param name parameter name flags bit Name Meaning 0x1 TDS_RPC_OUTPUT output parameter 0x2 TDS_RPC_NODEF output parameter has no default value. Valid only with TDS_RPC_OUTPUT. usertype usertype type param type size see Results optional see Results. Blobs DO NOT have optional on input parameters (output blob parameters are not supported by any version of TDS). collate info only for type that want collate info and using TDS 7.1 locale length locale information length. Usually 0 (if not locale information follow, the structure is unknown)
Under TDS 7.0+ is possible to chain multiple RPCs together. This is useful to limit packets and round-trips with server. RPCs can be chained using byte 0x80 (TDS 7.0/TDS 7.1) or 0xFF (TDS 7.2).
INT8 INT8 +-----+------------------------+----------------+-----+ | RPC | 0x80 (TDS 7.0/TDS 7.1) | 0xFF (TDS 7.2) | RPC | ... +-----+------------------------+----------------+-----+
This documents a TDS 5.0 packet. It might be true for others....
INT8 INT16 INT32 +----------+-------------+--------------------+ | packet | last packet | packet | | type = 7 | indicator | size | +----------+-------------+--------------------+ followed by N row buffers, where N is computed by exhausting the packet size
INT16 INT8 INT8 INT16 +--------------+----------+----------+--------------+ | size | ncols | zero | size (again) | +--------------+----------+----------+--------------+ followed by column buffers (data), where first, the fixed-size datatype columns fixed size and count (determined by column definition) +--------------+ | data .... | [repeats once for each mandatory column] +--------------+ then, the variable-size (including nullable) datatype columns variable size and count +--------------+ | data .... | [repeats ncol times] +--------------+ followed by two tables (!) to describe the column buffers Adjustment Table (optional) INT8 INT8 +----------+----------+ | 1 + ncols| offset | [repeats ncol + 1 times] +----------+----------+ Offset Table (mandatory) INT8 INT8 +----------+----------+ | 1 + ncols| offset | [repeats ncol + 1 times] +----------+----------+
The BCP packet has a slightly different Packet header!?
The offset and adjustment tables describe the postion of the first byte of each variable-size column. The first element holds the count of elements in the offset/adjustment table. Thereafer, the offsets are arranged in reverse order: the last element — which is also the last byte of the row buffer — holds the offset from the start of the row of the first variable-size column. The next-to-last offset table element holds the starting position of the second variable-size column, and so on.
The first element is 5 because there are five elements in the list. There are 4 column data buffers with 5 endpoints. The first column's data begins at offset 4. Computations:
Any column not accounted for is implicitly NULL. To represent a NULL column between two dataful columns, the offset table will have adjacent entries of the same value.
The so-called adjustment table provides for longer rows. The reader will note the Offset table has 8-bit elements, which would limit the width of the table: the last variable column would have to end less than 256 bytes from the start fo the row. Rather than changing the definition of the Offset table, a second table, the Adjustment table, was introduced. It holds high-order bytes for the column offsets.
In other words, to compute a variable column's offset from the start of the row buffer, the server looks up its offset table value, then consults the same position in the adjustment table, and splices them together.
The BCP packet is very dense. The data formats are governed by the table definition. Non-NULL columns of course must be present; there is no need to count them or compute their size. The NULL columns are undelimited; their boundaries are defined by the minimalist offset table.
The Adjustment table seems silly at first glance. Why not just make the offset table's elements 16 or or even 32 bits? The reason is overhead. Most rows will have less than 256 bytes of variable column data. By using the adjustment table, the BCP packet avoids adding one or even three empty bytes per column per row.
Why is the table in reverse order? Because that places the first offset at a known location: the end of each row gives the start of the first column. The server can work its way down the offset table and compute the column sizes. If they don't add up — if there are data between the (presumed) end of the last column and the start of the offset table — the server knows it should look for an adjustment table. Because the scheme is infinitely repeatable, rows could one day grow to terabyte widths without redefining the packet structure.
Responses from the server start with a single octet (token) identifying its type. If variable length, they generally have the length as the second and third bytes
Tokens encountered thus far:
HEX | DEC | name | note |
---|---|---|---|
0x20 | 32 | Param Format 2 | 5.0 only |
0x21 | 33 | Language | 5.0 only, client-side |
0x22 | 34 | OrderBy 2 | 5.0 only?? |
0x61 | 97 | Row Format 2 | 5.0 only |
0x71 | 113 | "Logout" | 5.0? ct_close(), client-side? |
0x79 | 121 | Return Status | |
0x7C | 124 | Process ID | 4.2 only |
0x80 | 128 | Cursor Close | 5.0 only |
0x81 | 129 | Cursor Delete | 5.0 only |
0x81 | 129 | 7.0 Result | 7.0 only |
0x82 | 130 | Cursor Fetch | 5.0 only |
0x83 | 131 | Cursor Info | 5.0 only |
0x84 | 132 | Cursor Open | 5.0 only |
0x86 | 134 | Cursor Declare | 5.0 only |
0x88 | 136 | 7.0 Compute Result | 7.0 only |
0xA0 | 160 | Column Name | 4.2 only |
0xA1 | 161 | Column Format | 4.2 only |
0xA3 | 163 | Dynamic 2 | 5.0 only |
0xA4 | 164 | Table names | name of tables in a FOR BROWSE select |
0xA5 | 165 | Column Info | column information in a FOR BROWSE select |
0xA6 | 166 | Option Cmd | 5.0 only |
0xA7 | 167 | Compute Names | |
0xA8 | 168 | Compute Result | |
0xA9 | 169 | Order By | |
0xAA | 170 | Error Message | |
0xAB | 171 | Info Message | |
0xAC | 172 | Output Parameters | |
0xAD | 173 | Login Acknowledgement | |
0xAE | 174 | Control | |
0xD1 | 209 | Data --- Row Result | |
0xD3 | 211 | Data --- Compute Result | |
0xD7 | 215 | Params | 5.0 only |
0xE2 | 226 | Capability | 5.0 only. Information on server |
0xE3 | 227 | Environment Change | (database change, packet size, etc...) |
0xE5 | 229 | Extended Error Message | |
0xE6 | 230 | DBRPC | 5.0 only RPC calls |
0xE7 | 231 | Dynamic | 5.0 only |
0xEC | 236 | Param Format | 5.0 only |
0xED | 237 | Authentication | 7.0 only |
0xEE | 238 | Result Set | 5.0 only |
0xFD | 253 | Result Set Done | |
0xFE | 254 | Process Done | |
0xFF | 255 | Done inside Process |
INT32 INT8 CHAR[n] +--------+--------+-------+ | length | status | query | +--------+--------+-------+ length total token length status 0 no args 1 has args (followed by PARAMFMT/PARAMS) query query (total length - 1)
No information. (1 byte, value=0 ?)
INT32 +---------------+ | Return status | +---------------+
The return value of a stored procedure.
8 bytes +----------------+ | process number | +----------------+
Presumably the process ID number for an executing stored procedure. (I'm not sure how this would ever be used by a client. *mjs*)
INT16 +----------+-------------+ | #columns | column_info | +----------+-------------+
The TDS 7.0 column_info is formatted as follows for each column:
INT16 INT16 INT8 varies varies INT8[5] INT8 UCS2LE[n] +----------+-------+------+-------+----------+---------------+-------------+---------+ | usertype | flags | type | size | optional | collate | name length | name | | | | | (opt) | (opt) | info(TDS 7.1) | | | +----------+-------+------+-------+----------+---------------+-------------+---------+ usertype type modifier flags bit flags 0x1 can be NULL 0x8 can be written (it's not an expression) 0x10 identity type data type, values >128 indicate a large type size none for fixed size types 4 bytes for blob and text 2 bytes for large types 1 byte for all others optional INT8 INT8 +-----------+-------+ numeric/decimal types: | precision | scale | +-----------+-------+ INT16 UCS2LE[n] +-------------------+------------+ blob/text types: | table name length | table name | +-------------------+------------+ collate info are available only using TDS 7.1 and for characters types (but not for old type like short VARCHAR, only 2byte length versions)
TODO.
TODO.
TODO.
TODO.
TODO.
INT16 INT8 CHAR[n] INT8 CHAR[n] +--------------+---------+--------------+------+---------+--------------+ | total length | length1 | column1 name | .... | lengthN | columnN name | +--------------+---------+--------------+------+---------+--------------+
This token is the first token that contain result informations. Is usually followed by Column Format token (0xA1 161)
INT16 +--------------+-------------+ | total length | column_info | +--------------+-------------+
The number of columns is the same of previous Column Name token.
The TDS 4.2 column_info is formatted as follows for each column:
INT8[4] INT8 varies varies +-----------+------+-------+----------+ | usertype/ | type | size | optional | | flags | | (opt) | (opt) | +-----------+------+-------+----------+ usertype/flags for Sybase INT32 +----------+ | usertype | +----------+ usertype/flags for MSSQL INT16 +----------+-------+ | usertype | flags | +----------+-------+ usertype type modifier flags bit flags (only MSSQL) 0x1 can be NULL 0x8 can be written (it's not an expression) 0x10 identity type data type size none for fixed size types 4 bytes for blob and text 1 byte for all others (TDS 4.2 do not support large types) optional INT8 INT8 +-----------+-------+ numeric/decimal types: | precision | scale | (supported??) +-----------+-------+ INT16 CHAR[n] +-------------------+------------+ blob/text types: | table name length | table name | +-------------------+------------+
TODO.
TODO.
INT16 INT16 INT8 varies INT8 INT8[n] +--------------+------------+----------+-------------+---------+-------+ | total length | compute id | #columns | column info | #bycols | bycol | +--------------+------------+----------+-------------+---------+-------+
column info:
INT8 INT8 INT32 INT8 varies INT8 varies +----------+---------+----------+--------+-------+------------------+----------------+ | operator | operand | usertype | column | size | locale length | locale info | | | | | type | (opt) | info (TDS 5.0) | (TDS 5.0) | +----------+---------+----------+--------+-------+------------------+----------------+ operator operator 0x4b COUNT 0x4c UNSIGNED? COUNT 0x4d SUM 0x4e UNSIGNED? SUM 0x4f AVG 0x50 UNSIGNED? AVG 0x51 MIN 0x52 MAX 0x09 COUNT_BIG (mssql2k) 0x30 STDEV (mssql2k) 0x31 STDEVP (mssql2k) 0x32 VAR (mssql2k) 0x33 VARP (mssql2k) 0x72 CHECKSUM_AGG (mssql2k) operand ??? usertype usertype column type data type size data size locale length length of locale informations locale info locale informations (unknown)
Each bycol information contains column info for a specific column.
TODO: optional possible?? collate infos ??
TDS4/5/7:
INT16 INT16 XCHAR[n] +--------------+-------------+------------+ | total length | name length | table name | ... +--------------+-------------+------------+ name length table name length table name table name
TDS 7.1:
INT16 varies +--------------+-------------+ | total length | table names | +--------------+-------------+ table name: INT8 INT16 XCHAR[n] +-----------------+------------------+----------------+ | # of components | component length | component name | +-----------------+------------------+----------------+ ie: name -> 01 04 00 ucs2le "name" db..name -> 03 02 00 ucs2le "db" 00 00 04 00 ucs2le "name" db.dbo.name -> 03 02 00 ucs2le "db" 03 00 ucs2le "dbo" 04 00 ucs2le "name"
INT16 varies +--------------+--------------+ | total length | column infos | +--------------+--------------+
column info:
INT8 INT8 INT8 INT8 XCHAR[n] +-------+-------------+-------+-------------+-------------+ | index | table index | flags | name length | column name | | | | | (opt) | (opt) | +-------+-------------+-------+-------------+-------------+ index index in result format (1-based) table index index in previous TabName (1-based) 0 means no table (ie computed) flags set of flags 0x04 expression 0x08 key 0x10 hidden 0x20 column name present name length length of following column column name real column name (result contain the label)
This token follow TabName token
Miscellaneous note (from *bdw* ?) found with 0xAE:
has one byte for each column, comes between result(238) and first row(209), I believe computed column info is stored here, need to investigate
INT16 variable (1 byte per col) +--------+---------+ | length | orders | +--------+---------+ length Length of packet(and number of cols) orders one byte per order by indicating the column # in the output matching the order from Column Info and Column Names and data in following Row Data items. A 0 indicates the column is not in the resulting rows. an example: select first_name, last_name, number from employee order by salary, number assuming the columns are returned in the order queried: first_name then last_name, then number. we would have: ---------------- | 2 | 0 | 3 | ---------------- where length = 2 then the orders evaluate: 0 for salary, meaning there is no salary data returned 3 for number, meaning the 3rd data item corresponding to a column is the number
INT16 INT32 INT8 INT8 +--------+------------+-------+-------+ | length | msg number | state | level | +--------+------------+-------+-------+ INT16 XCHAR[n] INT8 XCHAR[n] INT8 XCHAR[n] INT16 INT32 +----------+---------+----------+---------+----------+---------+-----------------+-----------------+ | m length | message | s length | server | p length | process | line#(TDS 7.1-) | line# (TDS 7.2) | +----------+---------+----------+---------+----------+---------+-----------------+-----------------+ length Length of packet msg number SQL message number state ? level An error if level > 10, a message if level <= 10 m length Length of message message Text of error/message s length Length of server name server Name of "server" ? p length Length of process name process name Stored procedure name, if any line# Line number of input which generated the message
Output parameters of a stored procedure.
INT16 INT8 XCHAR[n] INT8 INT32 INT8 +--------+----------+---------+-------+----------+----------+------+ | length | c length | colname | flags | usertype | datatype | .... | +--------+----------+---------+-------+----------+----------+------+ length Length of packet c length Length of colname colname Name of column flags 0x1 Nullable usertype cf. systypes table in database datatype Type of data returned The trailing information depends on whether the datatype is a fixed size datatype. N bytes +---------+ Datatype of fixed size N | data | +---------+ INT8 INT8 N bytes +-------------+---------------+--------+ Otherwise | column size | actual size N | data | +-------------+---------------+--------+
INT16 INT8 4 bytes INT8 XCHAR[n] 4 bytes +--------+-------+---------+----------+--------+----------+ | length | ack | version | t length | text | ser_ver | +--------+-------+---------+----------+--------+----------+ length length of packet ack 0x01 success 4.2 0x05 success 5.0 0x06 failure 5.0 version TDS version 4 bytes: major.minor.?.? t length length of text text server name (ie 'Microsoft SQL Server') ser_ver Server version (with strange encoding, differring from TDS version)
INT8 variable size +----------+--------------------+ | token | row data | +----------+--------------------+
Row data starts with one byte (decimal 209), for variable length types, a one byte length field precedes the data, for fixed length records just the data appears.
Note: nullable integers and floats are variable length.
For example: sp_who
The first field is spid, a smallint
The second field is status a char(12), in our example "recv sleep "
The row would look like this:
byte 0 is the token bytes 1-2 are a smallint in low-endian byte 3 is the length of the char field bytes 4-15 is the char field byte 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 hex D1 01 00 0C 72 65 63 76 20 73 6C 65 65 70 20 20 209 1 0 12 r e c v ' ' s l e e p ' ' ' '
TODO.
INT16 variable +--------+--------------+ | length | capabilities | +--------+--------------+ length Length of capability string capabilities Server capabilities? Related to login magic?
INT16 INT8 INT8 CHAR[n] INT8 CHAR[n] +--------+----------+-----------+---------+-----------+---------+ | length | env code | t1 length | text1 | t2 length | text2 | +--------+----------+-----------+---------+-----------+---------+ env code Code for what part of environment changed 0x01 database context 0x02 language 0x03 character set 0x04 packet size 0x05 TDS 7.0+ LCID 0x06 TDS 7.0+ ??? (sort method? sql server encoding?) 0x07 Collation info text1 Old value text2 New value text1 and text2 are text information (coded in ucs2 in TDS 7.0+) except collation info that's a structure (see collation structure)
TODO.
TODO.
INT16 INT16 variable size +---------+------------+-------------------+ | length | number of | parameter info | | | parameters | | +---------+------------+-------------------+ length length of message following this field number of parameters number of parameter formats following list of formats I (*bdw*) imagine it uses the column format structure.
INT16 varies +---------+------+ | length | auth | +---------+------+ length length of authentication data following this field auth authentication data for NTLM this is message 2
Client reply with Authentication packet.
INT16 INT16 variable size +---------+------------+-----------------+ | length | number of | column info | | | columns | | +---------+------------+-----------------+ Fields: length length of message following this field number of columns number of columns in the result set, this many column information fields will follow. column info column info
Result Set Done (0xFD 253)
Process Done (0xFE 254)
Done Inside Process (0xFF 255)
INT16 INT16 INT32 INT64 +-----------+---------+----------------------+---------------------+ | bit flags | unknown | row count (TDS 7.1-) | row count (TDS 7.2) | +-----------+---------+----------------------+---------------------+ Fields: bit flags 0x01 more results 0x02 error (like invalid sql syntax) 0x10 row count is valid 0x20 cancelled unknown 2,0 /* something to do with block size perhaps */ row count number of rows affected / returned in the result set. row count is 64-bit using TDS 7.2. (FIXME check if "affected / returned" is correct)
"Result Set Complete" is the end of a query that doesn't create a process on the server. I.e., it doesn't call a stored procedure.
"Process Done" is the end of a stored procedure
"Done In Process" means that a query internal to a stored procedure has finished, but the stored procedure isn't done overall.
The following people have contributed to this document:
(short list)
$Id: tds.html,v 1.41 2008/11/25 23:38:33 jklowden Exp $