妤氱殑姒傚康锛屽杩涚煡璇嗭紝绫讳技浜庢墦RPG娓告垙鐨勫崌绾с�傛暣鐞嗚繖绡囨枃绔犵殑鍔ㄦ満鏄袱涓棶棰橈細
闂涓�锛� 浣跨敤Windows璁颁簨鏈殑鈥滃彟瀛樹负鈥濓紝鍙互鍦℅BK銆乁nicode銆乁nicode big endian鍜孶TF-8杩� 鍑犵缂栫爜鏂瑰紡闂寸浉浜掕浆鎹€�傚悓鏍锋槸txt鏂囦欢锛學indows鏄�庢牱璇嗗埆缂栫爜鏂瑰紡鐨勫憿锛�
鎴戝緢鏃╁墠灏卞彂鐜癠nicode銆乁nicode big endian鍜孶TF-8缂栫爜鐨則xt鏂囦欢鐨勫紑澶翠細澶氬嚭鍑犱釜瀛� 鑺傦紝鍒嗗埆鏄疐F銆丗E锛圲nicode锛�,FE銆丗F锛圲nicode big endian锛�,EF銆丅B銆丅F锛圲TF-8锛夈�� 浣嗚繖浜涙爣璁版槸鍩轰簬浠�涔堟爣鍑嗗憿锛�
闂浜岋細 鏈�杩戝湪缃戜笂鐪嬪埌涓�涓狢onvertUTF.c锛屽疄鐜颁簡UTF-32銆乁TF-16鍜孶TF-8杩欎笁绉嶇紪鐮佹柟寮忕殑鐩镐簰 杞崲銆傚浜嶶nicode(UCS2)銆丟BK銆乁TF-8杩欎簺缂栫爜鏂瑰紡锛屾垜鍘熸潵灏变簡瑙c�備絾杩欎釜绋嬪簭璁╂垜鏈� 浜涚硦娑傦紝鎯充笉璧锋潵UTF-16鍜孶CS2鏈変粈涔堝叧绯汇��
鏌ヤ簡鏌ョ浉鍏宠祫鏂欙紝鎬荤畻灏嗚繖浜涢棶棰樺紕娓呮浜嗭紝椤哄甫涔熶簡瑙d簡涓�浜沀nicode鐨勭粏鑺傘�備綔鑰呭啓鎴� 涓�绡囨枃绔狅紝閫佺粰鏈夎繃绫讳技鐤戦棶鐨勬湅鍙嬨�傛湰鏂囧湪鍐欎綔鏃跺敖閲忓仛鍒伴�氫織鏄撴噦锛屼絾瑕佹眰璇昏�呯煡閬� 浠�涔堟槸瀛楄妭锛屼粈涔堟槸鍗佸叚杩涘埗銆�
0銆乥ig endian鍜宭ittle endian big endian鍜宭ittle endian鏄疌PU澶勭悊澶氬瓧鑺傛暟鐨勪笉鍚屾柟寮忋�備緥濡傗�滄眽鈥濆瓧鐨刄nicode缂栫爜 鏄�6C49銆傞偅涔堝啓鍒版枃浠堕噷鏃讹紝绌剁珶鏄皢6C鍐欏湪鍓嶉潰锛岃繕鏄皢49鍐欏湪鍓嶉潰锛熷鏋滃皢6C鍐欏湪鍓� 闈紝灏辨槸big endian銆傚鏋滃皢49鍐欏湪鍓嶉潰锛屽氨鏄痩ittle endian銆�
鈥渆ndian鈥濊繖涓瘝鍑鸿嚜銆婃牸鍒椾經娓歌銆嬨�傚皬浜哄浗鐨勫唴鎴樺氨婧愪簬鍚冮浮铔嬫椂鏄┒绔熶粠澶уご(Big -Endian)鏁插紑杩樻槸浠庡皬澶�(Little-Endian)鏁插紑锛岀敱姝ゆ浘鍙戠敓杩囧叚娆″彌涔憋紝涓�涓殗甯濋�佷簡鍛� 锛屽彟涓�涓涪浜嗙帇浣嶃��
鎴戜滑涓�鑸皢endian缈昏瘧鎴愨�滃瓧鑺傚簭鈥濓紝灏哹ig endian鍜宭ittle endian绉颁綔鈥滃ぇ灏锯�濆拰鈥滃皬 灏锯�濄��
1銆佸瓧绗︾紪鐮併�佸唴鐮侊紝椤哄甫浠嬬粛姹夊瓧缂栫爜
瀛楃蹇呴』缂栫爜鍚庢墠鑳借璁$畻鏈哄鐞嗐�傝绠楁満浣跨敤鐨勭己鐪佺紪鐮佹柟寮忓氨鏄绠楁満鐨勫唴鐮併�傛棭鏈� 鐨勮绠楁満浣跨敤7浣嶇殑ASCII缂栫爜锛屼负浜嗗鐞嗘眽瀛楋紝绋嬪簭鍛樿璁′簡鐢ㄤ簬绠�浣撲腑鏂囩殑GB2312鍜岀敤 浜庣箒浣撲腑鏂囩殑big5銆�
GB2312(1980骞�)涓�鍏辨敹褰曚簡7445涓瓧绗︼紝鍖呮嫭6763涓眽瀛楀拰682涓叾瀹冪鍙枫�傛眽瀛楀尯鐨勫唴鐮� 鑼冨洿楂樺瓧鑺備粠B0-F7锛屼綆瀛楄妭浠嶢1-FE锛屽崰鐢ㄧ殑鐮佷綅鏄�72*94=6768銆傚叾涓湁5涓┖浣嶆槸D7FA- D7FE銆�
GB2312鏀寔鐨勬眽瀛楀お灏戙��1995骞寸殑姹夊瓧鎵╁睍瑙勮寖GBK1.0鏀跺綍浜�21886涓鍙凤紝瀹冨垎涓烘眽瀛楀尯 鍜屽浘褰㈢鍙峰尯銆傛眽瀛楀尯鍖呮嫭 21003涓瓧绗︺��2000骞寸殑GB18030鏄彇浠BK1.0鐨勬寮忓浗瀹舵爣鍑� 銆傝鏍囧噯鏀跺綍浜�27484涓眽瀛楋紝鍚屾椂杩樻敹褰曚簡钘忔枃銆佽挋鏂囥�佺淮鍚惧皵鏂囩瓑涓昏鐨勫皯鏁版皯鏃忔枃瀛� 銆傜幇鍦ㄧ殑PC骞冲彴蹇呴』鏀寔GB18030锛屽宓屽叆寮忎骇鍝佹殏涓嶄綔瑕佹眰銆傛墍浠ユ墜鏈恒�丮P3涓�鑸彧鏀寔 GB2312銆�
浠嶢SCII銆丟B2312銆丟BK鍒癎B18030锛岃繖浜涚紪鐮佹柟娉曟槸鍚戜笅鍏煎鐨勶紝鍗冲悓涓�涓瓧绗﹀湪杩欎簺鏂规 涓�绘槸鏈夌浉鍚岀殑缂栫爜锛屽悗闈㈢殑鏍囧噯鏀寔鏇村鐨勫瓧绗︺�傚湪杩欎簺缂栫爜涓紝鑻辨枃鍜屼腑鏂囧彲浠ョ粺涓� 鍦板鐞嗐�傚尯鍒嗕腑鏂囩紪鐮佺殑鏂规硶鏄珮瀛楄妭鐨勬渶楂樹綅涓嶄负0銆傛寜鐓х▼搴忓憳鐨勭О鍛硷紝GB2312銆丟BK 鍒� GB18030閮藉睘浜庡弻瀛楄妭瀛楃闆� (DBCS)銆�
鏈夌殑涓枃Windows鐨勭己鐪佸唴鐮佽繕鏄疓BK锛屽彲浠ラ�氳繃GB18030鍗囩骇鍖呭崌绾у埌GB18030銆備笉杩嘒B18 030鐩稿GBK澧炲姞鐨勫瓧绗︼紝鏅�氫汉鏄緢闅剧敤鍒扮殑锛岄�氬父鎴戜滑杩樻槸鐢℅BK鎸囦唬涓枃Windows鍐呯爜 銆�
杩欓噷杩樻湁涓�浜涚粏鑺傦細
GB2312鐨勫師鏂囪繕鏄尯浣嶇爜锛屼粠鍖轰綅鐮佸埌鍐呯爜锛岄渶瑕佸湪楂樺瓧鑺傚拰浣庡瓧鑺備笂鍒嗗埆鍔犱笂A0銆�
鍦―BCS涓紝GB鍐呯爜鐨勫瓨鍌ㄦ牸寮忓缁堟槸big endian锛屽嵆楂樹綅鍦ㄥ墠銆�
GB2312鐨勪袱涓瓧鑺傜殑鏈�楂樹綅閮芥槸1銆備絾绗﹀悎杩欎釜鏉′欢鐨勭爜浣嶅彧鏈�128*128=16384涓�傛墍浠B K鍜孏B18030鐨勪綆瀛楄妭鏈�楂樹綅閮藉彲鑳戒笉鏄�1銆備笉杩囪繖涓嶅奖鍝岲BCS瀛楃娴佺殑瑙f瀽锛氬湪璇诲彇DBCS瀛� 绗︽祦鏃讹紝鍙閬囧埌楂樹綅涓�1鐨勫瓧鑺傦紝灏卞彲浠ュ皢涓嬩袱涓瓧鑺備綔涓轰竴涓弻瀛楄妭缂栫爜锛岃�屼笉鐢ㄧ浣� 瀛楄妭鐨勯珮浣嶆槸浠�涔堛��
2銆乁nicode銆乁CS鍜孶TF
鍓嶉潰鎻愬埌浠嶢SCII銆丟B2312銆丟BK鍒癎B18030鐨勭紪鐮佹柟娉曟槸鍚戜笅鍏煎鐨勩�傝�孶nicode鍙笌ASCI I鍏煎锛堟洿鍑嗙‘鍦拌锛屾槸涓嶪SO-8859-1鍏煎锛夛紝涓嶨B鐮佷笉鍏煎銆備緥濡傗�滄眽鈥濆瓧鐨刄nicode缂� 鐮佹槸6C49锛岃�孏B鐮佹槸BABA銆�
Unicode涔熸槸涓�绉嶅瓧绗︾紪鐮佹柟娉曪紝涓嶈繃瀹冩槸鐢卞浗闄呯粍缁囪璁★紝鍙互瀹圭撼鍏ㄤ笘鐣屾墍鏈夎瑷�鏂囧瓧 鐨勭紪鐮佹柟妗堛�俇nicode鐨勫鍚嶆槸 "Universal Multiple-Octet Coded Character Set"锛岀畝 绉颁负UCS銆俇CS鍙互鐪嬩綔鏄�"Unicode Character Set"鐨勭缉鍐欍��
鏍规嵁缁村熀鐧剧鍏ㄤ功( http://zh.wikipedia.org/wiki/ )鐨勮杞斤細鍘嗗彶涓婂瓨鍦ㄤ袱涓瘯鍥剧嫭绔� 璁捐Unicode鐨勭粍缁囷紝鍗冲浗闄呮爣鍑嗗寲缁勭粐锛圛SO锛夊拰涓�涓蒋浠跺埗閫犲晢鐨勫崗浼氾紙unicode.org锛� 銆侷SO寮�鍙戜簡ISO 10646椤圭洰锛孶nicode鍗忎細寮�鍙戜簡Unicode椤圭洰銆�
鍦�1991骞村墠鍚庯紝鍙屾柟閮借璇嗗埌涓栫晫涓嶉渶瑕佷袱涓笉鍏煎鐨勫瓧绗﹂泦銆備簬鏄畠浠紑濮嬪悎骞跺弻鏂圭殑 宸ヤ綔鎴愭灉锛屽苟涓哄垱绔嬩竴涓崟涓�缂栫爜琛ㄨ�屽崗鍚屽伐浣溿�備粠Unicode2.0寮�濮嬶紝Unicode椤圭洰閲囩敤浜� 涓嶪SO 10646-1鐩稿悓鐨勫瓧搴撳拰瀛楃爜銆�
鐩墠涓や釜椤圭洰浠嶉兘瀛樺湪锛屽苟鐙珛鍦板叕甯冨悇鑷殑鏍囧噯銆俇nicode鍗忎細鐜板湪鐨勬渶鏂扮増鏈槸2005骞� 鐨刄nicode 4.1.0銆侷SO鐨勬渶鏂版爣鍑嗘槸ISO 10646-3:2003銆�
UCS鍙槸瑙勫畾濡備綍缂栫爜锛屽苟娌℃湁瑙勫畾濡備綍浼犺緭銆佷繚瀛樿繖涓紪鐮併�備緥濡傗�滄眽鈥濆瓧鐨刄CS缂栫爜鏄� 6C49锛屾垜鍙互鐢�4涓猘scii鏁板瓧鏉ヤ紶杈撱�佷繚瀛樿繖涓紪鐮侊紱涔熷彲浠ョ敤utf-8缂栫爜:3涓繛缁殑瀛楄妭 E6 B1 89鏉ヨ〃绀哄畠銆傚叧閿湪浜庨�氫俊鍙屾柟閮借璁ゅ彲銆俇TF-8銆乁TF-7銆乁TF-16閮芥槸琚箍娉涙帴鍙� 鐨勬柟妗堛�俇TF-8鐨勪竴涓壒鍒殑濂藉鏄畠涓嶪SO- 8859-1瀹屽叏鍏煎銆俇TF鏄�淯CS Transformat ion Format鈥濈殑缂╁啓銆�
IETF鐨凴FC2781鍜孯FC3629浠FC鐨勪竴璐鏍硷紝娓呮櫚銆佹槑蹇張涓嶅け涓ヨ皑鍦版弿杩颁簡UTF-16鍜孶TF -8鐨勭紪鐮佹柟娉曘�傛垜鎬绘槸璁颁笉寰桰ETF鏄疘nternet Engineering Task Force鐨勭缉鍐欍�備絾IETF璐� 璐g淮鎶ょ殑RFC鏄疘nternet涓婁竴鍒囪鑼冪殑鍩虹銆�
2.1銆佸唴鐮佸拰code page
鐩墠Windows鐨勫唴鏍稿凡缁忛噰鐢║nicode缂栫爜锛岃繖鏍峰湪鍐呮牳涓婂彲浠ユ敮鎸佸叏涓栫晫鎵�鏈夌殑璇█鏂囧瓧 銆備絾鏄敱浜庣幇鏈夌殑澶ч噺绋嬪簭鍜屾枃妗i兘閲囩敤浜嗘煇绉嶇壒瀹氳瑷�鐨勭紪鐮侊紝渚嬪GBK锛學indows涓嶅彲 鑳戒笉鏀寔鐜版湁鐨勭紪鐮侊紝鑰屽叏閮ㄦ敼鐢║nicode銆�
Windows浣跨敤浠g爜椤�(code page)鏉ラ�傚簲鍚勪釜鍥藉鍜屽湴鍖恒�俢ode page鍙互琚悊瑙d负鍓嶉潰鎻愬埌 鐨勫唴鐮併�侴BK瀵瑰簲鐨刢ode page鏄疌P936銆�
寰蒋涔熶负GB18030瀹氫箟浜哻ode page锛欳P54936銆備絾鏄敱浜嶨B18030鏈変竴閮ㄥ垎4瀛楄妭缂栫爜锛岃�學 indows鐨勪唬鐮侀〉鍙敮鎸佸崟瀛楄妭鍜屽弻瀛楄妭缂栫爜锛屾墍浠ヨ繖涓猚ode page鏄棤娉曠湡姝d娇鐢ㄧ殑銆�
3銆乁CS-2銆乁CS-4銆丅MP
UCS鏈変袱绉嶆牸寮忥細UCS-2鍜孶CS-4銆傞【鍚嶆�濅箟锛孶CS-2灏辨槸鐢ㄤ袱涓瓧鑺傜紪鐮侊紝UCS-4灏辨槸鐢�4涓� 瀛楄妭锛堝疄闄呬笂鍙敤浜�31浣嶏紝鏈�楂樹綅蹇呴』涓�0锛夌紪鐮併�備笅闈㈣鎴戜滑鍋氫竴浜涚畝鍗曠殑鏁板娓告垙锛�
UCS-2鏈�2^16=65536涓爜浣嶏紝UCS-4鏈�2^31=2147483648涓爜浣嶃��
UCS-4鏍规嵁鏈�楂樹綅涓�0鐨勬渶楂樺瓧鑺傚垎鎴�2^7=128涓猤roup銆傛瘡涓猤roup鍐嶆牴鎹楂樺瓧鑺傚垎涓�256 涓猵lane銆傛瘡涓� plane鏍规嵁绗�3涓瓧鑺傚垎涓�256琛�(rows)锛屾瘡琛屽寘鍚�256涓猚ells銆傚綋鐒跺悓涓�琛� 鐨刢ells鍙槸鏈�鍚庝竴涓瓧鑺備笉鍚岋紝鍏朵綑閮界浉鍚屻��
group 0鐨刾lane 0琚О浣淏asic Multilingual Plane, 鍗矪MP銆傛垨鑰呰UCS-4涓紝楂樹袱涓瓧 鑺備负0鐨勭爜浣嶈绉颁綔BMP銆�
灏哢CS-4鐨凚MP鍘绘帀鍓嶉潰鐨勪袱涓浂瀛楄妭灏卞緱鍒颁簡UCS-2銆傚湪UCS-2鐨勪袱涓瓧鑺傚墠鍔犱笂涓や釜闆跺瓧 鑺傦紝灏卞緱鍒颁簡UCS-4鐨凚MP銆傝�岀洰鍓嶇殑UCS-4瑙勮寖涓繕娌℃湁浠讳綍瀛楃琚垎閰嶅湪BMP涔嬪銆�
4銆乁TF缂栫爜
UTF-8灏辨槸浠�8浣嶄负鍗曞厓瀵筓CS杩涜缂栫爜銆備粠UCS-2鍒癠TF-8鐨勭紪鐮佹柟寮忓涓嬶細
UCS-2缂栫爜(16杩涘埗) UTF-8 瀛楄妭娴�(浜岃繘鍒�) 0000 - 007F 0xxxxxxx 0080 - 07FF 110xxx xx 10xxxxxx 0800 - FFFF 1110xxxx 10xxxxxx 10xxxxxx
渚嬪鈥滄眽鈥濆瓧鐨刄nicode缂栫爜鏄�6C49銆�6C49鍦�0800-FFFF涔嬮棿锛屾墍浠ヨ偗瀹氳鐢�3瀛楄妭妯℃澘浜嗭細 1110xxxx 10xxxxxx 10xxxxxx銆傚皢6C49鍐欐垚浜岃繘鍒舵槸锛�0110 110001 001001锛岀敤杩欎釜姣旂壒 娴佷緷娆′唬鏇挎ā鏉夸腑鐨剎锛屽緱鍒帮細11100110 10110001 10001001锛屽嵆E6 B1 89銆�
璇昏�呭彲浠ョ敤璁颁簨鏈祴璇曚竴涓嬫垜浠殑缂栫爜鏄惁姝g‘銆傞渶瑕佹敞鎰忥紝UltraEdit鍦ㄦ墦寮�utf-8缂栫爜 鐨勬枃鏈枃浠舵椂浼氳嚜鍔ㄨ浆鎹负UTF-16锛屽彲鑳戒骇鐢熸贩娣嗐�備綘鍙互鍦ㄨ缃腑鍏虫帀杩欎釜閫夐」銆傛洿濂� 鐨勫伐鍏锋槸Hex Workshop銆�
UTF-16浠�16浣嶄负鍗曞厓瀵筓CS杩涜缂栫爜銆傚浜庡皬浜�0x10000鐨刄CS鐮侊紝UTF-16缂栫爜灏辩瓑浜嶶CS鐮� 瀵瑰簲鐨�16浣嶆棤绗﹀彿鏁存暟銆傚浜庝笉灏忎簬0x10000鐨刄CS鐮侊紝瀹氫箟浜嗕竴涓畻娉曘�備笉杩囩敱浜庡疄闄呬娇 鐢ㄧ殑UCS2锛屾垨鑰匲CS4鐨凚MP蹇呯劧灏忎簬0x10000锛屾墍浠ュ氨鐩墠鑰岃█锛屽彲浠ヨ涓篣TF-16鍜孶CS-2鍩� 鏈浉鍚屻�備絾UCS-2鍙槸涓�涓紪鐮佹柟妗堬紝UTF-16鍗磋鐢ㄤ簬瀹為檯鐨勪紶杈擄紝鎵�浠ュ氨涓嶅緱涓嶈�冭檻瀛楄妭 搴忕殑闂銆�
5銆乁TF鐨勫瓧鑺傚簭鍜孊OM
UTF-8浠ュ瓧鑺備负缂栫爜鍗曞厓锛屾病鏈夊瓧鑺傚簭鐨勯棶棰樸�俇TF-16浠ヤ袱涓瓧鑺備负缂栫爜鍗曞厓锛屽湪瑙i噴涓�涓� UTF-16鏂囨湰鍓嶏紝棣栧厛瑕佸紕娓呮姣忎釜缂栫爜鍗曞厓鐨勫瓧鑺傚簭銆備緥濡傗�滃鈥濈殑Unicode缂栫爜鏄�594E锛� 鈥滀箼鈥濈殑Unicode缂栫爜鏄�4E59銆傚鏋滄垜浠敹鍒癠TF-16瀛楄妭娴佲��594E鈥濓紝閭d箞杩欐槸鈥滃鈥� 杩� 鏄�滀箼鈥濓紵
Unicode瑙勮寖涓帹鑽愮殑鏍囪瀛楄妭椤哄簭鐨勬柟娉曟槸BOM銆侭OM涓嶆槸鈥淏ill Of Material鈥濈殑BOM琛� 锛岃�屾槸Byte Order Mark銆侭OM鏄竴涓湁鐐瑰皬鑱槑鐨勬兂娉曪細
鍦║CS缂栫爜涓湁涓�涓彨鍋�"ZERO WIDTH NO-BREAK SPACE"鐨勫瓧绗︼紝瀹冪殑缂栫爜鏄疐EFF銆傝�孎FFE 鍦║CS涓槸涓嶅瓨鍦ㄧ殑瀛楃锛屾墍浠ヤ笉搴旇鍑虹幇鍦ㄥ疄闄呬紶杈撲腑銆俇CS瑙勮寖寤鸿鎴戜滑鍦ㄤ紶杈撳瓧鑺傛祦 鍓嶏紝鍏堜紶杈撳瓧绗�"ZERO WIDTH NO-BREAK SPACE"銆�
杩欐牱濡傛灉鎺ユ敹鑰呮敹鍒癋EFF锛屽氨琛ㄦ槑杩欎釜瀛楄妭娴佹槸Big-Endian鐨勶紱濡傛灉鏀跺埌FFFE锛屽氨琛ㄦ槑杩� 涓瓧鑺傛祦鏄疞ittle-Endian鐨勩�傚洜姝ゅ瓧绗�"ZERO WIDTH NO-BREAK SPACE"鍙堣绉颁綔BOM銆�
UTF-8涓嶉渶瑕丅OM鏉ヨ〃鏄庡瓧鑺傞『搴忥紝浣嗗彲浠ョ敤BOM鏉ヨ〃鏄庣紪鐮佹柟寮忋�傚瓧绗�"ZERO WIDTH NO-BR EAK SPACE"鐨刄TF-8缂栫爜鏄疎F BB BF锛堣鑰呭彲浠ョ敤鎴戜滑鍓嶉潰浠嬬粛鐨勭紪鐮佹柟娉曢獙璇佷竴涓嬶級銆傛墍 浠ュ鏋滄帴鏀惰�呮敹鍒颁互EF BB BF寮�澶寸殑瀛楄妭娴侊紝灏辩煡閬撹繖鏄疷TF-8缂栫爜浜嗐��
Windows灏辨槸浣跨敤BOM鏉ユ爣璁版枃鏈枃浠剁殑缂栫爜鏂瑰紡鐨勩��
6銆佽繘涓�姝ョ殑鍙傝�冭祫鏂�
鏈枃涓昏鍙傝�冪殑璧勬枡鏄� "Short overview of ISO-IEC 10646 and Unicode" ( http://ww w.nada.kth.se/i18n/ucs/unicode-iso10646-oview.html )銆�
鎴戣繕鎵句簡涓ょ瘒鐪嬩笂鍘讳笉閿欑殑璧勬枡锛屼笉杩囧洜涓烘垜寮�濮嬬殑鐤戦棶閮芥壘鍒颁簡绛旀锛屾墍浠ュ氨娌℃湁鐪嬶細
"Understanding Unicode A general introduction to the Unicode Standard" ( http: //scripts.sil.org/cms/scripts/page.php?site_id=nrsi&item_id=IWS-Chapter04a ) " Character set encoding basics Understanding character set encodings and legacy encodings" ( http://scripts.sil.org/cms/scripts/page.php?site_id=nrsi&item_id =IWS-Chapter03 ) 鎴戝啓杩嘦TF-8銆乁CS-2銆丟BK鐩镐簰杞崲鐨勮蒋浠跺寘锛屽寘鎷娇鐢╓indows API鍜� 涓嶄娇鐢╓indows API鐨勭増鏈�備互鍚庢湁鏃堕棿鐨勮瘽锛屾垜浼氭暣鐞嗕竴涓嬫斁鍒版垜鐨勪釜浜轰富椤典笂( http: //fmddlmyy.home4u.china.com )銆�
闄勫綍1 鍐嶈璇村尯浣嶇爜銆丟B2312銆佸唴鐮佸拰浠g爜椤�
鏈夌殑鏈嬪弸瀵规枃绔犱腑杩欏彞璇濊繕鏈夌枒闂細 鈥淕B2312鐨勫師鏂囪繕鏄尯浣嶇爜锛屼粠鍖轰綅鐮佸埌鍐呯爜锛岄渶瑕� 鍦ㄩ珮瀛楄妭鍜屼綆瀛楄妭涓婂垎鍒姞涓夾0銆傗��
鎴戝啀璇︾粏瑙i噴涓�涓嬶細
鈥淕B2312鐨勫師鏂団�濇槸鎸囧浗瀹�1980骞寸殑涓�涓爣鍑嗐�婁腑鍗庝汉姘戝叡鍜屽浗鍥藉鏍囧噯 淇℃伅浜ゆ崲鐢ㄦ眽瀛� 缂栫爜瀛楃闆� 鍩烘湰闆� GB 2312-80銆嬨�傝繖涓爣鍑嗙敤涓や釜鏁版潵缂栫爜姹夊瓧鍜屼腑鏂囩鍙枫�傜涓�涓暟 绉颁负鈥滃尯鈥濓紝绗簩涓暟绉颁负鈥滀綅鈥濄�傛墍浠ヤ篃绉颁负鍖轰綅鐮併��1-9鍖烘槸涓枃绗﹀彿锛�16-55 鍖烘槸涓� 绾ф眽瀛楋紝56-87鍖烘槸浜岀骇姹夊瓧銆傜幇鍦╓indows涔熻繕鏈夊尯浣嶈緭鍏ユ硶锛屼緥濡傝緭鍏�1601寰楀埌鈥滃晩鈥� 銆�
鍐呯爜鏄寚鎿嶄綔绯荤粺鍐呴儴鐨勫瓧绗︾紪鐮併�傛棭鏈熸搷浣滅郴缁熺殑鍐呯爜鏄笌璇█鐩稿叧鐨�.鐜板湪鐨刉indows 鍦ㄥ唴閮ㄧ粺涓�浣跨敤Unicode锛岀劧鍚庣敤浠g爜椤甸�傚簲鍚勭璇█,鈥滃唴鐮佲�濈殑姒傚康灏辨瘮杈冩ā绯婁簡銆傚井 杞竴鑸皢缂虹渷浠g爜椤垫寚瀹氱殑缂栫爜璇存垚鏄唴鐮侊紝鍦ㄧ壒娈婄殑鍦哄悎涔熶細璇磋嚜宸辩殑鍐呯爜鏄疷nicode锛� 渚嬪鍦� GB18030闂鐨勫鐞嗕笂銆�
鎵�璋撲唬鐮侀〉(code page)灏辨槸閽堝涓�绉嶈瑷�鏂囧瓧鐨勫瓧绗︾紪鐮併�備緥濡侴BK鐨刢ode page鏄疌P936 锛孊IG5鐨刢ode page鏄疌P950锛孏B2312鐨刢ode page鏄疌P20936銆�
Windows涓湁缂虹渷浠g爜椤电殑姒傚康锛屽嵆缂虹渷鐢ㄤ粈涔堢紪鐮佹潵瑙i噴瀛楃銆備緥濡俉indows鐨勮浜嬫湰鎵� 寮�浜嗕竴涓枃鏈枃浠讹紝閲岄潰鐨勫唴瀹规槸瀛楄妭娴侊細BA銆丅A銆丏7銆丏6銆俉indows搴旇鍘绘�庝箞瑙i噴瀹冨憿 锛�
鏄寜鐓nicode缂栫爜瑙i噴銆佽繕鏄寜鐓BK瑙i噴銆佽繕鏄寜鐓IG5瑙i噴锛岃繕鏄寜鐓SO8859-1鍘昏В 閲婏紵濡傛灉鎸塆BK鍘昏В閲婏紝灏变細寰楀埌鈥滄眽瀛椻�濅袱涓瓧銆傛寜鐓у叾瀹冪紪鐮佽В閲婏紝鍙兘鎵句笉鍒板搴旂殑 瀛楃锛屼篃鍙兘鎵惧埌閿欒鐨勫瓧绗︺�傛墍璋撯�滈敊璇�濇槸鎸囦笌鏂囨湰浣滆�呯殑鏈剰涓嶇锛岃繖鏃跺氨浜х敓浜� 涔辩爜銆�
绛旀鏄疻indows鎸夌収褰撳墠鐨勭己鐪佷唬鐮侀〉鍘昏В閲婃枃鏈枃浠堕噷鐨勫瓧鑺傛祦銆傜己鐪佷唬鐮侀〉鍙互閫氳繃鎺� 鍒堕潰鏉跨殑鍖哄煙閫夐」璁剧疆銆傝浜嬫湰鐨勫彟瀛樹负涓湁涓�椤笰NSI锛屽叾瀹炲氨鏄寜鐓х己鐪佷唬鐮侀〉鐨勭紪鐮� 鏂规硶淇濆瓨銆�
Windows鐨勫唴鐮佹槸Unicode锛屽畠鍦ㄦ妧鏈笂鍙互鍚屾椂鏀寔澶氫釜浠g爜椤点�傚彧瑕佹枃浠惰兘璇存槑鑷繁浣� 鐢ㄤ粈涔堢紪鐮侊紝鐢ㄦ埛鍙堝畨瑁呬簡瀵瑰簲鐨勪唬鐮侀〉锛學indows灏辫兘姝g‘鏄剧ず锛屼緥濡傚湪HTML鏂囦欢涓氨鍙� 浠ユ寚瀹歝harset銆�
鏈夌殑HTML鏂囦欢浣滆�咃紝鐗瑰埆鏄嫳鏂囦綔鑰咃紝璁や负涓栫晫涓婃墍鏈変汉閮戒娇鐢ㄨ嫳鏂囷紝鍦ㄦ枃浠朵腑涓嶆寚瀹歝h arset銆傚鏋滀粬浣跨敤浜�0x80-0xff涔嬮棿鐨勫瓧绗︼紝涓枃Windows鍙堟寜鐓х己鐪佺殑GBK鍘昏В閲婏紝灏变細 鍑虹幇涔辩爜銆傝繖鏃跺彧瑕佸湪杩欎釜html鏂囦欢涓姞涓婃寚瀹歝harset鐨勮鍙ワ紝渚嬪锛氬鏋滃師浣滆�呬娇鐢ㄧ殑 浠g爜椤靛拰ISO8859-1鍏煎锛屽氨涓嶄細鍑虹幇涔辩爜浜嗐��
鍐嶈鍖轰綅鐮侊紝鍟婄殑鍖轰綅鐮佹槸1601锛屽啓鎴�16杩涘埗鏄�0x10,0x01銆傝繖鍜岃绠楁満骞挎硾浣跨敤鐨凙SCII 缂栫爜鍐茬獊銆備负浜嗗吋瀹�00-7f鐨� ASCII缂栫爜锛屾垜浠湪鍖轰綅鐮佺殑楂樸�佷綆瀛楄妭涓婂垎鍒姞涓夾0銆傝繖鏍� 鈥滃晩鈥濈殑缂栫爜灏辨垚涓築0A1銆傛垜浠皢鍔犺繃涓や釜A0鐨勭紪鐮佷篃绉颁负GB2312缂栫爜锛岃櫧鐒� GB2312鐨勫師 鏂囨牴鏈病鎻愬埌杩欎竴鐐广��
鏈枃閾炬帴