鍒濊瘑decode锛堬級鍜宔ncode锛堬級

鍏抽敭璇嶏細decode锛堬級 聽 聽 聽encode锛堬級

缂栫爜鏍煎紡鐨勬紨鍙�

ASCII鐮�

鏄編鍥芥棭鏈熷埗瀹氱殑缂栫爜瑙勮寖锛屽彧鑳�琛ㄧず128涓瓧绗�锛屽寘鎷嫳鏂囧瓧绗︺�侀樋鎷変集鏁板瓧銆佽タ鏂囧瓧绗︿互鍙�32涓帶鍒跺瓧绗︺��

GB2312

GB2312灏辨槸鍦ˋSCII鍩虹涓婄殑绠�浣撴眽瀛�鎵╁睍銆�

GBK

GBK鏄GB2312鐨勮繘涓�姝ユ墿灞曪紙K鏄眽璇嫾闊砶uo zhan锛堟墿灞曪級涓�滄墿鈥濆瓧鐨勫0姣嶏級锛� 鏀跺綍浜�21886涓眽瀛楀拰绗﹀彿锛屽畬鍏ㄥ吋瀹笹B2312銆�

GB18030

GB18030鏀跺綍浜�70244涓眽瀛楀拰瀛楃锛屾洿鍔犲叏闈紝涓� GB 2312-1980 鍜� GBK 鍏煎銆偮�

GB18030鏀寔灏戞暟姘戞棌鐨勬眽瀛楋紝涔熷寘鍚簡绻佷綋姹夊瓧鍜屾棩闊╂眽瀛椼�偮�

Unicode

鍑嗙‘鏉ヨ锛孶nicode涓嶆槸缂栫爜鏍煎紡锛岃�屾槸瀛楃闆嗐�傝繖涓瓧绗﹂泦鍖呭惈浜嗕笘鐣屼笂鐩墠鎵�鏈夌殑绗﹀彿銆�

UTF锛圲CS Transfer Format锛夋槸鍦ㄤ簰鑱旂綉涓婁娇鐢ㄦ渶骞跨殑涓�绉峌nicode鐨勫疄鐜版柟寮忋�傛垜浠渶甯哥敤鐨勬槸UTF-8鍜孶TF-16銆�

decode 锛堬級鍜� encode锛堬級 鍖哄埆

涓轰粈涔堜細鎶ラ敊鈥淯nicodeEncodeError:'ascii' codec can't encode characters in position 0-1: ordinal notin range(128)鈥濓紵鏈枃灏辨潵鐮旂┒涓�涓嬭繖涓棶棰樸��

瀛楃涓插湪Python鍐呴儴鐨勮〃绀烘槸unicode缂栫爜锛屽洜姝わ紝鍦ㄥ仛缂栫爜杞崲鏃讹紝閫氬父闇�瑕佷互unicode浣滀负涓棿缂栫爜锛屽嵆鍏堝皢鍏朵粬缂栫爜鐨勫瓧绗︿覆瑙g爜锛坉ecode锛夋垚unicode锛屽啀浠巙nicode缂栫爜锛坋ncode锛夋垚鍙︿竴绉嶇紪鐮併��


decode鐨勪綔鐢ㄦ槸灏嗗叾浠栫紪鐮佺殑瀛楃涓茶浆鎹㈡垚unicode缂栫爜锛屽str1.decode('gb2312')锛岃〃绀哄皢gb2312缂栫爜鐨勫瓧绗︿覆str1杞崲鎴恥nicode缂栫爜銆�

encode鐨勪綔鐢ㄦ槸灏唘nicode缂栫爜杞崲鎴愬叾浠栫紪鐮佺殑瀛楃涓诧紝濡俿tr2.encode('gb2312')锛岃〃绀哄皢unicode缂栫爜鐨勫瓧绗︿覆str2杞崲鎴恎b2312缂栫爜銆�


鍥犳锛岃浆鐮佺殑鏃跺�欎竴瀹氳鍏堟悶鏄庣櫧锛屽瓧绗︿覆str鏄粈涔堢紪鐮侊紝鐒跺悗decode鎴恥nicode锛岀劧鍚庡啀encode鎴愬叾浠栫紪鐮�

浠g爜涓瓧绗︿覆鐨勯粯璁ょ紪鐮佷笌浠g爜鏂囦欢鏈韩鐨勭紪鐮佷竴鑷淬��

濡傦細s='涓枃'

濡傛灉鏄湪utf8鐨勬枃浠朵腑锛岃瀛楃涓插氨鏄痷tf8缂栫爜锛屽鏋滄槸鍦╣b2312鐨勬枃浠朵腑锛屽垯鍏剁紪鐮佷负gb2312銆傝繖绉嶆儏鍐典笅锛岃杩涜缂栫爜杞崲锛岄兘闇�瑕佸厛鐢╠ecode鏂规硶灏嗗叾杞崲鎴恥nicode缂栫爜锛屽啀浣跨敤encode鏂规硶灏嗗叾杞崲鎴愬叾浠栫紪鐮併�傞�氬父锛屽湪娌℃湁鎸囧畾鐗瑰畾鐨勭紪鐮佹柟寮忔椂锛岄兘鏄娇鐢ㄧ殑绯荤粺榛樿缂栫爜鍒涘缓鐨勪唬鐮佹枃浠躲��

濡傛灉瀛楃涓叉槸杩欐牱瀹氫箟锛歴=u'涓枃'

鍒欒瀛楃涓茬殑缂栫爜灏辫鎸囧畾涓簎nicode浜嗭紝鍗硃ython鐨勫唴閮ㄧ紪鐮侊紝鑰屼笌浠g爜鏂囦欢鏈韩鐨勭紪鐮佹棤鍏炽�傚洜姝わ紝瀵逛簬杩欑鎯呭喌鍋氱紪鐮佽浆鎹紝鍙渶瑕佺洿鎺ヤ娇鐢╡ncode鏂规硶灏嗗叾杞崲鎴愭寚瀹氱紪鐮佸嵆鍙��

濡傛灉涓�涓瓧绗︿覆宸茬粡鏄痷nicode浜嗭紝鍐嶈繘琛岃В鐮佸垯灏嗗嚭閿欙紝鍥犳閫氬父瑕佸鍏剁紪鐮佹柟寮忔槸鍚︿负unicode杩涜鍒ゆ柇锛�

isinstance(s,unicode)#鐢ㄦ潵鍒ゆ柇鏄惁涓簎nicode

鐢ㄩ潪unicode缂栫爜褰㈠紡鐨剆tr鏉ncode浼氭姤閿�

濡備綍鑾峰緱绯荤粺鐨勯粯璁ょ紪鐮侊紵

#!/usr/bin/env python

#coding=utf-8

import sys

printsys.getdefaultencoding()

璇ユ绋嬪簭鍦ㄨ嫳鏂嘩indowsXP涓婅緭鍑轰负锛歛scii

璇ユ绋嬪簭鍦ㄨ嫳鏂嘩indows7涓婅緭鍑轰负锛歮bcs

鍦ㄦ煇浜汭DE涓紝瀛楃涓茬殑杈撳嚭鎬绘槸鍑虹幇涔辩爜锛岀敋鑷抽敊璇紝鍏跺疄鏄敱浜嶪DE鐨勭粨鏋滆緭鍑烘帶鍒跺彴鑷韩涓嶈兘鏄剧ず瀛楃涓茬殑缂栫爜锛岃�屼笉鏄▼搴忔湰韬殑闂銆�

濡傚湪UliPad涓繍琛屽涓嬩唬鐮侊細

s=u"涓枃"

print s

浼氭彁绀猴細UnicodeEncodeError:'ascii' codec can't encode characters in position 0-1: ordinal notinrange(128)銆傝繖鏄洜涓篣liPad鍦ㄨ嫳鏂嘩indowsXP涓婄殑鎺у埗鍙颁俊鎭緭鍑虹獥鍙f槸鎸夌収ascii缂栫爜杈撳嚭鐨勶紙鑻辨枃绯荤粺鐨勯粯璁ょ紪鐮佹槸ascii锛夛紝鑰屼笂闈唬鐮佷腑鐨勫瓧绗︿覆鏄疷nicode缂栫爜鐨勶紝鎵�浠ヨ緭鍑烘椂浜х敓浜嗛敊璇��

灏嗘渶鍚庝竴鍙ユ敼涓猴細prints.encode('gb2312')

鍒欒兘姝g‘杈撳嚭鈥滀腑鏂団�濅袱涓瓧銆�

鑻ユ渶鍚庝竴鍙ユ敼涓猴細prints.encode('utf8')

鍒欒緭鍑猴細\xe4\xb8\xad\xe6\x96\x87锛岃繖鏄帶鍒跺彴淇℃伅杈撳嚭绐楀彛鎸夌収ascii缂栫爜杈撳嚭utf8缂栫爜鐨勫瓧绗︿覆鐨勭粨鏋溿��

unicode(str,'gb2312')涓巗tr.decode('gb2312')鏄竴鏍风殑锛岄兘鏄皢gb2312缂栫爜鐨剆tr杞负unicode缂栫爜

浣跨敤str.__class__鍙互鏌ョ湅str鐨勭紪鐮佸舰寮�

>>>>>

groups.google.com/group/python-cn/browse_thread/thread/be4e4e0d4c3272dd

-----

python鏄釜瀹规槗鍑虹幇缂栫爜闂鐨勮瑷�銆傛墍浠ワ紝鎴戞寜鐓ф垜鐨勭悊瑙e啓涓嬩笅闈㈣繖浜涙枃瀛椼��

=棣栧厛锛岃浜嗚В鍑犱釜姒傚康銆�=

*瀛楄妭锛氳绠楁満鏁版嵁鐨勮〃绀恒��8浣嶄簩杩涘埗銆傚彲浠ヨ〃绀烘棤绗﹀彿鏁存暟锛�0-255銆備笅鏂囷紝鐢ㄢ�滃瓧鑺傛祦鈥濊〃绀衡�滃瓧鑺傗�濈粍鎴愮殑涓层��

*瀛楃锛氳嫳鏂囧瓧绗︹�渁bc鈥濓紝鎴栬�呬腑鏂囧瓧绗︹�滀綘鎴戜粬鈥濄�傚瓧绗︽湰韬笉鐭ラ亾濡備綍鍦ㄨ绠楁満涓繚瀛樸�備笅鏂囦腑锛屼細閬垮厤浣跨敤鈥滃瓧绗︿覆鈥濊繖涓瘝锛岃�岀敤鈥滄枃鏈�濇潵琛�

绀衡�滃瓧绗︹�濈粍鎴愮殑涓层��

*缂栫爜锛堝姩璇嶏級锛氭寜鐓ф煇绉嶈鍒欙紙杩欎釜瑙勫垯绉颁负锛氱紪鐮侊紙鍚嶈瘝锛夛級灏嗏�滄枃鏈�濊浆鎹负鈥滃瓧鑺傛祦鈥濄�傦紙鍦╬ython涓細unicode鍙樻垚str锛�

*瑙g爜锛堝姩璇嶏級锛氬皢鈥滃瓧鑺傛祦鈥濇寜鐓ф煇绉嶈鍒欒浆鎹㈡垚鈥滄枃鏈�濄�傦紙鍦╬ython涓細str鍙樻垚unicode锛�

**瀹為檯涓婏紝浠讳綍涓滆タ鍦ㄨ绠楁満涓〃绀猴紝閮介渶瑕佺紪鐮併�備緥濡傦紝瑙嗛瑕佺紪鐮佺劧鍚庝繚瀛樺湪鏂囦欢涓紝鎾斁鐨勬椂鍊欓渶瑕佽В鐮佹墠鑳借鐪嬨��

unicode锛歶nicode瀹氫箟浜嗭紝涓�涓�滃瓧绗︹�濆拰涓�涓�滄暟瀛椻�濈殑瀵瑰簲锛屼絾鏄苟娌℃湁瑙勫畾杩欎釜鈥滄暟瀛椻�濆湪璁$畻鏈轰腑鎬庝箞淇濆瓨銆傦紙灏卞儚鍦–涓紝涓�涓暣鏁版棦

鍙互鏄痠nt锛屼篃鍙互鏄痵hort銆倁nicode娌℃湁瑙勫畾鐢╥nt杩樻槸鐢╯hort鏉ヨ〃绀轰竴涓�滃瓧绗︹�濓級

utf8锛歶nicode瀹炵幇銆傚畠浣跨敤unicode瀹氫箟鐨勨�滃瓧绗︹�濃�滄暟瀛椻�濇槧灏勶紝杩涜�岃瀹氫簡锛屽浣曞湪璁$畻鏈轰腑淇濆瓨杩欎釜鏁板瓧銆傚叾瀹冪殑utf16绛夐兘鏄�

unicode瀹炵幇銆�

gbk锛氱被浼紆tf8杩欐牱鐨勨�滅紪鐮佲�濄�備絾鏄畠娌℃湁浣跨敤unicode瀹氫箟鐨勨�滃瓧绗︹�濃�滄暟瀛椻�濇槧灏勶紝鑰屾槸浣跨敤浜嗗彟涓�濂楃殑鏄犲皠鏂规硶銆傝�屼笖锛屽畠杩樺畾涔変簡濡備綍鍦�

璁$畻鏈轰腑淇濆瓨銆�

=python涓殑encode锛宒ecode鏂规硶=

棣栧厛锛岃鐭ラ亾encode鏄� unicode杞崲鎴恠tr銆俤ecode鏄痵tr杞崲鎴恥nicode銆�

涓嬫枃涓紝u浠h〃unicode绫诲瀷鐨勫彉閲忥紝s浠h〃str绫诲瀷鐨勫彉閲忋��

u.encode('...')鍩烘湰涓婃�绘槸鑳芥垚鍔熺殑锛屽彧瑕佷綘濉啓浜嗘纭殑缂栫爜銆傚氨鍍忎换浣曟枃浠堕兘鍙互鍘嬬缉鎴恴ip鏂囦欢銆�

s.decode('...')缁忓父鏄細鍑洪敊鐨勶紝鍥犱负str鏄粈涔堚�滅紪鐮佲�濆彇鍐充簬涓婁笅鏂囷紝褰撲綘瑙g爜鐨勬椂鍊欓渶瑕佺‘淇漵鏄敤浠�涔堢紪鐮佺殑銆傚氨鍍忥紝鎵撳紑zip鏂�

浠剁殑鏃跺�欙紝浣犺纭繚瀹冪‘瀹炴槸zip鏂囦欢锛岃�屼笉浠呬粎鏄吉閫犱簡鎵╁睍鍚嶇殑zip鏂囦欢銆�

u.decode(),s.encode()涓嶅缓璁娇鐢紝s.encode鐩稿綋浜巗.decode().encode()棣栧厛鐢ㄩ粯璁ょ紪鐮侊紙涓�鑸槸

ascii锛夎浆鎹㈡垚unicode鍦ㄨ繘琛宔ncode銆�

=鍏充簬#coding=utf8=

褰撲綘鍦╬y鏂囦欢鐨勭涓�琛屼腑锛屽啓浜嗚繖鍙ヨ瘽锛屽苟纭疄鎸夌収杩欎釜缂栫爜淇濆瓨浜嗘枃鏈殑璇濓紝閭d箞杩欏彞璇濇湁浠ヤ笅鍑犱釜鍔熻兘銆�

1.浣垮緱璇嶆硶鍒嗘瀽鍣ㄨ兘姝e父杩愪綔锛屽浜庢敞閲婁腑鐨勪腑鏂囦笉鎶ラ敊浜嗐��

2.瀵逛簬u"涓枃"杩欐牱literal string鑳界煡閬撲袱涓紩鍙蜂腑鐨勫唴瀹规槸utf8缂栫爜鐨勶紝鐒跺悗鑳芥纭浆鎹㈡垚unicode

3."涓枃"瀵逛簬杩欐牱鐨刲iteralstring浣犱細鐭ラ亾锛岃繖涓棿鐨勫唴瀹规槸utf8缂栫爜锛岀劧鍚庡氨鍙互姝g‘杞崲鎴愬叾瀹冪紪鐮佹垨unicode浜嗐��

娌℃湁鍐欏畬锛屽厛鐮侀偅涔堝瀛楋紝浠ュ悗鍐嶆潵琛ュ厖锛岃繖閲屼笉鏄痺iki锛屽お楹荤儲浜嗐��

>>>>>

>>>>>

=Python缂栫爜鍜學indows鎺у埗鍙�=

鎴戝彂鐜帮紝寰堝鍒濆鑰呭嚭閿欑殑鍦版柟閮藉湪print璇彞锛岃繖鐗垫秹鍒版帶鍒跺彴鐨勮緭鍑恒�傛垜涓嶄簡瑙inux锛屾墍浠ュ彧璇存帶鍒跺彴鐨勩��

棣栧厛锛學indows鐨勬帶鍒跺彴纭疄鏄痷nicode锛坲tf16_le缂栫爜锛夌殑锛屾垨鑰呮洿鍑嗙‘鐨勮浣跨敤瀛楃涓哄崟浣嶈緭鍑烘枃鏈殑銆�

浣嗘槸锛岀▼搴忕殑鎵ц鏄彲浠ヨ閲嶅畾鍚戝埌鏂囦欢鐨勶紝鑰屾枃浠剁殑鍗曚綅鏄�滃瓧鑺傗�濄��

鎵�浠ワ紝瀵逛簬C杩愯鏃剁殑鍑芥暟printf涔嬬被鐨勶紝杈撳嚭蹇呴』鏈変竴涓紪鐮侊紝鎶婃枃鏈浆鎹㈡垚瀛楄妭銆傚彲鑳芥槸涓轰簡鍏煎95锛�98锛�

娌℃湁浣跨敤unicode鐨勭紪鐮侊紝鑰屾槸mbcs锛堜笉鏄痝bk涔嬬被鐨勶級銆�

windows鐨刴bcs锛屼篃灏辨槸ansi锛屽畠浼氬湪涓嶅悓璇█鐨剋indows涓娇鐢ㄤ笉鍚岀殑缂栫爜锛屽湪涓枃鐨剋indows涓氨鏄痝b绯诲垪鐨勭紪鐮併��

杩欓�犳垚浜嗗悓涓�涓枃鏈紝鍦ㄤ笉鍚岃瑷�鐨剋indows涓槸涓嶅吋瀹圭殑銆�

鐜板湪鎴戜滑鐭ラ亾浜嗭紝濡傛灉浣犺鍦╳indows鐨勬帶鍒跺彴涓緭鍑烘枃鏈紝瀹冪殑缂栫爜涓�瀹氳鏄�渕bcs鈥濄��

瀵逛簬python鐨剈nicode鍙橀噺锛屼娇鐢╬rint杈撳嚭鐨勮瘽锛屼細浣跨敤sys.getfilesystemencoding()杩斿洖鐨勭紪鐮侊紝鎶婂畠鍙樻垚str銆�

濡傛灉鏄竴涓猽tf8缂栫爜str鍙橀噺锛岄偅涔堝氨闇�瑕� prints.decode('utf8').encode('mbcs')

鏈�鍚庯紝瀵逛簬str鍙橀噺锛宖ile鏂囦欢璇诲彇鐨勫唴瀹癸紝urllib寰楀埌鐨勭綉缁滀笂鐨勫唴瀹癸紝閮芥槸浠モ�滃瓧鑺傗�濆舰寮忕殑銆�

瀹冧滑濡傛灉纭疄鏄竴娈碘�滄枃鏈�濓紝姣斿浣犳兂print鍑烘潵鐪嬬湅銆傞偅涔堜綘蹇呴』鐭ラ亾瀹冧滑鐨勭紪鐮併�傜劧鍚巇ecode鎴恥nicode銆�

濡備綍鐭ラ亾瀹冧滑鐨勭紪鐮侊細

1.浜嬪厛绾﹀畾銆傦紙姣斿杩欎釜鏂囨湰鏂囦欢灏辨槸浣犺嚜宸辩敤utf8缂栫爜淇濆瓨鐨勶級

2.鍗忚銆傦紙python鏂囦欢绗竴琛岀殑#coding=utf8锛宧tml涓殑绛夛級

2.鐚溿��

>>>>>

> 杩欎釜闈炲父濂斤紝浣嗚繕涓嶆槸寰堟槑鐧�

> 灏嗏�滄枃鏈�濊浆鎹负鈥滃瓧鑺傛祦鈥濄�傦紙鍦╬ython涓細unicode鍙樻垚str锛�

"鏈�鍚庯紝瀵逛簬str鍙橀噺锛宖ile鏂囦欢璇诲彇鐨勫唴瀹癸紝urllib寰楀埌鐨勭綉缁滀笂鐨勫唴瀹癸紝閮芥槸浠モ�滃瓧鑺傗�濆舰寮忕殑銆�"

铏界劧鏂囦欢鎴栬�呯綉椤垫槸鏂囨湰鐨�,浣嗘槸鍦ㄤ繚瀛樻垨鑰呬紶杈撴椂宸茬粡琚紪鐮佹垚bytes浜�,鎵�浠ョ敤"rb"鎵撳紑鐨刦ile鍜屼粠socket璇诲彇鐨勬祦鏄熀浜庡瓧鑺傜殑.

"瀹冧滑濡傛灉纭疄鏄竴娈碘�滄枃鏈�濓紝姣斿浣犳兂print鍑烘潵鐪嬬湅銆傞偅涔堜綘蹇呴』鐭ラ亾瀹冧滑鐨勭紪鐮併�傜劧鍚巇ecode鎴恥nicode銆�"

杩欓噷鐨勫姞寮曞彿鐨�"鏂囨湰",鍏跺疄杩樻槸瀛楄妭娴�(bytes),鑰屼笉鏄湡姝g殑鏂囨湰(unicode),鍙槸璇存槑鎴戜滑鐭ラ亾浠栨槸鍙互瑙g爜鎴愭枃鏈殑.

鍦ㄨВ鐮佺殑鏃跺��,濡傛灉鏄熀浜庣害瀹氱殑,閭e氨鍙互鐩存帴浠庢寚瀹氬湴鏂硅鍙栧BOM鎴栬�卲ython鏂囦欢鐨勬寚瀹歝oding鎴栬�呯綉椤电殑meta,灏卞彲浠ユ纭В鐮�,

浣嗘槸鐜板湪寰堝鏂囦欢/缃戦〉铏界劧鎸囧畾浜嗙紪鐮�,浣嗘槸鏂囦欢鏍煎紡瀹為檯鍗翠娇鐢ㄤ簡鍏朵粬鐨勭紪鐮�(姣斿py鏂囦欢鎸囧畾浜哻oding=utf8,浣嗘槸浣犺繕鏄彲浠ヤ繚瀛樻垚ansi--璁颁簨鏈殑榛樿缂栫爜),杩欑鎯呭喌涓嬬湡瀹炵殑缂栫爜灏遍渶瑕佸幓鐚滀簡

瑙g爜浜嗙殑鏂囨湰鍙瓨鍦ㄨ繍琛岀幆澧冧腑,濡傛灉浣犻渶瑕佹墦鍗�/淇濆瓨/杈撳嚭缁欐暟鎹簱/缃戠粶浼犻��,灏卞張闇�瑕佷竴娆$紪鐮佽繃绋�,杩欎釜缂栫爜涓庝笂闈㈢殑缂栫爜娌℃湁鍏崇郴,鍙槸渚濊禆浜庝綘鐨勯�夋嫨,浣嗘槸杩欎釜缂栫爜涔熶笉鏄彲浠ラ殢渚块�夋嫨鐨�,鍥犱负缂栫爜鍚庣殑bytes濡傛灉鍙堥渶瑕佷紶閫掔粰鍏朵粬浜�/鐜,閭d箞濡傛灉浣犵殑缂栫爜涔熶笉閬靛惊绾﹀畾,鍙堢粰涓嬩竴涓汉/鐜閫犳垚浜嗗洶鎵�,浜庢槸閫掑綊涔媬~~~

>>>>>

> 涓昏鏈変竴鏉¢潪甯稿鏄撹瑙o細

>涓�鑸汉浼氳涓篣nicode锛堝箍涔夛級缁熶竴浜嗙紪鐮侊紝鍏跺疄涓嶇劧銆俇nicode涓嶆槸鍞竴鐨勭紪鐮侊紝鑰屼竴澶у爢缂栫爜鐨勭粺绉般�備絾鏄疻indows涓婾nicode

> 锛堢嫮涔夛級涓�鑸壒鎸嘦CS2锛屼篃灏辨槸UTF-16/LE

unicode浣滀负瀛楃闆�(ucs)鏄敮涓�鐨�,缂栫爜鏂规(utf)鎵嶆槸鏈夊緢澶氱

>>>>>

灏嗗瓧绗︿笌瀛楄妭鐨勬蹇靛尯鍒嗗紑鏉ユ槸寰堥噸瑕佺殑銆侸ava涓�鐩村氨鏄繖鏍凤紝Python涔熷紑濮嬭繖涔堝仛浜嗭紝Ruby璨屼技杩樺湪娣蜂贡褰撲腑銆�

>>>>>

>>>>>

鎴戜篃璇翠袱鍙ャ�傛垜瀵圭紪鐮佺殑鐮旂┒鐩稿姣旇緝娣变竴浜涖�傚洜涓哄伐浣滀腑涔熺粡甯搁亣鍒颁贡鐮侊紝浜庢槸鍦�05骞达紝瀵圭紪鐮佷笓闂ㄥ仛杩囩爺绌讹紝骞跺湪鍏徃鍒婄墿涓婂彂杩囨枃绔狅紝鏈�鍚庡舰鎴愪簡涓�涓暀鏉愶紝姣忓勾鍦ㄥ叕鍙哥粰鏂板憳宸ラ兘璁蹭竴閬嶃�備簬鏄」鐩腑閬囧埌涔辩爜鐨勯棶棰樺氨鑳藉緢蹇殑瀹氫綅骞惰В鍐充簡銆�

鐞嗚涓婏紝浠庝竴涓瓧绗﹀埌鍏蜂綋鐨勭紪鐮侊紝浼氱粡杩囦互涓嬪嚑涓蹇点��

瀛楃闆嗭紙Abstract character repertoire锛�

缂栫爜瀛楃闆嗭紙Coded character set锛�

瀛楃缂栫爜鏂瑰紡锛圕haracter encoding form锛�

瀛楃缂栫爜鏂规锛圕haracter encoding scheme 锛�

瀛楃闆嗭細灏辩畻涓�鍫嗘娊璞$殑瀛楃锛屽鎵�鏈変腑鏂囥�傚瓧绗﹂泦鐨勫畾涔夋槸鎶借薄鐨勶紝涓庤绠楁満鏃犲叧銆�

缂栫爜瀛楃闆嗭細鏄竴涓粠鏁存暟闆嗗瓙闆嗗埌瀛楃闆嗘娊璞″厓绱犵殑鏄犲皠銆傚嵆缁欐娊璞$殑瀛楃缂栦笂鏁板瓧銆傚gb2312涓殑瀹氫箟鐨勫瓧绗︼紝姣忎釜瀛楃閮芥湁涓暣鏁板拰瀹冨搴斻�備竴涓暣鏁板彧瀵瑰簲鐫�涓�涓瓧绗︺�傚弽杩囨潵锛屽垯涓嶄竴瀹氭槸銆傝繖閲屾墍璇寸殑鏄犲皠鍏崇郴锛屾槸鏁板鎰忎箟涓婄殑鏄犲皠鍏崇郴銆傜紪鐮佸瓧绗﹂泦涔熸槸涓庤绠楁満鏃犲叧鐨勩�倁nicode瀛楃闆嗕篃鍦ㄨ繖涓�灞傘��

瀛楃缂栫爜鏂瑰紡锛氳繖涓紑濮嬩笌璁$畻鏈烘湁鍏充簡銆傜紪鐮佸瓧绗﹂泦鐨勭紪鐮佺偣鍦ㄨ绠楁満閲岀殑鍏蜂綋琛ㄧ幇褰㈠紡銆傞�氫織鐨勮锛屾剰鎬濆氨鏄�庝箞鏍锋墠鑳藉皢瀛楃鎵�瀵瑰簲鐨勬暣鏁扮殑鏀捐繘璁$畻鏈哄唴瀛橈紝鎴栨枃浠躲�佹垨缃戠粶涓�備簬鏄紝涓嶅悓浜烘湁涓嶅悓鐨勫疄鐜版柟寮忥紝鎵�璋撶殑涓囩爜濂旇吘锛屽氨鏄寚杩欎釜銆俫b2312锛寀tf-8,utf-16,utf-32绛夐兘鍦ㄨ繖涓�灞傘��

瀛楃缂栫爜鏂规锛氳繖涓洿鍔犱笌璁$畻鏈哄瘑鍒囩浉鍏炽�傚叿浣撴槸涓庢搷浣滅郴缁熷瘑鍒囩浉鍏炽�備富瑕佹槸瑙e喅澶у皬瀛楄妭搴忕殑闂銆傚浜嶶TF-16鍜孶TF-32

缂栫爜锛孶nicode閮芥敮鎸乥ig-endian 鍜� little-endian涓ょ缂栫爜鏂规銆�

涓�鑸潵璇达紝鎴戜滑鎵�璇寸殑缂栫爜锛岄兘鍦ㄧ涓夊眰瀹屾垚銆傚叿浣撳埌涓�涓蒋浠剁郴缁熶腑锛屽垯寰堝鏉傘��

娴忚鍣紞apache锛峵omcat锛堝寘鎷瑃omcat鍐呴儴鐨刯sp缂栫爜銆佺紪璇戯紝鏂囦欢璇诲彇锛夛紞鏁版嵁搴撲箣闂达紝鍙瀛樺湪鏁版嵁浜や簰锛屽氨鏈夊彲鑳藉彂鐢熺紪鐮佷笉涓�鑷达紝濡傛灉鍦ㄨ鍙栨暟鎹椂锛屾病鏈夋纭殑decode鍜宔ncode锛屽嚭鐜颁贡鐮佸氨鏄甯镐究楗簡銆�



鍙傝�冿細

https://blog.csdn.net/m0_38080253/article/details/78841280

https://blog.csdn.net/qq_40134903/article/details/80710882

你可能感兴趣的:(鍒濊瘑decode锛堬級鍜宔ncode锛堬級)