Python Notes 3: jieba Word Segmentation

Feature 1): Word segmentation

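The jieba.cut() method takes two main input parameters: 1) the first is the string to segment; 2) cut_all controls whether full mode is used (it defaults to False, i.e. accurate mode).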
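To give a feel for what "full mode" means, here is a toy sketch that lists every dictionary word found anywhere in the sentence, including overlapping ones. It is only an illustration and not how jieba is implemented; `TOY_DICT` and `toy_full_mode` are hypothetical names, and the seven-word dictionary is hand-picked for this sentence.

```python
# Toy illustration only: this is NOT jieba's implementation.
# TOY_DICT is a hypothetical seven-word dictionary chosen for this example.
TOY_DICT = {"我", "来到", "北京", "清华", "清华大学", "华大", "大学"}


def toy_full_mode(sentence, dictionary):
    """Return every dictionary word found in the sentence, scanning left to right."""
    longest = max(len(word) for word in dictionary)
    found = []
    for start in range(len(sentence)):
        # Try every candidate length at this start position, shortest first.
        for end in range(start + 1, min(start + longest, len(sentence)) + 1):
            candidate = sentence[start:end]
            if candidate in dictionary:
                found.append(candidate)
    return found


print("/ ".join(toy_full_mode("我来到北京清华大学", TOY_DICT)))
```

Accurate mode, by contrast, keeps a single non-overlapping split of the sentence, which is why overlapping candidates such as 清华 and 华大 do not appear in its result.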

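The jieba.cut_for_search() method takes one parameter: the string to segment. It is suited to segmenting text when a search engine builds an inverted index, and its granularity is relatively fine.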

Note: the string to be segmented can be a GBK string, a UTF-8 string, or unicode.
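If the input arrives as raw bytes, one option in Python 3 is to normalize it to str before segmenting. The sketch below assumes the bytes are either UTF-8 or GBK; `to_text` is a hypothetical helper written for this note, not part of jieba.

```python
def to_text(data):
    """Return str for input that may be str, UTF-8 bytes, or GBK bytes.

    Hypothetical helper for this note; not part of jieba.
    """
    if isinstance(data, str):
        return data
    try:
        return data.decode("utf-8")
    except UnicodeDecodeError:
        # Assumption: anything that is not valid UTF-8 is GBK.
        return data.decode("gbk")


sentence = "我来到北京清华大学"
print(to_text(sentence.encode("gbk")) == sentence)
print(to_text(sentence.encode("utf-8")) == sentence)
```

Trying UTF-8 first is a heuristic: a short GBK byte string can occasionally also be valid UTF-8, so when the encoding is known it is safer to decode explicitly.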

Both jieba.cut() and jieba.cut_for_search() return an iterable generator. You can use a for loop to get each word (unicode) produced by segmentation, or convert the result to a list with list(jieba.cut(...)).
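Because the result is a generator, it can be consumed only once. The sketch below shows this with a stand-in generator so that it runs without jieba installed; `fake_cut` is a hypothetical function, and its tokens are copied from the accurate-mode result in this note.

```python
def fake_cut(tokens):
    """Stand-in for a segmentation call: yields tokens lazily, like a generator result."""
    for token in tokens:
        yield token


# Tokens copied from the accurate-mode result in this note.
tokens = ["我", "来到", "北京", "清华大学"]

seg_gen = fake_cut(tokens)
first_pass = "/ ".join(seg_gen)   # consumes the generator
second_pass = "/ ".join(seg_gen)  # nothing left to yield

seg_list = list(fake_cut(tokens))  # materialize once, reuse freely
print(first_pass)
print(repr(second_pass))
print(len(seg_list), seg_list[-1])
```

If the words are needed more than once (for example, to print them and then count them), converting to a list first avoids silently getting an empty second pass.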

Code example (segmentation)

# -*- coding: utf-8 -*-

import jieba

seg_list = jieba.cut("我来到北京清华大学", cut_all=True)

print("Full Mode: " + "/ ".join(seg_list))  # full mode

seg_list = jieba.cut("我来到北京清华大学", cut_all=False)

print("Default Mode: " + "/ ".join(seg_list))  # accurate mode

seg_list = jieba.cut("他来到了网易杭研大厦")  # accurate mode is the default

print(", ".join(seg_list))

seg_list = jieba.cut_for_search("小明硕士毕业于中国科学院计算所,后在日本京都大学深造")  # search engine mode

print(", ".join(seg_list))

Output:

Full mode: 我/ 来到/ 北京/ 清华/ 清华大学/ 华大/ 大学

Accurate mode: 我/ 来到/ 北京/ 清华大学

Default mode (i.e. accurate mode): 他, 来到, 了, 网易, 杭研, 大厦

Search mode: 小明, 硕士, 毕业, 于, 中国, 科学, 学院, 科学院, 中国科学院, 计算, 计算所, ,, 后, 在, 日本, 京都, 大学, 日本京都大学, 深造
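The fine-grained search-mode tokens are what make this mode useful for an inverted index. Here is a minimal sketch of such an index built from already-segmented documents. `docs` and `build_inverted_index` are hypothetical; document 1 reuses the search-mode tokens listed above (punctuation dropped) and document 2 is a hand-written token list, not jieba output.

```python
from collections import defaultdict

# Hypothetical document collection: doc id -> tokens already produced by a segmenter.
# Document 1 reuses the search-mode tokens listed above (punctuation dropped);
# document 2 is a hand-written token list for illustration.
docs = {
    1: ["小明", "硕士", "毕业", "于", "中国", "科学", "学院", "科学院",
        "中国科学院", "计算", "计算所", "后", "在", "日本", "京都",
        "大学", "日本京都大学", "深造"],
    2: ["我", "来到", "北京", "清华", "大学", "清华大学"],
}


def build_inverted_index(tokenized_docs):
    """Map each token to the sorted list of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, doc_tokens in tokenized_docs.items():
        for token in doc_tokens:
            index[token].add(doc_id)
    return {token: sorted(ids) for token, ids in index.items()}


index = build_inverted_index(docs)
print(index["大学"])
print(index["科学院"])
print(index.get("清华大学"))
```

Because both the long form (日本京都大学) and its shorter parts (日本, 京都, 大学) are indexed, a query for any of them can find document 1.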
