爬虫:scrapy, beautiful soup

自然语言处理:nltk, Pattern (Google, Twitter, and Wikipedia APIs, a web crawler, a HTML DOM parser), 结巴分词

科学计算:NumPy, SciPy, matplotlib

机器学习、数据挖掘:scikit-learn, pandas, MDP (neural networks), PyBrain (neural networks), Theano (GPU, deep learning)


来源:

1. http://www.52nlp.cn/python-网页爬虫-文本处理-科学计算-机器学习-数据挖掘

2. python机器学习库 http://qxde01.blog.163.com/blog/static/67335744201368101922991/