爬虫:scrapy, beautiful soup
自然语言处理:nltk, Pattern (Google, Twitter, and Wikipedia APIs, a web crawler, a HTML DOM parser), 结巴分词
科学计算:NumPy, SciPy, matplotlib
机器学习、数据挖掘:scikit-learn, pandas, MDP (neural networks), PyBrain (neural networks), Theano (GPU, deep learning)
来源:
1. http://www.52nlp.cn/python-网页爬虫-文本处理-科学计算-机器学习-数据挖掘
2. python机器学习库 http://qxde01.blog.163.com/blog/static/67335744201368101922991/