数据集网站(持续更新)

UCI数据集
    http://archive.ics.uci.edu/ml/datasets.php

Kaggle竞赛数据集
    https://www.kaggle.com/datasets

ImageNet
    http://image-net.org/

VisualData
    https://www.visualdata.io/

MS COCO
    http://mscoco.org/

Stanford CoreNLP
    http://nlp.stanford.edu/sentiment/code.html

IMDB
    http://ai.stanford.edu/~amaas/data/sentiment/

Sentiment140
    http://help.sentiment140.com/for-students/

HotspotQA
    https://hotpotqa.github.io/

Enron Email
    https://www.cs.cmu.edu/~./enron/

Amazon
    https://snap.stanford.edu/data/web-Amazon.html

百度Apolloscapes
    http://apolloscape.auto/

Berkeley DeepDrive
    http://bdd-data.berkeley.edu/

Robotcar
    http://robotcar-dataset.robots.ox.ac.uk/

Data.gov
    https://www.data.gov/

Food Environment Atlas
    https://catalog.data.gov/dataset/food-environment-atlas-f4a22

Annual Survey of School System Finances
    https://catalog.data.gov/dataset/annual-survey-of-school-system-finances

NCES
    https://nces.ed.gov/

Data USA
    http://datausa.io/

中国国家统计局
    http://www.stats.gov.cn/

Quandl
    https://www.quandl.com/

WorldBank
    https://data.worldbank.org/

IMF
    https://www.imf.org/en/Data

Markets
    https://markets.ft.com/data/

Google Trends
    http://www.google.com/trends?q=google&ctab=0&geo=all&date=all&sort=0

US Macro Regional
    https://research.google.com/audioset/

2000 HUB5 English
    https://catalog.ldc.upenn.edu/LDC2002T43

LibriSpeech
    http://www.openslr.org/12/

PhysioNet
    https://physionet.org/

scikit-learn中的数据集
    sklearn.datasets.load_:自带数据集(数据量较小)
    sklearn.datasets.fetch_:在线下载的数据集
    sklearn.datasets.make_:生成指定类型的随机数据集
    sklearn.datasets.load_svmlight_file:svmlight/libsvm格式的数据集
    sklearn.datasets.fetch_mldata:mldata.org在线下载数据集

 

你可能感兴趣的:(sklearn,python,人工智能)