OCR dataset

Datasets

Three websites maintain lists of datasets for different data types:
1 - www.iapr-tc11.org
2 - tc11.cvc.uab.es
3 - rrc.cvc.uab.es

  • 2017 COCO-Text
  • 2017 DeTEXT
  • 2017 DOST
  • 2017 FSNS
  • 2017 MLT
  • 2017 IEHHR
  • 2011-2015 Born-Digital Images
  • 2013-2015 Focused Scene Text
  • 2013-2015 Text in Videos
  • 2015 Incidental Scene Text

  • ICDAR Chinese 2017

    • More than 12,000 images, most of them collected in the wild with phone cameras.
    • Task: Chinese Text in the Wild.
  • Chinese Text in the Wild 2017

    • 32,285 high-resolution images, 1,018,402 character instances, 3,850 character categories, 6 kinds of attributes
  • Total-Text 2017

    • 1,555 images, 11,459 text instances; includes curved text
  • SCUT_FORU_DB_Release 2016

    • FORU has two parts: the Chinese2k and English2k datasets.
  • SynthText in the Wild Dataset 2016

    • 800 thousand images, 8 million synthetic word instances.
    • Each text instance is annotated with its text-string, word-level and character-level bounding-boxes.
  • COCO-Text (Computer Vision Group, Cornell) 2016

    • 63,686 images, 173,589 text instances, 3 fine-grained text attributes.
    • Task: text localization and recognition
    • COCO-Text API
  • USTB-SV1k 2014

    • 1,000 street view (patch) images from 6 US cities (500 for training, 500 for testing)
  • Synthetic Word Dataset (Oxford, VGG) 2014

    • 9 million images covering 90k English words
    • Task: text recognition, segmentation
    • download
  • IIIT 5K-Words 2012

    • 5,000 images from scene text and born-digital images (2,000 for training, 3,000 for testing)
    • Each image is a cropped word image of scene text with case-insensitive labels
    • Task: text recognition
    • download
  • StanfordSynth (Stanford, AI Group) 2012

    • Small single-character images of 62 characters (0-9, a-z, A-Z)
    • Task: text recognition
    • download
  • MSRA Text Detection 500 Database (MSRA-TD500) 2012

    • 500 natural images (resolutions vary from 1296x864 to 1920x1280)
    • Text in Chinese, English, or a mixture of both
    • Task: text detection
  • OSTD 2011

    • No download link could be found.
  • Traffic Guide Panel Text Dataset (TGPT) 2016

    • 3,841 high-resolution images: 2,315 contain traffic guide panels with panel-level annotations (1,911 for training and 404 for testing; all testing images are manually labeled with tight ground-truth bounding boxes for text regions), and 1,526 contain no traffic signs.
  • Street View Text (SVT) 2010

    • 350 high resolution images (average size 1260 × 860) (100 images for training and 250 images for testing)
    • Only word-level bounding boxes are provided, with case-insensitive labels
    • Task: text localization
  • KAIST Scene_Text Database 2010

    • 3000 images of indoor and outdoor scenes containing text
    • Korean, English (Number), and Mixed (Korean + English + Number)
    • Task: text localization, segmentation and recognition
  • Chars74k 2009

    • Over 74K character images taken from natural images, plus a set of synthetically generated characters
    • Small single-character images of 62 characters (0-9, a-z, A-Z)
    • Task: text recognition
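
When choosing among the datasets above, it can help to keep a few of them in a small table and filter by task. The following is only a minimal sketch: the Dataset record, the task labels, and the helper names with_task and splits_consistent are hypothetical and not part of any dataset's API. The image counts and split sizes are copied from the list above.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Dataset:
    """Hypothetical record for one entry of the list above."""
    name: str
    year: int
    images: int
    tasks: Tuple[str, ...]


# A few entries copied from the list; the task labels are our own shorthand.
CATALOG = [
    Dataset("COCO-Text", 2016, 63686, ("localization", "recognition")),
    Dataset("IIIT 5K-Words", 2012, 5000, ("recognition",)),
    Dataset("MSRA-TD500", 2012, 500, ("detection",)),
    Dataset("Street View Text (SVT)", 2010, 350, ("localization",)),
    Dataset("KAIST Scene_Text", 2010, 3000,
            ("localization", "segmentation", "recognition")),
]


def with_task(catalog: List[Dataset], task: str) -> List[str]:
    """Return the names of all datasets that list the given task."""
    return [d.name for d in catalog if task in d.tasks]


def splits_consistent(total: int, parts: List[int]) -> bool:
    """Check that the stated split sizes add up to the stated total."""
    return sum(parts) == total


if __name__ == "__main__":
    print(with_task(CATALOG, "recognition"))
    # Sanity-check the train/test splits quoted in the list.
    print(splits_consistent(5000, [2000, 3000]))   # IIIT 5K-Words
    print(splits_consistent(350, [100, 250]))      # SVT
    print(splits_consistent(2315, [1911, 404]))    # TGPT annotated images
```

The split check is a quick way to catch transcription errors when copying figures from dataset pages into a list like this one.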
