Tesseract-OCR(开源光学字符识别引擎)

Tesseract-OCR Background

           The Tesseract OCR engine was one of the top 3 engines in the 1995 UNLV Accuracy test. Between 1995 and 2006 it had little work done on it, but it is probably one of the most accurate open source OCR engines available. The source code will read a binary, grey or color image and output text. Image input is managed by the Leptonica Image Processing Library which can read a wide variety of image formats.

      更多详情请访问项目主页: http://code.google.com/p/tesseract-ocr/


                                   
TesseractDotnet

          TesseractDot 是Tesseract-OCR的.NET项目, 方便.NET开发人员使用Tesseract-OCR.但是我还没发现C++可用的类库,源码也无法编译成dll.
     
      更多详情请访问项目主页: http://code.google.com/p/tesseractdotnet/


另外推荐一些文章:
       使用Tesseract OCR 提取复杂图像中的文字
       tesseract 训练
       TrainingTesseract3

Tesseract-OCR(开源光学字符识别引擎)_第1张图片

你可能感兴趣的:(.net,image,processing,library,引擎,output)