导航

JavaYe

欢迎光临小站,喝口咖啡,品位Java,希望您喜欢JavaYe.COM 。

« JavaYe原创入门视频【原创】一行代码把网页调成灰暗色调 »

【原创】一个好用的OCR模块


       最近忽然对验证码识别很感兴趣,在网上找到一个OCR的模块tesseract , 试用了一下啊,效果不错。我下了个最新版的tesseract,貌似不需要怎么训练识别率也能达到90%以上,前提是自己要写程序把图片变成黑白的并且去噪。

         

Background

The Tesseract OCR engine was one of the top 3 engines in the 1995 UNLV Accuracy test. Between 1995 and 2006 it had little work done on it, but it is probably one of the most accurate open source OCR engines available. The source code will read a binary, grey or color image and output text. A tiff reader is built in that will read uncompressed TIFF images, or libtiff can be added to read compressed images.

 

  • 相关文章:

发表评论:

◎欢迎参与讨论,请在这里发表您的看法、交流您的观点。

Copyright 2007-2008 JavaYe.com. Some Rights Reserved.

Search

控制面板

  • [管理登录]  [给我留言]

最近发表