blocks|key|1551566|text|无耻的插件和免责声明:在.NET中使用的my+company包|type|unstyled|depth|inlineStyleRanges|entityRanges|offset|length|data|1551567|Tesseract是一个OK+OCR引擎。它可能会遗漏很多东西，并且很容易被非文本所迷惑。你能做的最好的事情就是确保它只得到文本。下一个最好的办法是给它一些合理的二值化(自适应或动态阈值)或灰度，让它尝试进行二值化。|1551568|entityMap|0|LINK|mutability|MUTABLE|url|http://www.atalasoft.com/^0|K|A|0|0|0^^$0|@$1|2|3|4|5|6|7|N|8|@]|9|@$A|O|B|P|1|Q]]|C|$]]|$1|D|3|E|5|6|7|R|8|@]|9|@]|C|$]]|$1|F|3|-4|5|6|7|S|8|@]|9|@]|C|$]]]|G|$H|$5|I|J|K|C|$L|M]]]]

Shameless plug and disclaimer: <a href="http://www.atalasoft.com" rel="nofollow noreferrer">my company</a> packages Tesseract for use in .NET

Tesseract is an OK OCR engine. It can miss a lot and gets readily confused by non-text. The best thing you can do for it is to make sure it gets text only. The next best thing is to give it something sanely binarized (adaptive or dynamic threshold to get there) or grayscale and let it try to do binarization.

blocks|key|1372628|text|characters|type|unstyled|depth|inlineStyleRanges|entityRanges|data|1372629|Profit：)|unordered-list-item|1372630|ordered-list-item|1372631|使图像格外整洁，并且周围有足够的空闲空间:)|1372632|1372633|1372634|这里有几个真实世界的例子。|1372635|OCR是原始图像(裁剪后的功率表numbers)|1372636|Second图像在GIMP中略微清理过图像，tesseract|1372637|
|1372638|Third图像中约50%25的|1372639|准确率是完全清理过的图像-+100%25无需任何训练即可识别！|1372640|1372641|1372642|​|1372643|📷|atomic|offset|length|1372644|1372645|1372646|1372647|entityMap|0|IMAGE|mutability|IMMUTABLE|imageUrl|https://ask.qcloudimg.com/http-save/yehe-900000/0bcb5d5451836344c4fc97b732bc79e6.jpeg|imageAlt|1|https://ask.qcloudimg.com/http-save/yehe-900000/59ca3c3b0f24f89d1f5bc2789327994f.png|2|https://ask.qcloudimg.com/http-save/yehe-900000/3339341009dca33592b523e5846de4b0.png^0|0|0|0|0|0|0|0|0|0|0|0|0|0|0|0|0|1|0|0|0|1|1|0|0|1|2|0|0^^$0|@$1|2|3|4|5|6|7|1L|8|@]|9|@]|A|$]]|$1|B|3|C|5|D|7|1M|8|@]|9|@]|A|$]]|$1|E|3|-4|5|F|7|1N|8|@]|9|@]|A|$]]|$1|G|3|H|5|F|7|1O|8|@]|9|@]|A|$]]|$1|I|3|-4|5|6|7|1P|8|@]|9|@]|A|$]]|$1|J|3|-4|5|6|7|1Q|8|@]|9|@]|A|$]]|$1|K|3|L|5|6|7|1R|8|@]|9|@]|A|$]]|$1|M|3|N|5|6|7|1S|8|@]|9|@]|A|$]]|$1|O|3|P|5|D|7|1T|8|@]|9|@]|A|$]]|$1|Q|3|R|5|6|7|1U|8|@]|9|@]|A|$]]|$1|S|3|T|5|D|7|1V|8|@]|9|@]|A|$]]|$1|U|3|V|5|D|7|1W|8|@]|9|@]|A|$]]|$1|W|3|-4|5|6|7|1X|8|@]|9|@]|A|$]]|$1|X|3|-4|5|6|7|1Y|8|@]|9|@]|A|$]]|$1|Y|3|Z|5|6|7|1Z|8|@]|9|@]|A|$]]|$1|10|3|11|5|12|7|20|8|@]|9|@$13|21|14|22|1|23]]|A|$]]|$1|15|3|11|5|12|7|24|8|@]|9|@$13|25|14|26|1|27]]|A|$]]|$1|16|3|11|5|12|7|28|8|@]|9|@$13|29|14|2A|1|2B]]|A|$]]|$1|17|3|Z|5|6|7|2C|8|@]|9|@]|A|$]]|$1|18|3|-4|5|6|7|2D|8|@]|9|@]|A|$]]]|19|$1A|$5|1B|1C|1D|A|$1E|1F|1G|-4]]|1H|$5|1B|1C|1D|A|$1E|1I|1G|-4]]|1J|$5|1B|1C|1D|A|$1E|1K|1G|-4]]]]

<ol>
<li>Train tesseract to recognize your font</li>
<li>Make image extra clean and with enough free space around characters</li>
<li>Profit :)</li>
</ol>

Here are few real world examples. 

<ul>
<li>First image is original image (croped power meter numbers)</li>
<li>Second image is slightly cleaned up image in GIMP, around 50% OCR accuracy in tesseract</li>
<li>Third image is completely cleaned image - 100% OCR recognized without any training!</li>
</ul>

<img src="https://i.stack.imgur.com/JUNNJ.jpg" alt="enter image description here">
<img src="https://i.stack.imgur.com/i48J7.png" alt="enter image description here">
<img src="https://i.stack.imgur.com/KjD0v.png" alt="enter image description here">

blocks|key|249349|text|为了区分0和O，一个简单的解决方案是选择一种能够区分两者的字体(例如:0的中间有一个破折号或点)。这在你的应用程序中是可以接受的吗？|type|unstyled|depth|inlineStyleRanges|entityRanges|data|249350|另一种解决方案是在文本的逐个字符分析之后应用基于字典的步骤-将识别的文本输入到某种形式的拼写检查器或验证器中，以区分困难的字符。|249351|例如，一个后面跟着其他数字的圆形符号最有可能是零，而后面跟着字母的同一个符号最有可能是大写的o。这是一个微不足道的例子，但它表明了上下文对于制作更可靠的OCR系统是多么必要。|249352|entityMap^0|0|0|0^^$0|@$1|2|3|4|5|6|7|H|8|@]|9|@]|A|$]]|$1|B|3|C|5|6|7|I|8|@]|9|@]|A|$]]|$1|D|3|E|5|6|7|J|8|@]|9|@]|A|$]]|$1|F|3|-4|5|6|7|K|8|@]|9|@]|A|$]]]|G|$]]

For distinguishing between 0 and O, one simple solution is to choose a font that distinguishes between both (eg: 0 has a dash or dot in its middle). Would that be acceptable in your application?

Another solution is to apply a dictionary-based step after the character-by-character analysis of the text - feeding the recognized text into some form of spell-checker or validator to differentiate between difficult characters. 

For instance, a round symbol followed by other numbers is most likely to be a zero, while the same symbol followed by letters is most likely to be a capital o. It's a trivial example, but it shows how context is necessary to make a more reliable OCR system.

blocks|key|1372455|text|即使在最好的条件下，OCR变体也会悄悄接近你。你最好的选择就是设计你的测试来意识到它们。|type|unstyled|depth|inlineStyleRanges|entityRanges|data|1372456|entityMap^0|0^^$0|@$1|2|3|4|5|6|7|D|8|@]|9|@]|A|$]]|$1|B|3|-4|5|6|7|E|8|@]|9|@]|A|$]]]|C|$]]

Even under the best conditions OCR variants will sneak up on you. Your best option will be to design your tests to be aware of them.

I am using <a href="http://code.google.com/p/tesseract-ocr/" rel="nofollow noreferrer">Tesseract OCR</a> (via <a href="http://code.google.com/p/pytesser/" rel="nofollow noreferrer">pytesser</a>) and PIL (Python Image Library) for automated test of an application.

I am checking that the displayed text is ok by making a screenshot and getting the text thanks to tesseract.

I had some issues in the beginning and it seems to work better since I have increased the size of the screenshot thanks to the bicubic interpolation of PIL.

Unfortunatelly, I still have some mistakes like confusion between '0' and 'O'. I can imagine that I will have other similar issues in the future.

I would like to know if there are some techniques to prepare an image in order to help the OCR. Any idea is welcomed.

Thanks in advance

How to give best chance of success to an OCR software?

翻译质量差，导致语言生硬或混乱。

没有提供实际的解决方法或示例。

解答不清晰，无法理解或解决问题。

页面排版不美观，阅读体验差。

文章

问答

视频

教程

学习中心

腾讯云实验室

直播

竞赛

腾讯云代码分析专区

腾讯iOA零信任安全管理系统专区

腾讯云架构师技术同盟交流圈

腾讯云数据库专区

腾讯云智能顾问专区

腾讯云原生专区

腾讯混元专区

腾讯云TCE专区

腾讯云Lighthouse专区

腾讯云HAI专区

腾讯云Edgeone专区

腾讯云存储专区

腾讯云智能专区

腾讯轻联专区 

腾讯云开发专区

TAPD专区

腾讯轻量云游戏服专区

腾讯云最具价值专家

腾讯云架构师技术同盟

腾讯云创作之星

腾讯云开发者先锋

腾讯云AI代码助手

云原生构建

TAPD 敏捷项目管理

Cloud Studio

SDK中心

API中心

命令行工具

功能1上新10个字符

功能2描述100个字符功能2描述100个字符功能2描述100个字符功能2描述100个字符功能2描述100个字符功能2描述100个字符功能2描述100个字符功能2描述100个字符功能2描述100个字符。

功能2上新100个字符功能2上新100个字符功能2上新100个字符功能2上新100个字符功能2上新100个字符功能2上新100个字符功能2上新100个字符功能2上新100个字符功能2上新100个字符。

功能5描述100个字符功能5描述100个字符功能5描述100个字符功能5描述100个字符功能5描述100个字符功能5描述100个字符

功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符

功能4上新

文章&问答评论现已支持表情

全新交互，全新视觉，新增快捷键、悬浮工具栏、高亮块等功能并同时优化现有功能，全面提升创作效率和体验

社区富文本编辑器全新改版！诚邀体验～ 

精选全网热门MCP server，让你的AI更好用 🚀

💥开发者 MCP广场重磅上线！

涵盖代码开发、场景应用、自动测试全流程，助你从零构建专属AI助手

一站式MCP教程库，解锁AI应用新玩法

聚焦“写作效率、视觉美观与运行性能”三方面进行全面升级，为您提供更高效、稳定的创作环境

社区富文本&Markdown编辑器全新改版上线，欢迎大家体验!

诚挚邀请您参与本次调研，分享您的真实使用感受与建议。您的反馈至关重要，感谢您的支持与参与！

社区新版编辑器体验调研

我正在使用 (通过)和PIL (Python Image Library)对应用程序进行自动化测试。多亏了tesseract，我正在通过截屏和获取文本来检查显示的文本是否正常。我在开始的时候遇到了一些问题，它似乎工作得更好，因为我增加了截图的尺寸，这要归功于PIL的双三次插值。不幸的是，我仍然有一些错误，比如把“0”和“O”搞混了。我可以想象，我将来还会遇到其他类似的问题。我想知道是否有一些技术来

问如何给OCR软件最大的成功机会？
EN

回答 4

Stack Overflow用户

Stack Overflow用户

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问如何给OCR软件最大的成功机会？EN

回答 4

Stack Overflow用户

Stack Overflow用户

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问如何给OCR软件最大的成功机会？
EN