TOPOCR 3.1 的主要功能
* Incredible OCR accuracy, upto 99.8% with a 3 MP camera
异乎寻常的OCR准确率,对于3 MP相机来说,准确率更是高达99.8%;
* No page limits, and no extra downloads or components needed
无页面限制,无需另外单独下载或安装其他配件;
* Built-in Document Layout Analysis
内置文件排版分析功能;
* Handles skew and uneven lighting
可处理扭曲和不均匀光线图片
* Output formats include searchable PDF and HTML
输出格式包括常见的PDF与 html格式;
* Able to read 11 different languages
可以识别11种语言(英语、丹麦语、荷兰语、芬兰语、法语、德语、意大利语、挪威语、葡萄牙语、西班牙语、瑞典语);
* Powerful, easy to use Image Processing
强大而易用的图象处理功能;
* Supports Smartphones: See some Smartphone samples
支持智能手机:参见智能手机样本;
* Includes built-in Text and Image WYSIWYG Editors
包括内置文本与图像的WTSIWTG编辑器;
* Post-processing spell checker for all 11 languages
可以对全部11种语言进行后期拼写检查处理;
* Built-in Text-To-Speech software.
内置文本-读音软件;
* Browser Helper Mode supports creating free audio eBooks
浏览帮助模式可以帮助用户制作免费有声电子书。
Understanding Digital Camera OCR 如何理解数码相机OCR
Replacing a scanner with a camera can dramatically improve OCR throughput. It can also allow you to have a portable solution to document capture and OCR on content which normally would be inaccessible. In order to have a high degree of OCR accuracy, it's necessary to understand the limitations of the current version of TopOCR. To give high OCR accuracy, TopOCR requires a minimum of a 3 MegaPixel digital camera. This will allow a user to TIGHTLY frame an 8 1/2" by 11" document. TopOCR can handle variable lighting, which ordinarily would give poor OCR results, and small amounts of document skew. Large amounts of skew will generate significantly more errors. Therefore it is essential for the user to learn how to use their camera effectively for the best OCR results. This best way to improve your skill is by practicing, after first reading our tutorial, download TopOCR and you can get started!
以相机取代扫描仪是对OCR概念的彻底颠覆。本软件还可以使您随时随地解决文件捕捉并将其进行OCR转换的问题。为获得很高的OCR识别率,您需要对TopOCR目前的版本有个很好的了解。为获得好的效果,TopOCR所要求所使用起码相机最低应该为3百万相素,以便可以使用户获得8 1/2'' X 11''的文本框。TopOCR可以处理各种可能会影响产生较差OCR效果或形成文本扭曲的光线。如果存在大量扭曲,则会产生很多问题。故此,为获得最佳的OCR识别效果,用户必须首先学会如何有效地使用好自己的相机。但要获得最终满意的效果,仍需要经过大量的实践。最首要的还是该先阅读我们的“指南”—— 下载TopOCR软件 —— 开始使用!
Document Image Capture with a Camera 用相机获取文本图像
Your Document 您的文本
Babel Reader and TopOCR require images to be in portrait orientation for OCR.
The document should be laying on an uncluttered flat surface.
欲进行OCR处理,我们的Babel Reader 与TopOCR要求所采用的图片格式必须为纵向版面,文本必须置于一个整齐的平面上。
Portrait Orientation 纵向版面 |
|
Configure Your Camera 相机的设置
Bring up the Camera Settings dialog by pressing the camera button on your phone or by running the "Camera" program.
Change the following settings:
通过按动您手机上的拍照按钮或启动“照相”程序来进行以下设定:
* Set resolution to 5MP or less (No limit for TopOCR)
将分辨率设为5 MP或更低(TopOCR无此限制)
* Set JPEG quality to "High"
将JPEG质量范围调到“高-High”
* Enable Anti-Shaking
启动“抗震动”功能
* Enable "Text Mode" if supported
如有可能,请启动“文本模式-Text Mode”
* Use a Time Delay to allow you to get a straight even image
启动“延时功能”,以获得平直的图像
* Other items can be set to Automatic
其他内容可设定为“自动- Automatic”
Typically, the image files that you capture with your camera are stored under My Storage\DCIM.
通常情况下,应该将您通过手机所捕捉到的图像保存至My Storage\DCIM。
Light 光线
For Best Results - Use a Lamp 为获得最佳效果 —— 请使用一盏灯来进行照明 |
|
Make sure there are no shadows on the page from your hands or camera.
要确保页面上没有您的手或相机所形成的阴影。
Holding the Camera 抓住相机 |
|
When you take the picture, the camera lens must be EXACTLY parallel to the page being photographed.
In practice, this is not too hard to do.
Simply hold the camera directly overhead while looking through the viewfinder. Frame the page as much as possible while keeping the horizontal and vertical edges as straight as possible.
拍照时,相机镜头应该与所拍摄页面保持绝对的平行。要做到这点,其实并不难。一边观察取景器,一边直接将相机举过头顶,尽量在将横边和竖边保持垂直情况下选定图像。
声明:
以上中文内容由“上海语翼翻译服务有限公司”独家翻译,供用户参照使用。
更多详情,请访问TopSoft, Ltd官网http://www.topocr.com/
凡引用或转载本处中文内容者,请注明:“本中文翻译内容由上海语翼翻译服务有限公司提供”字样。