Add Custom Recognition Blocks & Set Automatic Spelling Correction in .NET Apps
This new release of Aspose.OCR for .NET 2.0.0 has introduced many useful features, including Automatic Spelling Correction, allowing user to Add Custom Text Blocks, Auto Detection of Text Blocks and Removal of Graphics (Non-Textual Blocks).
Lane Cove, New South Wales, Australia, July 10, 2014 (Newswire.com) - What’s new in this release?
We are proud to announce the new release of Aspose.OCR for .NET 2.0.0. Users may now define custom recognition blocks on the document image. These will be processed by the OcrEngine while other areas of the image are skipped. However, if user defined recognition blocks are not set prior to processing, the OcrEngine automatically calculate the recognition blocks using its default behavior. Images with text can be divided into many text recognition blocks. It is sometimes important to apply a special sorting so that the correct text block order is maintained. The OcrConfig class has exposed the DetectReadingOrder property to handle situations where an image within the main document image may contain text blocks of its own. Setting the DetectReadingOrder property to true instructs the OcrEngine to maintain the order of text regions for better recognition result. However, performance degradation can be observed for images with a large number of text blocks. In such cases, it is advised to turn this feature off by setting the DetectReadingOrder property to false. In order to improve the accuracy of the recognized data, the Aspose.OCR for .NET API has exposed the DoSpellingCorrection property for the OcrConfig class. A Boolean property, it can be either set to true or false indicating whether automatic spelling correction should be applied on the recognized text or not. Setting the DoSpellingCorrection property to true will improve the OCR results; however, the process may take more time to complete. Please check the detailed article on setting the OcrEngine to automatically correct spelling . Document images to be processed with OcrEngine may contain graphics. During the OCR process, these graphics may not contain any text to be recognized although they are detected as a separate block. If developers wish to ignore the non-textual blocks such as graphics from the recognition process, they just have to set the RemoveNonText property exposed by the OcrConfig class to true. Last but not the least, we have completely overhauled the OcrEngine with new recognition algorithm for better accuracy of the recognized data and performance considerations. Moreover, Aspose.OCR API now uses a resource file that is just of 5.5MB in size unlike previous resource files of 88MB. This release includes plenty of new & improved features as listed below
· Support new algorithm for OCR.
· Support detection of reading order.
· Support automatic text regions detection.
· Support spelling correction.
· Allow user to define which blocks to process.
· Support removing of non-text blocks.
· Add .Net 4.5 binaries into release
· Add possibility to recognize text on complex background
· Resource file path should be specified explicitly by user.
· Restore layout of image
Other most recent bug fixes are also included in this release
Newly added documentation pages and articles
Some new tips and articles have now been added into Aspose.OCR for .NET documentation that may guide you briefly how to use Aspose.Diagram for performing different tasks like the followings.
- Setting the OcrEngine to Automatically Detect the Reading Order: http://www.aspose.com/docs/display/ocrnet/Setting+the+OcrEngine+to+Automatically+Detect+the+Reading+Order
- Setting the OcrEngine to Automatically Correct the Spellings: http://www.aspose.com/docs/display/ocrnet/Setting+the+OcrEngine+to+Automatically+Correct+the+Spellings
Overview: Aspose.OCR for .NET
Aspose.OCR for .NET is a character recognition component built to allow developers to add OCR functionality in their ASP .NET web applications, web services and Windows applications. It provides a simple set of classes for controlling character recognition tasks. It helps developers to work with image (BMP, TIFF) files from within their own applications. It allows developers to extract text from images quickly & easily, saving time & effort involved in developing an OCR solution from scratch.
More about Aspose.OCR for .NET
- Homepage of Aspose.OCR for .NET: http://www.aspose.com/.net/ocr-component.aspx
-Download Aspose.OCR for .NET: http://www.aspose.com/community/files/51/.net-components/aspose.ocr_for_.net/default.aspx
- Online Demos for Aspose.OCR for .NET:http://www.aspose.com/demos/.net-components/aspose.ocr/default.aspx
- Online documentation of Aspose.OCR for .NET: http://www.aspose.com/docs/display/ocrnet/Home
Contact Information
Aspose Pty Ltd, Suite 163,
79 Longueville Road
Lane Cove, NSW, 2066
Australia
http://www.aspose.com/
sales@aspose.com
Phone: 888.277.6734
Fax: 866.810.9465