Class Summary |
EANYCODE_CHAR |
From version 2.0 onward, char_code is encoded as UTF-8. An ASCII character is returned
in a single structure, while any other character is returned as two or more instances
of this structure, each holding one byte of the UTF-8 sequence and all sharing the
same bounding box.
Programs that handle languages with different character sets need to handle extended
characters appropriately, and all code must be prepared to receive UTF-8 coded
characters such as bullets and fancy quotes. |
ETEXT_DESC |
Description of the output of the OCR engine. |
TessDllAPI1 |
A Java wrapper for the Tesseract OCR DLL using JNA Direct Mapping. |
Tesseract |
An object layer on top of TessDllAPI that provides character recognition support for common image formats and multi-page TIFF images, beyond the uncompressed,
binary TIFF format supported by the Tesseract OCR engine. |
Tesseract1 |
An object layer on top of TessDllAPI1 that provides character recognition support for common image formats and multi-page TIFF images, beyond the uncompressed,
binary TIFF format supported by the Tesseract OCR engine. |
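
The EANYCODE_CHAR description above implies that a caller has to stitch multi-byte characters back together before using them. The following is a minimal sketch of one way to do that, not the library's own code. The Cell and Glyph classes, the Utf8Reassembler name, and the sample coordinates are hypothetical stand-ins introduced for illustration; a real program would read char_code and the bounding box from the EANYCODE_CHAR structures returned by the engine. The sketch assumes the bytes of one character arrive in consecutive entries and uses the UTF-8 lead byte to decide how many entries belong together.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

public class Utf8Reassembler {

    /** Hypothetical, simplified stand-in for one EANYCODE_CHAR entry: one UTF-8 byte plus its box. */
    static final class Cell {
        final byte code;
        final int left, right, top, bottom;

        Cell(int code, int left, int right, int top, int bottom) {
            this.code = (byte) code;
            this.left = left;
            this.right = right;
            this.top = top;
            this.bottom = bottom;
        }
    }

    /** Hypothetical result type: one decoded character and the box shared by its bytes. */
    static final class Glyph {
        final String text;
        final int left, right, top, bottom;

        Glyph(String text, Cell box) {
            this.text = text;
            this.left = box.left;
            this.right = box.right;
            this.top = box.top;
            this.bottom = box.bottom;
        }
    }

    /** Length in bytes of the UTF-8 sequence that starts with the given lead byte. */
    static int sequenceLength(byte lead) {
        int b = lead & 0xFF;
        if (b < 0x80) return 1;              // plain ASCII
        if ((b & 0xE0) == 0xC0) return 2;    // 110xxxxx
        if ((b & 0xF0) == 0xE0) return 3;    // 1110xxxx
        if ((b & 0xF8) == 0xF0) return 4;    // 11110xxx
        return 1;                            // stray or invalid byte: consume it alone
    }

    /** Groups consecutive single-byte entries into whole characters. */
    static List<Glyph> reassemble(List<Cell> cells) {
        List<Glyph> glyphs = new ArrayList<>();
        int i = 0;
        while (i < cells.size()) {
            Cell first = cells.get(i);
            // Never read past the end if the input is truncated.
            int n = Math.min(sequenceLength(first.code), cells.size() - i);
            byte[] bytes = new byte[n];
            for (int k = 0; k < n; k++) {
                bytes[k] = cells.get(i + k).code;
            }
            // All bytes of one character share a box, so the first entry's box is used.
            glyphs.add(new Glyph(new String(bytes, StandardCharsets.UTF_8), first));
            i += n;
        }
        return glyphs;
    }

    public static void main(String[] args) {
        List<Cell> cells = new ArrayList<>();
        cells.add(new Cell('A', 10, 20, 30, 5));
        // U+2022 BULLET is E2 80 A2 in UTF-8: three entries with the same box.
        cells.add(new Cell(0xE2, 25, 32, 22, 12));
        cells.add(new Cell(0x80, 25, 32, 22, 12));
        cells.add(new Cell(0xA2, 25, 32, 22, 12));

        for (Glyph g : reassemble(cells)) {
            System.out.println("U+" + Integer.toHexString(g.text.codePointAt(0)).toUpperCase()
                    + " left=" + g.left + " right=" + g.right);
        }
    }
}
```

Grouping by lead byte rather than by comparing bounding boxes keeps two adjacent characters apart even if they happen to report identical boxes. Invalid sequences are passed to the decoder as-is, which substitutes the Unicode replacement character instead of failing.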