ITessAPI.EANYCODE_CHAR |
From version 2.0 onward, char_code is encoded as UTF-8. An ASCII character
is returned as a single structure, while any other character is returned
as two or more instances of this structure, each holding a single byte of
the UTF-8 code and all sharing the same bounding box.
Programs that handle languages with different character sets
need to handle extended characters appropriately, but all
code must be prepared to receive UTF-8 coded characters for
characters such as bullets and fancy quotes.
|
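The byte-per-structure behaviour described for EANYCODE_CHAR can be handled by regrouping consecutive entries into whole UTF-8 sequences. The following is a minimal sketch, not part of the library: CharRecord is a hypothetical stand-in for the real structure (one UTF-8 byte plus a bounding box), and Utf8CharCodes, sequenceLength and reassemble are illustrative names. It assumes the entries arrive in order and that the first byte of each character is a valid UTF-8 lead byte.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

class Utf8CharCodes {
    // Hypothetical stand-in for one EANYCODE_CHAR entry (assumption, not the Tess4J class):
    // a single UTF-8 byte together with the bounding box of the character it belongs to.
    static final class CharRecord {
        final int charCode; // one UTF-8 byte, 0..255
        final int left, right, top, bottom;

        CharRecord(int charCode, int left, int right, int top, int bottom) {
            this.charCode = charCode;
            this.left = left;
            this.right = right;
            this.top = top;
            this.bottom = bottom;
        }
    }

    // Number of bytes in the UTF-8 sequence that starts with the given lead byte.
    static int sequenceLength(int lead) {
        if ((lead & 0x80) == 0x00) return 1; // 0xxxxxxx: ASCII
        if ((lead & 0xE0) == 0xC0) return 2; // 110xxxxx
        if ((lead & 0xF0) == 0xE0) return 3; // 1110xxxx
        if ((lead & 0xF8) == 0xF0) return 4; // 11110xxx
        throw new IllegalArgumentException("Not a UTF-8 lead byte: " + lead);
    }

    // Regroup per-byte records into one string per character.
    static List<String> reassemble(List<CharRecord> records) {
        List<String> characters = new ArrayList<>();
        int i = 0;
        while (i < records.size()) {
            int n = sequenceLength(records.get(i).charCode & 0xFF);
            if (i + n > records.size()) {
                throw new IllegalArgumentException("Truncated UTF-8 sequence");
            }
            byte[] bytes = new byte[n];
            for (int k = 0; k < n; k++) {
                bytes[k] = (byte) records.get(i + k).charCode;
            }
            characters.add(new String(bytes, StandardCharsets.UTF_8));
            i += n;
        }
        return characters;
    }

    public static void main(String[] args) {
        List<CharRecord> records = new ArrayList<>();
        records.add(new CharRecord('A', 0, 10, 0, 12));
        // A bullet (U+2022) is three UTF-8 bytes, so it arrives as three records
        // that share one bounding box.
        for (int b : new int[] {0xE2, 0x80, 0xA2}) {
            records.add(new CharRecord(b, 12, 20, 0, 12));
        }
        System.out.println(reassemble(records).size() + " characters");
    }
}
```

Because every byte of a multi-byte character carries the same bounding box, the box of the first record in each group can be taken as the box of the reassembled character.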
ITessAPI.ETEXT_DESC |
Description of the output of the OCR engine.
|
ITessAPI.TessBaseAPI |
Base class for all tesseract APIs.
|
ITessAPI.TessChoiceIterator |
|
ITessAPI.TessMutableIterator |
MutableIterator adds access to internal data structures.
|
ITessAPI.TessPageIterator |
Class to iterate over tesseract page structure, providing access to all
levels of the page hierarchy, without including any tesseract headers or
having to handle any tesseract structures.
WARNING! This class points to data held within the TessBaseAPI class, and
therefore can only be used while the TessBaseAPI class still exists and
has not been subjected to a call of Init,
SetImage, Recognize, Clear,
End, DetectOS, or anything else that changes the
internal PAGE_RES.
|
ITessAPI.TessResultIterator |
Iterator for tesseract results that is capable of iterating in proper
reading order over bidirectional text.
|
ITessAPI.TessResultRenderer |
Interface for rendering tesseract results into a document, such as text,
HOCR or pdf.
|
ITessAPI.TimeVal |
|
OCRResult |
Encapsulates Tesseract OCR results at file level.
|
TessAPI1 |
A Java wrapper for Tesseract OCR 4.1.0 API using
JNA Direct Mapping.
|
Tesseract |
An object layer on top of TessAPI that provides character
recognition support for common image formats and multi-page TIFF images,
beyond the uncompressed, binary TIFF format supported by the Tesseract OCR
engine.
|
Tesseract1 |
An object layer on top of TessAPI1 that provides character
recognition support for common image formats and multi-page TIFF images,
beyond the uncompressed, binary TIFF format supported by the Tesseract OCR
engine.
|
Word |
Encapsulates Tesseract OCR results at a certain page iterator level.
|