An interface that represents common TessAPI classes/constants.
Callback for cancel_func.
Note that from version 2.0 onward the format of char_code is UTF-8.
ASCII characters come out as one structure, but any other character is
returned as two or more instances of this structure, each holding a
single byte of the UTF-8 code and each having the same bounding box.
Programs that want to handle languages with different character sets
need to handle extended characters appropriately, but all code needs to
be prepared to receive UTF-8 coded characters for characters such as
bullets and fancy quotes.
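The byte-per-structure behaviour described above can be handled by regrouping the bytes before decoding. The following is only a minimal sketch under stated assumptions: `ByteRecord`, `Glyph` and `reassemble` are hypothetical names invented for illustration and are not Tess4J or Tesseract types. `ByteRecord` stands in for one returned structure (a single char_code byte plus its bounding box).

```java
import java.io.ByteArrayOutputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

class Utf8CharReassembler {

    // Hypothetical stand-in for one per-byte character structure (not a Tess4J type).
    static final class ByteRecord {
        final byte code;
        final int left, top, right, bottom;

        ByteRecord(byte code, int left, int top, int right, int bottom) {
            this.code = code;
            this.left = left;
            this.top = top;
            this.right = right;
            this.bottom = bottom;
        }

        boolean sameBox(ByteRecord o) {
            return left == o.left && top == o.top && right == o.right && bottom == o.bottom;
        }
    }

    // Hypothetical result type: one decoded character with its bounding box.
    static final class Glyph {
        final String text;
        final int left, top, right, bottom;

        Glyph(String text, ByteRecord box) {
            this.text = text;
            this.left = box.left;
            this.top = box.top;
            this.right = box.right;
            this.bottom = box.bottom;
        }
    }

    // Number of bytes in a UTF-8 sequence, derived from its lead byte.
    static int sequenceLength(byte lead) {
        int b = lead & 0xFF;
        if (b < 0x80) return 1;            // ASCII
        if ((b & 0xE0) == 0xC0) return 2;
        if ((b & 0xF0) == 0xE0) return 3;
        if ((b & 0xF8) == 0xF0) return 4;
        return 1;                          // invalid lead byte: treat as a single unit
    }

    static boolean isContinuation(byte b) {
        return (b & 0xC0) == 0x80;
    }

    // Merge consecutive records that form one UTF-8 sequence and share a bounding box.
    static List<Glyph> reassemble(List<ByteRecord> records) {
        List<Glyph> out = new ArrayList<>();
        int i = 0;
        while (i < records.size()) {
            ByteRecord first = records.get(i);
            int len = sequenceLength(first.code);
            ByteArrayOutputStream buf = new ByteArrayOutputStream();
            buf.write(first.code);
            int j = i + 1;
            while (j < records.size() && j < i + len
                    && isContinuation(records.get(j).code)
                    && records.get(j).sameBox(first)) {
                buf.write(records.get(j).code);
                j++;
            }
            out.add(new Glyph(new String(buf.toByteArray(), StandardCharsets.UTF_8), first));
            i = j;
        }
        return out;
    }

    public static void main(String[] args) {
        List<ByteRecord> records = new ArrayList<>();
        records.add(new ByteRecord((byte) 'A', 0, 0, 10, 12));
        // A bullet (U+2022) arrives as three single-byte records with one shared box.
        for (byte b : "\u2022".getBytes(StandardCharsets.UTF_8)) {
            records.add(new ByteRecord(b, 12, 0, 20, 12));
        }
        for (Glyph g : reassemble(records)) {
            System.out.printf("U+%04X at (%d,%d)-(%d,%d)%n",
                    g.text.codePointAt(0), g.left, g.top, g.right, g.bottom);
        }
    }
}
```

In this sketch the four input records collapse into two glyphs, the ASCII letter and the bullet. Checking both the continuation-byte pattern and the shared bounding box is a defensive choice; a truncated or malformed sequence is decoded as-is rather than rejected.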
Description of the output of the OCR engine.
Base class for all tesseract APIs.
MutableIterator adds access to internal data structures.
Class to iterate over tesseract page structure, providing access to all
levels of the page hierarchy, without including any tesseract headers or
having to handle any tesseract structures.
WARNING!
Iterator for tesseract results that is capable of iterating in proper
reading order over bidirectional (e.g. mixed Hebrew and English) text.
Interface for rendering tesseract results into a document, such as text,
hOCR or PDF.
An interface that represents common OCR methods.
Rendered formats supported by Tesseract.
Encapsulates Tesseract OCR results at file level.
Encapsulates Tesseract Orientation Script Detection (OSD) results.
A Java wrapper for Tesseract OCR 5.4.1 API using JNA Interface Mapping.
A Java wrapper for Tesseract OCR 5.4.1 API using JNA Direct Mapping.
Encapsulates Tesseract OCR results at a certain page iterator level.