An interface that represents common TessAPI classes and constants.
Callback for cancel_func.
Note that from version 2.0 onward the format of char_code is UTF-8. An ASCII
character is returned in a single structure, but any other character is
returned as two or more instances of this structure, each holding a single
byte of the UTF-8 code and all sharing the same bounding box.
Programs that handle languages with different character sets need to handle
extended characters appropriately, but all code must be prepared to receive
UTF-8 coded characters for characters such as bullets and fancy quotes.
Description of the output of the OCR engine.
Base class for all tesseract APIs.
MutableIterator adds access to internal data structures.
Class to iterate over tesseract page structure, providing access to all
levels of the page hierarchy, without including any tesseract headers or
having to handle any tesseract structures.
WARNING! This class points to data held within the TessBaseAPI class, and
therefore can only be used while the TessBaseAPI class still exists and
has not been subjected to a call of Init, SetImage, Recognize, Clear, End,
DetectOS, or anything else that changes the internal PAGE_RES.
Iterator for tesseract results that is capable of iterating in proper
reading order over bidirectional text.
Interface for rendering tesseract results into a document, such as text,
hOCR, or PDF.
An interface that represents common OCR methods.
Rendered formats supported by Tesseract.
Encapsulates Tesseract OCR results at file level.
A Java wrapper for the Tesseract OCR 4.1.0 API using JNA Interface Mapping.
A Java wrapper for the Tesseract OCR 4.1.0 API using JNA Direct Mapping.
Encapsulates Tesseract OCR results at a certain page iterator level.
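
The char_code note above says that a non-ASCII character arrives as several structures, one UTF-8 byte each, all with the same bounding box. A minimal sketch of how a caller might regroup those bytes into whole characters follows. The `Utf8Regrouper` and `ByteBox` names and their fields are hypothetical stand-ins invented for this illustration; they are not Tess4J or Tesseract types, and real code would read the byte and box from whatever structure the API returns.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Tess4J: regroups per-byte results into characters.
final class Utf8Regrouper {

    // Hypothetical stand-in for one result structure: one UTF-8 byte plus its box.
    static final class ByteBox {
        final byte code;
        final int left, top, right, bottom;

        ByteBox(int code, int left, int top, int right, int bottom) {
            this.code = (byte) code;
            this.left = left;
            this.top = top;
            this.right = right;
            this.bottom = bottom;
        }

        boolean sameBox(ByteBox o) {
            return left == o.left && top == o.top && right == o.right && bottom == o.bottom;
        }
    }

    // Length of the UTF-8 sequence announced by a lead byte.
    static int sequenceLength(byte lead) {
        int b = lead & 0xFF;
        if (b < 0x80) return 1;            // ASCII
        if ((b & 0xE0) == 0xC0) return 2;  // 110xxxxx
        if ((b & 0xF0) == 0xE0) return 3;  // 1110xxxx
        if ((b & 0xF8) == 0xF0) return 4;  // 11110xxx
        return 1;                          // stray continuation or invalid byte
    }

    static boolean isContinuation(byte b) {
        return (b & 0xC0) == 0x80;         // 10xxxxxx
    }

    // Joins consecutive entries that belong to one UTF-8 sequence and share a box.
    static List<String> regroup(List<ByteBox> boxes) {
        List<String> out = new ArrayList<>();
        int i = 0;
        while (i < boxes.size()) {
            ByteBox first = boxes.get(i);
            int want = sequenceLength(first.code);
            int n = 1;
            while (n < want && i + n < boxes.size()
                    && isContinuation(boxes.get(i + n).code)
                    && boxes.get(i + n).sameBox(first)) {
                n++;
            }
            byte[] bytes = new byte[n];
            for (int k = 0; k < n; k++) {
                bytes[k] = boxes.get(i + k).code;
            }
            // An incomplete sequence decodes to the Unicode replacement character.
            out.add(new String(bytes, StandardCharsets.UTF_8));
            i += n;
        }
        return out;
    }
}
```

Calling `Utf8Regrouper.regroup` on a list holding the byte 0x41, then the three bytes 0xE2 0x80 0xA2 with identical boxes, then 0x42 yields three strings: the letter A, the bullet character U+2022, and the letter B. Checking both the continuation-byte pattern and the bounding box is a design choice of this sketch; it guards against merging bytes from two adjacent characters if a sequence is cut short.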