Package net.sourceforge.tess4j
Interface ITessAPI
- All Known Subinterfaces:
TessAPI
- All Known Implementing Classes:
TessAPI1, Tesseract1
public interface ITessAPI
An interface that represents common TessAPI classes and constants.
Nested Class Summary
Nested Classes
Modifier and Type, Interface, Description

static interface ITessAPI.CANCEL_FUNC
    Callback for cancel_func.
static class ITessAPI.EANYCODE_CHAR
    It should be noted that the format for char_code for version 2.0 and beyond is UTF-8, which means that ASCII characters will come out as one structure but other characters will be returned in two or more instances of this structure with a single byte of the UTF-8 code in each, but each will have the same bounding box. Programs which want to handle languages with different character sets will need to handle extended characters appropriately, but all code needs to be prepared to receive UTF-8 coded characters for characters such as bullet and fancy quotes.
static class ITessAPI.ETEXT_DESC
    Description of the output of the OCR engine.
static class ITessAPI.TessBaseAPI
    Base class for all tesseract APIs.
static interface ITessAPI.TessCancelFunc
static class ITessAPI.TessChoiceIterator
static class ITessAPI.TessMutableIterator
    MutableIterator adds access to internal data structures.
static interface ITessAPI.TessOcrEngineMode
    When Tesseract/Cube is initialized we can choose to instantiate/load/run only the Tesseract part, only the Cube part or both along with the combiner.
static interface ITessAPI.TessOrientation
    +------------------+ | 1 Aaaa Aaaa Aaaa | | Aaa aa aaa aa | | aaaaaa A aa aaa.
static class ITessAPI.TessPageIterator
    Class to iterate over tesseract page structure, providing access to all levels of the page hierarchy, without including any tesseract headers or having to handle any tesseract structures. WARNING! This class points to data held within the TessBaseAPI class, and therefore can only be used while the TessBaseAPI class still exists and has not been subjected to a call of Init, SetImage, Recognize, Clear, End, DetectOS, or anything else that changes the internal PAGE_RES.
static interface ITessAPI.TessPageIteratorLevel
    Enum of the elements of the page hierarchy, used in ResultIterator to provide functions that operate on each level without having to have 5x as many functions.
static interface ITessAPI.TessPageSegMode
    Possible modes for page layout analysis.
static interface ITessAPI.TessParagraphJustification
    NOTA BENE: Fully justified paragraphs (text aligned to both left and right margins) are marked by Tesseract with JUSTIFICATION_LEFT if their text is written with a left-to-right script and with JUSTIFICATION_RIGHT if their text is written in a right-to-left script. Interpretation for text read in vertical lines: "Left" is wherever the starting reading position is.
static interface ITessAPI.TessPolyBlockType
    Possible types for a POLY_BLOCK or ColPartition.
static interface ITessAPI.TessProgressFunc
static class ITessAPI.TessResultIterator
    Iterator for tesseract results that is capable of iterating in proper reading order over Bi Directional (e.g.
static class ITessAPI.TessResultRenderer
    Interface for rendering tesseract results into a document, such as text, HOCR or pdf.
static interface ITessAPI.TessTextlineOrder
    The text lines are read in the given sequence. In English, the order is top-to-bottom.
static interface ITessAPI.TessWritingDirection
    The grapheme clusters within a line of text are laid out logically in this direction, judged when looking at the text line rotated so that its Orientation is "page up". For English text, the writing direction is left-to-right.
static class ITessAPI.TimeVal
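The EANYCODE_CHAR description says that a non-ASCII character arrives as several structures, each carrying one byte of its UTF-8 encoding. The following is a minimal sketch of how a caller might regroup those bytes into characters. It does not use Tess4J: the class Utf8Reassembler and its methods are hypothetical names, and the sketch assumes the caller has already copied the char_code value of each structure, in order, into a byte array.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Tess4J.
final class Utf8Reassembler {

    /** Returns how many bytes the UTF-8 sequence starting with this lead byte occupies. */
    static int sequenceLength(byte lead) {
        int b = lead & 0xFF;
        if (b < 0x80) return 1;            // 0xxxxxxx: plain ASCII
        if ((b & 0xE0) == 0xC0) return 2;  // 110xxxxx
        if ((b & 0xF0) == 0xE0) return 3;  // 1110xxxx
        if ((b & 0xF8) == 0xF0) return 4;  // 11110xxx
        throw new IllegalArgumentException(
                "Not a UTF-8 lead byte: 0x" + Integer.toHexString(b));
    }

    /**
     * Groups single-byte char codes (assumed to be one per structure, in order)
     * into one string per decoded character.
     */
    static List<String> reassemble(byte[] charCodes) {
        List<String> characters = new ArrayList<>();
        int i = 0;
        while (i < charCodes.length) {
            int len = sequenceLength(charCodes[i]);
            if (i + len > charCodes.length) {
                throw new IllegalArgumentException(
                        "Truncated UTF-8 sequence at index " + i);
            }
            characters.add(new String(charCodes, i, len, StandardCharsets.UTF_8));
            i += len;
        }
        return characters;
    }

    public static void main(String[] args) {
        // 'A' (one byte) followed by a bullet, U+2022, which is E2 80 A2 in UTF-8.
        byte[] codes = {0x41, (byte) 0xE2, (byte) 0x80, (byte) 0xA2};
        System.out.println(reassemble(codes).size() + " characters decoded");
    }
}
```

Because every byte of a multi-byte character is reported with the same bounding box, a caller could keep the box of the lead byte for the regrouped character.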
Field Detail
TRUE
static final int TRUE
- See Also:
- Constant Field Values
-
FALSE
static final int FALSE
- See Also:
- Constant Field Values
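TRUE and FALSE are int constants, presumably because the underlying C API passes booleans as integers. The sketch below shows one way to convert between Java boolean values and such flags. It does not reference Tess4J: NativeBool is a hypothetical class, and the values 1 and 0 are an assumption that follows the usual C convention, so check Constant Field Values for the actual ones.

```java
// Hypothetical helper, not part of Tess4J.
final class NativeBool {
    // Assumed values following the C convention; the real values are
    // listed under Constant Field Values.
    static final int TRUE = 1;
    static final int FALSE = 0;

    /** Converts a Java boolean into a C-style integer flag. */
    static int toNative(boolean value) {
        return value ? TRUE : FALSE;
    }

    /** Treats any value other than FALSE as true, as C does. */
    static boolean fromNative(int value) {
        return value != FALSE;
    }

    public static void main(String[] args) {
        System.out.println(toNative(true));
        System.out.println(fromNative(FALSE));
    }
}
```

Comparing against FALSE rather than TRUE in fromNative keeps the helper tolerant of native code that returns a nonzero value other than 1.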