net.sourceforge.tess4j (Tess4J - Tesseract for Java 5.13.0 API)

package net.sourceforge.tess4j

Related Packages

Package

Description

net.sourceforge.tess4j.util

Class

Description

ITessAPI

An interface representing common TessAPI classes and constants.

ITessAPI.CANCEL_FUNC

Callback for cancel_func.

ITessAPI.EANYCODE_CHAR

Note that from version 2.0 onward the format of char_code is UTF-8. ASCII characters are returned as a single structure, while other characters are returned as two or more instances of this structure, each holding a single byte of the UTF-8 code and all sharing the same bounding box.

Programs that handle languages with different character sets need to handle extended characters appropriately, and all code needs to be prepared to receive UTF-8 encoded characters such as bullets and fancy quotes.
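As a rough illustration of the byte-per-structure behaviour described above, the sketch below regroups single-byte entries into whole characters. It does not use the Tess4J types: `CharEntry`, its fields, and `assemble` are hypothetical stand-ins invented for this example, and it assumes that consecutive entries with identical bounding boxes always belong to the same character.

```java
import java.io.ByteArrayOutputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class Utf8CharAssembler {

    /** Hypothetical stand-in for one per-byte entry: a UTF-8 byte plus its bounding box. */
    static final class CharEntry {
        final byte code;
        final int left, right, top, bottom;

        CharEntry(byte code, int left, int right, int top, int bottom) {
            this.code = code;
            this.left = left;
            this.right = right;
            this.top = top;
            this.bottom = bottom;
        }

        boolean sameBox(CharEntry o) {
            return left == o.left && right == o.right && top == o.top && bottom == o.bottom;
        }
    }

    /** Groups consecutive entries that share a bounding box and decodes each group as UTF-8. */
    static List<String> assemble(List<CharEntry> entries) {
        List<String> chars = new ArrayList<>();
        ByteArrayOutputStream pending = new ByteArrayOutputStream();
        CharEntry previous = null;
        for (CharEntry e : entries) {
            // A change of bounding box marks the start of the next character.
            if (previous != null && !e.sameBox(previous)) {
                chars.add(new String(pending.toByteArray(), StandardCharsets.UTF_8));
                pending.reset();
            }
            pending.write(e.code);
            previous = e;
        }
        if (previous != null) {
            chars.add(new String(pending.toByteArray(), StandardCharsets.UTF_8));
        }
        return chars;
    }

    public static void main(String[] args) {
        List<CharEntry> entries = Arrays.asList(
                new CharEntry((byte) 0x41, 0, 10, 0, 10),   // 'A', a single byte
                new CharEntry((byte) 0xE2, 12, 20, 0, 10),  // bullet U+2022, byte 1 of 3
                new CharEntry((byte) 0x80, 12, 20, 0, 10),  // bullet U+2022, byte 2 of 3
                new CharEntry((byte) 0xA2, 12, 20, 0, 10)); // bullet U+2022, byte 3 of 3
        System.out.println(assemble(entries));
    }
}
```

Grouping by bounding box follows the description above. If two distinct characters could ever share a box, checking for UTF-8 continuation bytes (`(b & 0xC0) == 0x80`) would be a more robust boundary test.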

ITessAPI.ETEXT_DESC

Description of the output of the OCR engine.

ITessAPI.TessBaseAPI

Base class for all tesseract APIs.

ITessAPI.TessCancelFunc

ITessAPI.TessChoiceIterator

ITessAPI.TessMutableIterator

MutableIterator adds access to internal data structures.

ITessAPI.TessOcrEngineMode

When Tesseract/Cube is initialized we can choose to instantiate/load/run only the Tesseract part, only the Cube part or both along with the combiner.

ITessAPI.TessOrientation

[ASCII-art diagram of a sample page layout with numbered text regions; truncated in this summary.]

ITessAPI.TessPageIterator

Class to iterate over tesseract page structure, providing access to all levels of the page hierarchy, without including any tesseract headers or having to handle any tesseract structures.
WARNING!

ITessAPI.TessPageIteratorLevel

Enum of the elements of the page hierarchy, used in ResultIterator to provide functions that operate on each level without having to have 5x as many functions.
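The design idea, a single method parameterized by level instead of one method per level, can be sketched without the library. The toy below is only an illustration: the `Level` enum, the `elements` method, and the way plain text is split into blocks, paragraphs, lines, words, and symbols are all assumptions made for this example, not the Tess4J or Tesseract implementation.

```java
import java.util.ArrayList;
import java.util.List;

public class PageLevelDemo {

    /** Hypothetical stand-in for the five levels of the page hierarchy. */
    enum Level { BLOCK, PARA, TEXTLINE, WORD, SYMBOL }

    /** One method serves every level, rather than five near-identical methods. */
    static List<String> elements(String page, Level level) {
        List<String> out = new ArrayList<>();
        switch (level) {
            case BLOCK:
                // Toy assumption: the whole page is a single block.
                addNonEmpty(out, new String[] { page });
                break;
            case PARA:
                // Toy assumption: paragraphs are separated by blank lines.
                addNonEmpty(out, page.split("\\n\\s*\\n"));
                break;
            case TEXTLINE:
                addNonEmpty(out, page.split("\\n"));
                break;
            case WORD:
                addNonEmpty(out, page.split("\\s+"));
                break;
            case SYMBOL:
                // Every non-whitespace code point counts as one symbol.
                page.codePoints()
                        .filter(cp -> !Character.isWhitespace(cp))
                        .forEach(cp -> out.add(new String(Character.toChars(cp))));
                break;
        }
        return out;
    }

    private static void addNonEmpty(List<String> out, String[] parts) {
        for (String part : parts) {
            String trimmed = part.trim();
            if (!trimmed.isEmpty()) {
                out.add(trimmed);
            }
        }
    }

    public static void main(String[] args) {
        String page = "Hello world\nsecond line\n\nNew para";
        for (Level level : Level.values()) {
            System.out.println(level + ": " + elements(page, level).size() + " element(s)");
        }
    }
}
```

Callers pick the granularity with an argument, so adding an operation means adding one method, not one per level.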

ITessAPI.TessPageSegMode

Possible modes for page layout analysis.

ITessAPI.TessParagraphJustification

NOTA BENE: Fully justified paragraphs (text aligned to both left and right margins) are marked by Tesseract with JUSTIFICATION_LEFT if their text is written with a left-to-right script and with JUSTIFICATION_RIGHT if their text is written in a right-to-left script.

Interpretation for text read in vertical lines: "Left" is wherever the starting reading position is.

ITessAPI.TessPolyBlockType

Possible types for a POLY_BLOCK or ColPartition.

ITessAPI.TessProgressFunc

ITessAPI.TessResultIterator

Iterator for tesseract results that is capable of iterating in proper reading order over bidirectional (e.g. mixed Hebrew and English) text.

ITessAPI.TessResultRenderer

Interface for rendering tesseract results into a document, such as text, hOCR, or PDF.

ITessAPI.TessTextlineOrder

The text lines are read in the given sequence.

In English, the order is top-to-bottom.

ITessAPI.TessWritingDirection

The grapheme clusters within a line of text are laid out logically in this direction, judged when looking at the text line rotated so that its Orientation is "page up".

For English text, the writing direction is left-to-right.

ITessAPI.TimeVal

ITesseract

An interface representing common OCR methods.

ITesseract.RenderedFormat

Rendered formats supported by Tesseract.

OCRResult

Encapsulates Tesseract OCR results at file level.

OSDResult

Encapsulates Tesseract Orientation and Script Detection (OSD) results.

TessAPI

A Java wrapper for Tesseract OCR 5.4.1 API using JNA Interface Mapping.

TessAPI1

A Java wrapper for Tesseract OCR 5.4.1 API using JNA Direct Mapping.

Tesseract

An object layer on top of TessAPI that provides character recognition support for common image formats and multi-page TIFF images, beyond the uncompressed, binary TIFF format supported by the Tesseract OCR engine.

Tesseract1

An object layer on top of TessAPI1 that provides character recognition support for common image formats and multi-page TIFF images, beyond the uncompressed, binary TIFF format supported by the Tesseract OCR engine.

TesseractException

Word

Encapsulates Tesseract OCR results at a given page iterator level.
