ITessAPI.CANCEL_FUNC, ITessAPI.EANYCODE_CHAR, ITessAPI.ETEXT_DESC, ITessAPI.TessBaseAPI, ITessAPI.TessChoiceIterator, ITessAPI.TessMutableIterator, ITessAPI.TessOcrEngineMode, ITessAPI.TessOrientation, ITessAPI.TessPageIterator, ITessAPI.TessPageIteratorLevel, ITessAPI.TessPageSegMode, ITessAPI.TessParagraphJustification, ITessAPI.TessPolyBlockType, ITessAPI.TessResultIterator, ITessAPI.TessResultRenderer, ITessAPI.TessTextlineOrder, ITessAPI.TessWritingDirection, ITessAPI.TimeVal
Modifier and Type | Field and Description |
---|---|
static TessAPI |
INSTANCE
An instance of the class library.
|
Modifier and Type | Method and Description |
---|---|
int |
TessBaseAPIAdaptToWordStr(ITessAPI.TessBaseAPI handle,
int mode,
String wordstr)
Applies the given word to the adaptive classifier if possible.
|
com.sun.jna.ptr.IntByReference |
TessBaseAPIAllWordConfidences(ITessAPI.TessBaseAPI handle)
Returns an array of all word confidences, terminated by -1.
|
ITessAPI.TessPageIterator |
TessBaseAPIAnalyseLayout(ITessAPI.TessBaseAPI handle)
Runs page layout analysis in the mode set by SetPageSegMode. |
void |
TessBaseAPIClear(ITessAPI.TessBaseAPI handle)
Free up recognition results and any stored image data, without actually
freeing any recognition data that would be time-consuming to reload.
|
void |
TessBaseAPIClearAdaptiveClassifier(ITessAPI.TessBaseAPI handle)
Call between pages or documents, etc., to free up memory and forget
adaptive data.
|
void |
TessBaseAPIClearPersistentCache(ITessAPI.TessBaseAPI handle)
Clear any library-level memory caches.
|
ITessAPI.TessBaseAPI |
TessBaseAPICreate()
Creates an instance of the base class for all Tesseract APIs.
|
void |
TessBaseAPIDelete(ITessAPI.TessBaseAPI handle)
Disposes the TesseractAPI instance.
|
int |
TessBaseAPIDetectOrientationScript(ITessAPI.TessBaseAPI handle,
IntBuffer orient_deg,
FloatBuffer orient_conf,
com.sun.jna.ptr.PointerByReference script_name,
FloatBuffer script_conf)
Detect the orientation of the input image and apparent script (alphabet).
|
void |
TessBaseAPIDumpPGM(ITessAPI.TessBaseAPI handle,
String filename)
Dump the internal binary image to a PGM file.
|
void |
TessBaseAPIEnd(ITessAPI.TessBaseAPI handle)
Close down tesseract and free up all memory.
|
com.sun.jna.ptr.PointerByReference |
TessBaseAPIGetAvailableLanguagesAsVector(ITessAPI.TessBaseAPI handle)
Returns the available languages in the vector of STRINGs.
|
int |
TessBaseAPIGetBoolVariable(ITessAPI.TessBaseAPI handle,
String name,
IntBuffer value)
Get the value of an internal bool parameter.
|
com.sun.jna.Pointer |
TessBaseAPIGetBoxText(ITessAPI.TessBaseAPI handle,
int page_number)
The recognized text is returned as a char* which is coded as a UTF8 box
file and must be freed with the delete [] operator.
|
net.sourceforge.lept4j.Boxa |
TessBaseAPIGetComponentImages(ITessAPI.TessBaseAPI handle,
int level,
int text_only,
com.sun.jna.ptr.PointerByReference pixa,
com.sun.jna.ptr.PointerByReference blockids)
Get the given level kind of components (block, textline, word etc.) as a
Leptonica-style Boxa, Pixa pair, in reading order. |
net.sourceforge.lept4j.Boxa |
TessBaseAPIGetComponentImages1(ITessAPI.TessBaseAPI handle,
int level,
int text_only,
int raw_image,
int raw_padding,
com.sun.jna.ptr.PointerByReference pixa,
com.sun.jna.ptr.PointerByReference blockids,
com.sun.jna.ptr.PointerByReference paraids)
Get the given level kind of components (block, textline, word etc.) as a
Leptonica-style Boxa, Pixa pair, in reading order. |
net.sourceforge.lept4j.Boxa |
TessBaseAPIGetConnectedComponents(ITessAPI.TessBaseAPI handle,
com.sun.jna.ptr.PointerByReference cc)
Gets the individual connected (text) components (created after the page
segmentation step, but before recognition) as a Leptonica-style
Boxa, Pixa pair, in reading order. |
String |
TessBaseAPIGetDatapath(ITessAPI.TessBaseAPI handle) |
int |
TessBaseAPIGetDoubleVariable(ITessAPI.TessBaseAPI handle,
String name,
DoubleBuffer value)
Get the value of an internal double parameter.
|
com.sun.jna.Pointer |
TessBaseAPIGetHOCRText(ITessAPI.TessBaseAPI handle,
int page_number)
Make an HTML-formatted string with hOCR markup from the internal data
structures.
|
String |
TessBaseAPIGetInitLanguagesAsString(ITessAPI.TessBaseAPI handle)
Returns the languages string used in the last valid initialization.
|
net.sourceforge.lept4j.Pix |
TessBaseAPIGetInputImage(ITessAPI.TessBaseAPI handle) |
String |
TessBaseAPIGetInputName(ITessAPI.TessBaseAPI handle)
These functions are required for searchable PDF output.
|
int |
TessBaseAPIGetIntVariable(ITessAPI.TessBaseAPI handle,
String name,
IntBuffer value)
Get the value of an internal int parameter.
|
ITessAPI.TessResultIterator |
TessBaseAPIGetIterator(ITessAPI.TessBaseAPI handle)
Get a reading-order iterator to the results of LayoutAnalysis and/or
Recognize. |
com.sun.jna.ptr.PointerByReference |
TessBaseAPIGetLoadedLanguagesAsVector(ITessAPI.TessBaseAPI handle)
Returns the loaded languages in the vector of STRINGs.
|
ITessAPI.TessMutableIterator |
TessBaseAPIGetMutableIterator(ITessAPI.TessBaseAPI handle)
Get a mutable iterator to the results of LayoutAnalysis and/or
Recognize. |
int |
TessBaseAPIGetPageSegMode(ITessAPI.TessBaseAPI handle)
Return the current page segmentation mode.
|
net.sourceforge.lept4j.Boxa |
TessBaseAPIGetRegions(ITessAPI.TessBaseAPI handle,
com.sun.jna.ptr.PointerByReference pixa)
Get the result of page layout analysis as a Leptonica-style
Boxa, Pixa pair, in reading order. |
int |
TessBaseAPIGetSourceYResolution(ITessAPI.TessBaseAPI handle) |
String |
TessBaseAPIGetStringVariable(ITessAPI.TessBaseAPI handle,
String name)
Get the value of an internal string parameter.
|
net.sourceforge.lept4j.Boxa |
TessBaseAPIGetStrips(ITessAPI.TessBaseAPI handle,
com.sun.jna.ptr.PointerByReference pixa,
com.sun.jna.ptr.PointerByReference blockids)
Get textlines and strips of image regions as a Leptonica-style
Boxa, Pixa pair, in reading order. |
int |
TessBaseAPIGetTextDirection(ITessAPI.TessBaseAPI handle,
IntBuffer out_offset,
FloatBuffer out_slope)
Gets text direction.
|
net.sourceforge.lept4j.Boxa |
TessBaseAPIGetTextlines(ITessAPI.TessBaseAPI handle,
com.sun.jna.ptr.PointerByReference pixa,
com.sun.jna.ptr.PointerByReference blockids)
Get the textlines as a Leptonica-style Boxa, Pixa pair, in reading
order. |
net.sourceforge.lept4j.Boxa |
TessBaseAPIGetTextlines1(ITessAPI.TessBaseAPI handle,
int raw_image,
int raw_padding,
com.sun.jna.ptr.PointerByReference pixa,
com.sun.jna.ptr.PointerByReference blockids,
com.sun.jna.ptr.PointerByReference paraids)
Get the textlines as a Leptonica-style Boxa, Pixa pair, in reading
order. |
net.sourceforge.lept4j.Pix |
TessBaseAPIGetThresholdedImage(ITessAPI.TessBaseAPI handle)
ONLY available after SetImage if you have Leptonica installed. |
int |
TessBaseAPIGetThresholdedImageScaleFactor(ITessAPI.TessBaseAPI handle) |
String |
TessBaseAPIGetUnichar(ITessAPI.TessBaseAPI handle,
int unichar_id)
Gets the string of the specified unichar.
|
com.sun.jna.Pointer |
TessBaseAPIGetUNLVText(ITessAPI.TessBaseAPI handle)
The recognized text is returned as a char* which is coded as UNLV format
Latin-1 with specific reject and suspect codes and must be freed with the
delete [] operator.
|
com.sun.jna.Pointer |
TessBaseAPIGetUTF8Text(ITessAPI.TessBaseAPI handle)
The recognized text is returned as a char* which is coded as UTF-8 and
must be freed with the delete [] operator.
|
net.sourceforge.lept4j.Boxa |
TessBaseAPIGetWords(ITessAPI.TessBaseAPI handle,
com.sun.jna.ptr.PointerByReference pixa)
Get the words as a Leptonica-style Boxa, Pixa pair, in reading order. |
int |
TessBaseAPIInit1(ITessAPI.TessBaseAPI handle,
String datapath,
String language,
int oem,
com.sun.jna.ptr.PointerByReference configs,
int configs_size)
Instances are now mostly thread-safe and totally independent, but some
global parameters remain.
|
int |
TessBaseAPIInit2(ITessAPI.TessBaseAPI handle,
String datapath,
String language,
int oem) |
int |
TessBaseAPIInit3(ITessAPI.TessBaseAPI handle,
String datapath,
String language) |
int |
TessBaseAPIInit4(ITessAPI.TessBaseAPI handle,
String datapath,
String language,
int oem,
com.sun.jna.ptr.PointerByReference configs,
int configs_size,
com.sun.jna.ptr.PointerByReference vars_vec,
com.sun.jna.ptr.PointerByReference vars_values,
com.ochafik.lang.jnaerator.runtime.NativeSize vars_vec_size,
int set_only_non_debug_params) |
void |
TessBaseAPIInitForAnalysePage(ITessAPI.TessBaseAPI handle)
Init only for page layout analysis.
|
int |
TessBaseAPIInitLangMod(ITessAPI.TessBaseAPI handle,
String datapath,
String language)
Init only the lang model component of Tesseract.
|
int |
TessBaseAPIIsValidWord(ITessAPI.TessBaseAPI handle,
String word)
Check whether a word is valid according to Tesseract's language model.
|
int |
TessBaseAPIMeanTextConf(ITessAPI.TessBaseAPI handle)
Returns the average word confidence for Tesseract page result.
|
void |
TessBaseAPIPrintVariablesToFile(ITessAPI.TessBaseAPI handle,
String filename)
Print Tesseract parameters to the given file.
Note: Must not be the first method called after instance create. |
int |
TessBaseAPIProcessPage(ITessAPI.TessBaseAPI handle,
net.sourceforge.lept4j.Pix pix,
int page_index,
String filename,
String retry_config,
int timeout_millisec,
ITessAPI.TessResultRenderer renderer) |
int |
TessBaseAPIProcessPages(ITessAPI.TessBaseAPI handle,
String filename,
String retry_config,
int timeout_millisec,
ITessAPI.TessResultRenderer renderer)
Recognizes all the pages in the named file, as a multi-page tiff or list
of filenames, or single image, and gets the appropriate kind of text
according to parameters: tessedit_create_boxfile,
tessedit_make_boxes_from_boxes, tessedit_write_unlv,
tessedit_create_hocr. |
void |
TessBaseAPIReadConfigFile(ITessAPI.TessBaseAPI handle,
String filename,
int init_only)
Read a "config" file containing a set of param, value pairs.
|
int |
TessBaseAPIRecognize(ITessAPI.TessBaseAPI handle,
ITessAPI.ETEXT_DESC monitor)
Recognize the image from SetAndThresholdImage, generating Tesseract
internal structures. |
int |
TessBaseAPIRecognizeForChopTest(ITessAPI.TessBaseAPI handle,
ITessAPI.ETEXT_DESC monitor)
Variant on Recognize used for testing chopper. |
com.sun.jna.Pointer |
TessBaseAPIRect(ITessAPI.TessBaseAPI handle,
ByteBuffer imagedata,
int bytes_per_pixel,
int bytes_per_line,
int left,
int top,
int width,
int height)
Recognize a rectangle from an image and return the result as a string.
|
void |
TessBaseAPISetImage(ITessAPI.TessBaseAPI handle,
ByteBuffer imagedata,
int width,
int height,
int bytes_per_pixel,
int bytes_per_line)
Provide an image for Tesseract to recognize.
|
void |
TessBaseAPISetImage2(ITessAPI.TessBaseAPI handle,
net.sourceforge.lept4j.Pix pix)
Provide an image for Tesseract to recognize.
|
void |
TessBaseAPISetInputImage(ITessAPI.TessBaseAPI handle,
net.sourceforge.lept4j.Pix pix) |
void |
TessBaseAPISetInputName(ITessAPI.TessBaseAPI handle,
String name)
Set the name of the input file.
|
void |
TessBaseAPISetOutputName(ITessAPI.TessBaseAPI handle,
String name)
Set the name of the bonus output files.
|
void |
TessBaseAPISetPageSegMode(ITessAPI.TessBaseAPI handle,
int mode)
Set the current page segmentation mode.
|
void |
TessBaseAPISetRectangle(ITessAPI.TessBaseAPI handle,
int left,
int top,
int width,
int height)
Restrict recognition to a sub-rectangle of the image.
|
void |
TessBaseAPISetSourceResolution(ITessAPI.TessBaseAPI handle,
int ppi)
Set the resolution of the source image in pixels per inch so font size
information can be calculated in results.
|
int |
TessBaseAPISetVariable(ITessAPI.TessBaseAPI handle,
String name,
String value)
Set the value of an internal "parameter." Supply the name of the
parameter and the value as a string, just as you would in a config file.
|
ITessAPI.TessResultRenderer |
TessBoxTextRendererCreate(String outputbase) |
float |
TessChoiceIteratorConfidence(ITessAPI.TessChoiceIterator handle) |
void |
TessChoiceIteratorDelete(ITessAPI.TessChoiceIterator handle) |
String |
TessChoiceIteratorGetUTF8Text(ITessAPI.TessChoiceIterator handle) |
int |
TessChoiceIteratorNext(ITessAPI.TessChoiceIterator handle) |
void |
TessDeleteIntArray(IntBuffer arr)
Deallocates the memory block occupied by integer array.
|
void |
TessDeleteResultRenderer(ITessAPI.TessResultRenderer renderer) |
void |
TessDeleteText(com.sun.jna.Pointer text)
Deallocates the memory block occupied by text.
|
void |
TessDeleteTextArray(com.sun.jna.ptr.PointerByReference arr)
Deallocates the memory block occupied by text array.
|
ITessAPI.TessResultRenderer |
TessHOcrRendererCreate(String outputbase) |
int |
TessPageIteratorBaseline(ITessAPI.TessPageIterator handle,
int level,
IntBuffer x1,
IntBuffer y1,
IntBuffer x2,
IntBuffer y2)
Returns the baseline of the current object at the given level.
|
void |
TessPageIteratorBegin(ITessAPI.TessPageIterator handle)
Resets the iterator to point to the start of the page.
|
int |
TessPageIteratorBlockType(ITessAPI.TessPageIterator handle)
Returns the type of the current block.
|
int |
TessPageIteratorBoundingBox(ITessAPI.TessPageIterator handle,
int level,
IntBuffer left,
IntBuffer top,
IntBuffer right,
IntBuffer bottom)
Returns the bounding rectangle of the current object at the given level
in coordinates of the original image.
|
ITessAPI.TessPageIterator |
TessPageIteratorCopy(ITessAPI.TessPageIterator handle)
Creates a copy of the specified PageIterator instance.
|
void |
TessPageIteratorDelete(ITessAPI.TessPageIterator handle)
Deletes the specified PageIterator instance.
|
net.sourceforge.lept4j.Pix |
TessPageIteratorGetBinaryImage(ITessAPI.TessPageIterator handle,
int level)
Returns a binary image of the current object at the given level.
|
net.sourceforge.lept4j.Pix |
TessPageIteratorGetImage(ITessAPI.TessPageIterator handle,
int level,
int padding,
net.sourceforge.lept4j.Pix original_image,
IntBuffer left,
IntBuffer top)
Returns an image of the current object at the given level in greyscale if
available in the input.
|
int |
TessPageIteratorIsAtBeginningOf(ITessAPI.TessPageIterator handle,
int level)
Returns TRUE if the iterator is at the start of an object at the given
level.
|
int |
TessPageIteratorIsAtFinalElement(ITessAPI.TessPageIterator handle,
int level,
int element)
Returns whether the iterator is positioned at the last element in a given
level.
|
int |
TessPageIteratorNext(ITessAPI.TessPageIterator handle,
int level)
Moves to the start of the next object at the given level in the page
hierarchy, and returns false if the end of the page was reached.
|
void |
TessPageIteratorOrientation(ITessAPI.TessPageIterator handle,
IntBuffer orientation,
IntBuffer writing_direction,
IntBuffer textline_order,
FloatBuffer deskew_angle)
Returns the orientation.
|
void |
TessPageIteratorParagraphInfo(ITessAPI.TessPageIterator handle,
IntBuffer justification,
IntBuffer is_list_item,
IntBuffer is_crown,
IntBuffer first_line_indent)
Gets paragraph information.
|
ITessAPI.TessResultRenderer |
TessPDFRendererCreate(String outputbase,
String datadir) |
float |
TessResultIteratorConfidence(ITessAPI.TessResultIterator handle,
int level)
Returns the mean confidence of the current object at the given level.
|
ITessAPI.TessResultIterator |
TessResultIteratorCopy(ITessAPI.TessResultIterator handle)
Creates a copy of the specified ResultIterator instance.
|
void |
TessResultIteratorDelete(ITessAPI.TessResultIterator handle)
Deletes the specified ResultIterator handle.
|
ITessAPI.TessChoiceIterator |
TessResultIteratorGetChoiceIterator(ITessAPI.TessResultIterator handle) |
ITessAPI.TessPageIterator |
TessResultIteratorGetPageIterator(ITessAPI.TessResultIterator handle)
Gets the PageIterator of the specified ResultIterator instance.
|
ITessAPI.TessPageIterator |
TessResultIteratorGetPageIteratorConst(ITessAPI.TessResultIterator handle)
Gets the PageIterator of the specified ResultIterator instance.
|
com.sun.jna.Pointer |
TessResultIteratorGetUTF8Text(ITessAPI.TessResultIterator handle,
int level)
Returns the null terminated UTF-8 encoded text string for the current
object at the given level.
|
int |
TessResultIteratorNext(ITessAPI.TessResultIterator handle,
int level) |
int |
TessResultIteratorSymbolIsDropcap(ITessAPI.TessResultIterator handle)
Returns TRUE if the current symbol is a dropcap.
|
int |
TessResultIteratorSymbolIsSubscript(ITessAPI.TessResultIterator handle)
Returns TRUE if the current symbol is a subscript.
|
int |
TessResultIteratorSymbolIsSuperscript(ITessAPI.TessResultIterator handle)
Returns TRUE if the current symbol is a superscript.
|
String |
TessResultIteratorWordFontAttributes(ITessAPI.TessResultIterator handle,
IntBuffer is_bold,
IntBuffer is_italic,
IntBuffer is_underlined,
IntBuffer is_monospace,
IntBuffer is_serif,
IntBuffer is_smallcaps,
IntBuffer pointsize,
IntBuffer font_id)
Returns the font attributes of the current word.
|
int |
TessResultIteratorWordIsFromDictionary(ITessAPI.TessResultIterator handle)
Returns TRUE if the current word was found in a dictionary.
|
int |
TessResultIteratorWordIsNumeric(ITessAPI.TessResultIterator handle)
Returns TRUE if the current word is numeric.
|
String |
TessResultIteratorWordRecognitionLanguage(ITessAPI.TessResultIterator handle) |
int |
TessResultRendererAddImage(ITessAPI.TessResultRenderer renderer,
com.sun.jna.ptr.PointerByReference api) |
int |
TessResultRendererBeginDocument(ITessAPI.TessResultRenderer renderer,
String title) |
int |
TessResultRendererEndDocument(ITessAPI.TessResultRenderer renderer) |
com.sun.jna.Pointer |
TessResultRendererExtention(ITessAPI.TessResultRenderer renderer) |
int |
TessResultRendererImageNum(ITessAPI.TessResultRenderer renderer) |
void |
TessResultRendererInsert(ITessAPI.TessResultRenderer renderer,
ITessAPI.TessResultRenderer next) |
ITessAPI.TessResultRenderer |
TessResultRendererNext(ITessAPI.TessResultRenderer renderer) |
com.sun.jna.Pointer |
TessResultRendererTitle(ITessAPI.TessResultRenderer renderer) |
ITessAPI.TessResultRenderer |
TessTextRendererCreate(String outputbase) |
ITessAPI.TessResultRenderer |
TessUnlvRendererCreate(String outputbase) |
String |
TessVersion()
Gets the version identifier.
|
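The summary above notes that TessBaseAPIAllWordConfidences returns an array terminated by -1. The following is a minimal sketch of reading such a sentinel-terminated sequence. It is an illustration only: the class WordConfidences and the method untilSentinel are hypothetical names, not part of TessAPI, and the sketch works on a plain IntBuffer rather than on the com.sun.jna.ptr.IntByReference that the binding actually returns.

```java
import java.nio.IntBuffer;
import java.util.ArrayList;
import java.util.List;

/** Hypothetical helper; not part of TessAPI. */
final class WordConfidences {
    /** Collects values up to, but not including, the first -1 sentinel. */
    static List<Integer> untilSentinel(IntBuffer values) {
        List<Integer> out = new ArrayList<>();
        IntBuffer view = values.duplicate(); // leave the caller's position untouched
        while (view.hasRemaining()) {
            int v = view.get();
            if (v == -1) {
                break; // the documented terminator
            }
            out.add(v);
        }
        return out;
    }

    public static void main(String[] args) {
        // Sample data standing in for native memory; the trailing 0 must be ignored.
        IntBuffer sample = IntBuffer.wrap(new int[] {93, 87, 64, -1, 0});
        System.out.println(untilSentinel(sample));
    }
}
```

With the real binding, the same loop would presumably read successive ints through the returned reference's pointer, and the native array would then be released with TessDeleteIntArray, which the summary describes as deallocating an integer array.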
static final TessAPI INSTANCE
String TessVersion()
void TessDeleteText(com.sun.jna.Pointer text)
text - the pointer to text
void TessDeleteTextArray(com.sun.jna.ptr.PointerByReference arr)
arr - text array pointer reference
void TessDeleteIntArray(IntBuffer arr)
arr - int array
ITessAPI.TessResultRenderer TessTextRendererCreate(String outputbase)
ITessAPI.TessResultRenderer TessHOcrRendererCreate(String outputbase)
ITessAPI.TessResultRenderer TessPDFRendererCreate(String outputbase, String datadir)
ITessAPI.TessResultRenderer TessUnlvRendererCreate(String outputbase)
ITessAPI.TessResultRenderer TessBoxTextRendererCreate(String outputbase)
void TessDeleteResultRenderer(ITessAPI.TessResultRenderer renderer)
void TessResultRendererInsert(ITessAPI.TessResultRenderer renderer, ITessAPI.TessResultRenderer next)
ITessAPI.TessResultRenderer TessResultRendererNext(ITessAPI.TessResultRenderer renderer)
int TessResultRendererBeginDocument(ITessAPI.TessResultRenderer renderer, String title)
int TessResultRendererAddImage(ITessAPI.TessResultRenderer renderer, com.sun.jna.ptr.PointerByReference api)
int TessResultRendererEndDocument(ITessAPI.TessResultRenderer renderer)
com.sun.jna.Pointer TessResultRendererExtention(ITessAPI.TessResultRenderer renderer)
com.sun.jna.Pointer TessResultRendererTitle(ITessAPI.TessResultRenderer renderer)
int TessResultRendererImageNum(ITessAPI.TessResultRenderer renderer)
ITessAPI.TessBaseAPI TessBaseAPICreate()
void TessBaseAPIDelete(ITessAPI.TessBaseAPI handle)
handle - the TesseractAPI instance
void TessBaseAPISetInputName(ITessAPI.TessBaseAPI handle, String name)
handle - the TesseractAPI instance
name - name of the input file
String TessBaseAPIGetInputName(ITessAPI.TessBaseAPI handle)
handle - the TesseractAPI instance
void TessBaseAPISetInputImage(ITessAPI.TessBaseAPI handle, net.sourceforge.lept4j.Pix pix)
net.sourceforge.lept4j.Pix TessBaseAPIGetInputImage(ITessAPI.TessBaseAPI handle)
int TessBaseAPIGetSourceYResolution(ITessAPI.TessBaseAPI handle)
String TessBaseAPIGetDatapath(ITessAPI.TessBaseAPI handle)
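The datapath reported here is the one handed to the Init methods, whose parameter documentation states that it must end in / and that any name after the last / is stripped. The sketch below illustrates why a missing trailing slash matters under that rule. The class Datapath and both methods are hypothetical helpers, not part of TessAPI, and the handling of a path containing no / at all is an assumption, since this reference does not specify it.

```java
/** Hypothetical helper illustrating the documented datapath rule; not part of TessAPI. */
final class Datapath {
    /** Drops any name after the last '/'. Input without '/' is returned unchanged (assumption). */
    static String stripAfterLastSlash(String datapath) {
        int i = datapath.lastIndexOf('/');
        return i < 0 ? datapath : datapath.substring(0, i + 1);
    }

    /** Appends '/' when missing so the final directory name survives the stripping. */
    static String withTrailingSlash(String datapath) {
        return datapath.endsWith("/") ? datapath : datapath + "/";
    }

    public static void main(String[] args) {
        String raw = "/usr/share/tesseract"; // hypothetical parent directory of tessdata
        System.out.println(stripAfterLastSlash(raw));                    // last name is lost
        System.out.println(stripAfterLastSlash(withTrailingSlash(raw))); // last name is kept
    }
}
```

Normalizing the path before calling an Init method keeps the intended parent directory intact. Whether a given Tesseract version applies exactly this stripping is not confirmed by this reference.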
void TessBaseAPISetOutputName(ITessAPI.TessBaseAPI handle, String name)
handle - the TesseractAPI instance
name - name of the output file
int TessBaseAPISetVariable(ITessAPI.TessBaseAPI handle, String name, String value)
Set the value of an internal "parameter." Supply the name of the
parameter and the value as a string, just as you would in a config file.
E.g., SetVariable("tessedit_char_blacklist", "xyz"); to ignore x, y and
z. Or SetVariable("classify_bln_numeric_mode", "1"); to set numeric-only
mode. SetVariable may be used before Init, but settings will revert to
defaults on End().
Note: Must be called after Init(). Only works for non-init variables
(init variables should be passed to Init()).
handle - the TesseractAPI instance
name - name of the variable
value - variable value
int TessBaseAPIGetIntVariable(ITessAPI.TessBaseAPI handle, String name, IntBuffer value)
handle - the TesseractAPI instance
name - name of the variable
value - int buffer that receives the value
int TessBaseAPIGetBoolVariable(ITessAPI.TessBaseAPI handle, String name, IntBuffer value)
handle - the TesseractAPI instance
name - name of the variable
value - int buffer that receives the value
int TessBaseAPIGetDoubleVariable(ITessAPI.TessBaseAPI handle, String name, DoubleBuffer value)
handle - the TesseractAPI instance
name - name of the variable
value - double buffer that receives the value
String TessBaseAPIGetStringVariable(ITessAPI.TessBaseAPI handle, String name)
handle - the TesseractAPI instance
name - name of the variable
void TessBaseAPIPrintVariablesToFile(ITessAPI.TessBaseAPI handle, String filename)
handle - the TesseractAPI instance
filename - name of the file where the variables will be persisted
int TessBaseAPIInit1(ITessAPI.TessBaseAPI handle, String datapath, String language, int oem, com.sun.jna.ptr.PointerByReference configs, int configs_size)
Instances are now mostly thread-safe and totally independent, but some
global parameters remain. Basically it is safe to use multiple
TessBaseAPIs in different threads in parallel, UNLESS you use SetVariable
on some of the Params in classify and textord. If you do, then the effect
will be to change it for all your instances.
Start tesseract. Returns zero on success and -1 on failure. NOTE that the
only members that may be called before Init are those listed above here
in the class definition.
It is entirely safe (and eventually will be efficient) to call Init
multiple times on the same instance to change language, or just to reset
the classifier. Languages may specify internally that they want to be
loaded with one or more other languages, so the ~ sign is available to
override that. E.g., if hin were set to load eng by default, then
hin+~eng would force loading only hin. The number of loaded languages is
limited only by memory, with the caveat that loading additional languages
will impact both speed and accuracy, as there is more work to do to
decide on the applicable language, and there is more chance of
hallucinating incorrect words. WARNING: On changing languages, all
Tesseract parameters are reset back to their default values. (Which may
vary between languages.) If you have a rare need to set a Variable that
controls initialization for a second call to Init you should explicitly
call End() and then use SetVariable before Init. This is only a very rare
use case, since there are very few uses that require any parameters to be
set before Init.
If set_only_non_debug_params is true, only params that do not contain
"debug" in the name will be set.
handle - the TesseractAPI instance
datapath - The datapath must be the name of the parent directory of
tessdata and must end in /. Any name after the last / will be stripped.
language - The language is (usually) an ISO 639-3 string; NULL defaults
to eng. The language may be a string of the form
[~]<lang>[+[~]<lang>] indicating that multiple languages are to be
loaded. E.g., hin+eng will load Hindi and English.
oem - OCR engine mode
configs - pointer configuration
configs_size - pointer configuration size
int TessBaseAPIInit2(ITessAPI.TessBaseAPI handle, String datapath, String language, int oem)
handle - the TesseractAPI instance
datapath - The datapath must be the name of the parent directory of
tessdata and must end in /. Any name after the last / will be stripped.
language - The language is (usually) an ISO 639-3 string; NULL defaults
to eng. The language may be a string of the form
[~]<lang>[+[~]<lang>] indicating that multiple languages are to be
loaded. E.g., hin+eng will load Hindi and English.
oem - OCR engine mode
int TessBaseAPIInit3(ITessAPI.TessBaseAPI handle, String datapath, String language)
handle - the TesseractAPI instance
datapath - The datapath must be the name of the parent directory of
tessdata and must end in /. Any name after the last / will be stripped.
language - The language is (usually) an ISO 639-3 string; NULL defaults
to eng. The language may be a string of the form
[~]<lang>[+[~]<lang>] indicating that multiple languages are to be
loaded. E.g., hin+eng will load Hindi and English.
int TessBaseAPIInit4(ITessAPI.TessBaseAPI handle, String datapath, String language, int oem, com.sun.jna.ptr.PointerByReference configs, int configs_size, com.sun.jna.ptr.PointerByReference vars_vec, com.sun.jna.ptr.PointerByReference vars_values, com.ochafik.lang.jnaerator.runtime.NativeSize vars_vec_size, int set_only_non_debug_params)
handle - the TesseractAPI instance
datapath - The datapath must be the name of the parent directory of
tessdata and must end in /. Any name after the last / will be stripped.
language - The language is (usually) an ISO 639-3 string; NULL defaults
to eng. The language may be a string of the form
[~]<lang>[+[~]<lang>] indicating that multiple languages are to be
loaded. E.g., hin+eng will load Hindi and English.
oem - OCR engine mode
configs - pointer configuration
configs_size - pointer configuration size
vars_vec -
vars_values -
vars_vec_size -
set_only_non_debug_params -
String TessBaseAPIGetInitLanguagesAsString(ITessAPI.TessBaseAPI handle)
Returns the languages string used in the last valid initialization. If
the last initialization specified "deu+hin" then that will be returned.
If hin loaded eng automatically as well, then that will not be included
in this list. To find the languages actually loaded, use
GetLoadedLanguagesAsVector. The returned string should NOT be deleted.
handle - the TesseractAPI instance
com.sun.jna.ptr.PointerByReference TessBaseAPIGetLoadedLanguagesAsVector(ITessAPI.TessBaseAPI handle)
Returns the loaded languages in the vector of STRINGs. Includes all
languages loaded by the last Init, including those loaded as
dependencies of other loaded languages.
handle - the TesseractAPI instance
com.sun.jna.ptr.PointerByReference TessBaseAPIGetAvailableLanguagesAsVector(ITessAPI.TessBaseAPI handle)
handle - the TesseractAPI instance
int TessBaseAPIInitLangMod(ITessAPI.TessBaseAPI handle, String datapath, String language)
Init only the lang model component of Tesseract. The only functions that
work after this init are SetVariable and IsValidWord. WARNING: temporary!
This function will be removed from here and placed in a separate API at
some future time.
handle - the TesseractAPI instance
datapath - The datapath must be the name of the parent directory of
tessdata and must end in /. Any name after the last / will be stripped.
language - The language is (usually) an ISO 639-3 string; NULL defaults
to eng. The language may be a string of the form
[~]<lang>[+[~]<lang>] indicating that multiple languages are to be
loaded. E.g., hin+eng will load Hindi and English.
void TessBaseAPIInitForAnalysePage(ITessAPI.TessBaseAPI handle)
Init only for page layout analysis. Use only for calls to SetImage and
AnalysePage. Calls that attempt recognition will generate an error.
handle - the TesseractAPI instance
void TessBaseAPIReadConfigFile(ITessAPI.TessBaseAPI handle, String filename, int init_only)
Read a "config" file containing a set of param, value pairs. Searches the
standard places: tessdata/configs, tessdata/tessconfigs and also accepts
a relative or absolute path name. Note: only non-init params will be set
(init params are set by Init()).
handle - the TesseractAPI instance
filename - relative or absolute path for the "config" file containing a
set of param and value pairs
init_only -
void TessBaseAPISetPageSegMode(ITessAPI.TessBaseAPI handle, int mode)
Set the current page segmentation mode. Defaults to PSM_SINGLE_BLOCK. The
mode is stored as an IntParam so it can also be modified by
ReadConfigFile or SetVariable("tessedit_pageseg_mode", mode as string).
handle - the TesseractAPI instance
mode - tesseract page segment mode
int TessBaseAPIGetPageSegMode(ITessAPI.TessBaseAPI handle)
handle - the TesseractAPI instance
com.sun.jna.Pointer TessBaseAPIRect(ITessAPI.TessBaseAPI handle, ByteBuffer imagedata, int bytes_per_pixel, int bytes_per_line, int left, int top, int width, int height)
Recognize a rectangle from an image and return the result as a string.
May be called many times for a single Init. Currently has no error
checking. Greyscale of 8 and color of 24 or 32 bits per pixel may be
given. Palette color images will not work properly and must be converted
to 24 bit. Binary images of 1 bit per pixel may also be given but they
must be byte packed with the MSB of the first byte being the first pixel,
and a 1 represents WHITE. For binary images set bytes_per_pixel=0. The
recognized text is returned as a char* which is coded as UTF8 and must be
freed with the delete [] operator.
Note that TesseractRect is the simplified convenience interface. For
advanced uses, use SetImage, (optionally) SetRectangle, Recognize, and
one or more of the Get*Text functions below.
handle - the TesseractAPI instance
imagedata - image byte buffer
bytes_per_pixel - bytes per pixel
bytes_per_line - bytes per line
left - image left
top - image top
width - image width
height - image height
void TessBaseAPIClearAdaptiveClassifier(ITessAPI.TessBaseAPI handle)
handle - the TesseractAPI instance
void TessBaseAPISetImage(ITessAPI.TessBaseAPI handle, ByteBuffer imagedata, int width, int height, int bytes_per_pixel, int bytes_per_line)
Provide an image for Tesseract to recognize. Format is as TesseractRect
above. Does not copy the image buffer, or take ownership. The source
image may be destroyed after Recognize is called, either explicitly or
implicitly via one of the Get*Text functions. SetImage clears all
recognition results, and sets the rectangle to the full image, so it may
be followed immediately by a GetUTF8Text, and it will automatically
perform recognition.
handle - the TesseractAPI instance
imagedata - image byte buffer
width - image width
height - image height
bytes_per_pixel - bytes per pixel
bytes_per_line - bytes per line
void TessBaseAPISetImage2(ITessAPI.TessBaseAPI handle, net.sourceforge.lept4j.Pix pix)
Provide an image for Tesseract to recognize. As with SetImage above,
Tesseract doesn't take a copy or ownership or pixDestroy the image, so it
must persist until after Recognize. Pix vs raw, which to use? Use Pix
where possible. A future version of Tesseract may choose to use Pix as
its internal representation and discard IMAGE altogether. Because of
that, an implementation that sources and targets Pix may end up with
fewer copies than an implementation that does not.
handle - the TesseractAPI instance
pix - image
void TessBaseAPISetSourceResolution(ITessAPI.TessBaseAPI handle, int ppi)
Set the resolution of the source image in pixels per inch so font size
information can be calculated in results. Call this after SetImage().
handle - the TesseractAPI instance
ppi - source resolution value
void TessBaseAPISetRectangle(ITessAPI.TessBaseAPI handle, int left, int top, int width, int height)
Restrict recognition to a sub-rectangle of the image. Call after
SetImage. Each SetRectangle clears the recognition results so multiple
rectangles can be recognized with the same image.
handle - the TesseractAPI instance
left - value
top - value
width - value
height - value
net.sourceforge.lept4j.Pix TessBaseAPIGetThresholdedImage(ITessAPI.TessBaseAPI handle)
ONLY available after SetImage if you have Leptonica installed. Get a copy
of the internal thresholded image from Tesseract.
handle - the TesseractAPI instance
net.sourceforge.lept4j.Boxa TessBaseAPIGetRegions(ITessAPI.TessBaseAPI handle, com.sun.jna.ptr.PointerByReference pixa)
Get the result of page layout analysis as a Leptonica-style Boxa, Pixa
pair, in reading order. Can be called before or after Recognize.
handle - the TesseractAPI instance
pixa - array of Pix
net.sourceforge.lept4j.Boxa TessBaseAPIGetTextlines(ITessAPI.TessBaseAPI handle, com.sun.jna.ptr.PointerByReference pixa, com.sun.jna.ptr.PointerByReference blockids)
Get the textlines as a Leptonica-style Boxa, Pixa pair, in reading order.
Can be called before or after Recognize. If blockids is not NULL, the
block-id of each line is also returned as an array of one element per
line. delete [] after use. If paraids is not NULL, the paragraph-id of
each line within its block is also returned as an array of one element
per line. delete [] after use.
handle - the TesseractAPI instance
pixa - array of Pix
blockids -
net.sourceforge.lept4j.Boxa TessBaseAPIGetTextlines1(ITessAPI.TessBaseAPI handle, int raw_image, int raw_padding, com.sun.jna.ptr.PointerByReference pixa, com.sun.jna.ptr.PointerByReference blockids, com.sun.jna.ptr.PointerByReference paraids)
Get the textlines as a Leptonica-style Boxa, Pixa pair, in reading order.
Can be called before or after Recognize. If blockids is not NULL, the
block-id of each line is also returned as an array of one element per
line. delete [] after use. If paraids is not NULL, the paragraph-id of
each line within its block is also returned as an array of one element
per line. delete [] after use.
handle - the TesseractAPI instance
raw_image -
raw_padding -
pixa - array of Pix
blockids -
paraids -
net.sourceforge.lept4j.Boxa TessBaseAPIGetStrips(ITessAPI.TessBaseAPI handle, com.sun.jna.ptr.PointerByReference pixa, com.sun.jna.ptr.PointerByReference blockids)
Gets text lines and strips of image regions as a Boxa, Pixa pair, in reading order. Enables downstream handling of non-rectangular regions. Can be called before or after Recognize. If blockids is not NULL, the block-id of each line is also returned as an array of one element per line; delete [] after use.
handle - the TesseractAPI instance
pixa - array of Pix
blockids -
net.sourceforge.lept4j.Boxa TessBaseAPIGetWords(ITessAPI.TessBaseAPI handle, com.sun.jna.ptr.PointerByReference pixa)
Gets the words as a Boxa, Pixa pair, in reading order. Can be called before or after Recognize.
handle - the TesseractAPI instance
pixa - array of Pix
net.sourceforge.lept4j.Boxa TessBaseAPIGetConnectedComponents(ITessAPI.TessBaseAPI handle, com.sun.jna.ptr.PointerByReference cc)
Gets the individual connected (text) components as a Boxa, Pixa pair, in reading order. Can be called before or after Recognize.
handle - the TesseractAPI instance
cc - array of Pix
net.sourceforge.lept4j.Boxa TessBaseAPIGetComponentImages(ITessAPI.TessBaseAPI handle, int level, int text_only, com.sun.jna.ptr.PointerByReference pixa, com.sun.jna.ptr.PointerByReference blockids)
Gets the components at the given level as a Boxa, Pixa pair, in reading order. Can be called before or after Recognize. If blockids is not NULL, the block-id of each component is also returned as an array of one element per component; delete [] after use. If text_only is true, then only text components are returned. Helper function to get binary images with no padding (most common usage).
handle - the TesseractAPI instance
level - PageIteratorLevel
text_only -
pixa - array of Pix
blockids -
net.sourceforge.lept4j.Boxa TessBaseAPIGetComponentImages1(ITessAPI.TessBaseAPI handle, int level, int text_only, int raw_image, int raw_padding, com.sun.jna.ptr.PointerByReference pixa, com.sun.jna.ptr.PointerByReference blockids, com.sun.jna.ptr.PointerByReference paraids)
Gets the components at the given level as a Boxa, Pixa pair, in reading order. Can be called before or after Recognize. If blockids is not NULL, the block-id of each component is also returned as an array of one element per component; delete [] after use. If paraids is not NULL, the paragraph-id of each component within its block is also returned as an array of one element per component; delete [] after use. If raw_image is true, then portions of the original image are extracted instead of the thresholded image and padded with raw_padding. If text_only is true, then only text components are returned.
handle - the TesseractAPI instance
level - PageIteratorLevel
text_only -
raw_image -
raw_padding -
pixa - array of Pix
blockids -
paraids -
int TessBaseAPIGetThresholdedImageScaleFactor(ITessAPI.TessBaseAPI handle)
handle - the TesseractAPI instance
void TessBaseAPIDumpPGM(ITessAPI.TessBaseAPI handle, String filename)
handle - the TesseractAPI instance
filename - pgm file name
ITessAPI.TessPageIterator TessBaseAPIAnalyseLayout(ITessAPI.TessBaseAPI handle)
Runs page layout analysis in the mode set by SetPageSegMode. May optionally be called prior to Recognize to get access to just the page layout results. Returns an iterator to the results, or NULL on error. The returned iterator must be deleted after use.
WARNING! This class points to data held within the TessBaseAPI class, and therefore can only be used while the TessBaseAPI class still exists and has not been subjected to a call of Init, SetImage, Recognize, Clear, End, DetectOS, or anything else that changes the internal PAGE_RES.
handle - the TesseractAPI instance
int TessBaseAPIRecognize(ITessAPI.TessBaseAPI handle, ITessAPI.ETEXT_DESC monitor)
Recognizes the image from SetAndThresholdImage, generating Tesseract internal structures. Returns 0 on success. Optional: the Get*Text functions below will call Recognize if needed. After Recognize, the output is kept internally until the next SetImage.
handle - the TesseractAPI instance
monitor - the result as Tesseract internal structures
int TessBaseAPIRecognizeForChopTest(ITessAPI.TessBaseAPI handle, ITessAPI.ETEXT_DESC monitor)
Variant on Recognize used for testing the chopper.
handle - the TesseractAPI instance
monitor - the result as Tesseract internal structures
ITessAPI.TessResultIterator TessBaseAPIGetIterator(ITessAPI.TessBaseAPI handle)
Gets a reading-order iterator to the results of layout analysis and/or Recognize. The returned iterator must be deleted after use.
WARNING! This class points to data held within the TessBaseAPI class, and therefore can only be used while the TessBaseAPI class still exists and has not been subjected to a call of Init, SetImage, Recognize, Clear, End, DetectOS, or anything else that changes the internal PAGE_RES.
handle - the TesseractAPI instance
ITessAPI.TessMutableIterator TessBaseAPIGetMutableIterator(ITessAPI.TessBaseAPI handle)
Gets a mutable iterator to the results of layout analysis and/or Recognize. The returned iterator must be deleted after use.
WARNING! This class points to data held within the TessBaseAPI class, and therefore can only be used while the TessBaseAPI class still exists and has not been subjected to a call of Init, SetImage, Recognize, Clear, End, DetectOS, or anything else that changes the internal PAGE_RES.
handle - the TesseractAPI instance
int TessBaseAPIProcessPages(ITessAPI.TessBaseAPI handle, String filename, String retry_config, int timeout_millisec, ITessAPI.TessResultRenderer renderer)
Recognizes all the pages in the named file and produces the kind of output selected by the parameters tessedit_create_boxfile, tessedit_make_boxes_from_boxes, tessedit_write_unlv, and tessedit_create_hocr. Calls ProcessPage on each page in the input file, which may be a multi-page tiff, a single-page file in another format, or a plain text list of images to read. If tessedit_page_number is non-negative, processing begins at that page of a multi-page tiff file, or filelist. The text is returned in text_out. Returns false on error. If timeout_millisec is non-zero, processing terminates after the timeout on a single page. If retry_config is non-NULL and non-empty and some page fails for some reason, the page is reprocessed with the retry_config config file. Useful for interactively debugging a bad page.
handle - the TesseractAPI instance
filename - multi-page tiff or list of filenames
retry_config - retry config values
timeout_millisec - timeout value
renderer - result renderer
int TessBaseAPIProcessPage(ITessAPI.TessBaseAPI handle, net.sourceforge.lept4j.Pix pix, int page_index, String filename, String retry_config, int timeout_millisec, ITessAPI.TessResultRenderer renderer)
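TessBaseAPIProcessPages accepts a plain text list of images as its input file. The following is a minimal sketch of preparing such a list with the JDK only. It assumes one image path per line, which this page does not state explicitly; the class name, method name, and file names are hypothetical.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Arrays;
import java.util.List;

class FileListSketch {
    /** Builds the list content. Assumption: one image path per line. */
    static String fileListContent(List<String> imagePaths) {
        StringBuilder sb = new StringBuilder();
        for (String path : imagePaths) {
            sb.append(path).append('\n');
        }
        return sb.toString();
    }

    public static void main(String[] args) throws IOException {
        // Hypothetical image names, used only for illustration.
        List<String> pages = Arrays.asList("page1.png", "page2.png");
        Path list = Files.createTempFile("pages", ".txt");
        Files.write(list, fileListContent(pages).getBytes(StandardCharsets.UTF_8));
        // The path of this file would then be passed as the filename argument.
        System.out.println(list);
    }
}
```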
com.sun.jna.Pointer TessBaseAPIGetUTF8Text(ITessAPI.TessBaseAPI handle)
handle - the TesseractAPI instance
com.sun.jna.Pointer TessBaseAPIGetHOCRText(ITessAPI.TessBaseAPI handle, int page_number)
handle - the TesseractAPI instance
page_number - page number
com.sun.jna.Pointer TessBaseAPIGetBoxText(ITessAPI.TessBaseAPI handle, int page_number)
handle - the TesseractAPI instance
page_number - number of the page
com.sun.jna.Pointer TessBaseAPIGetUNLVText(ITessAPI.TessBaseAPI handle)
handle - the TesseractAPI instance
int TessBaseAPIMeanTextConf(ITessAPI.TessBaseAPI handle)
handle - the TesseractAPI instance
com.sun.jna.ptr.IntByReference TessBaseAPIAllWordConfidences(ITessAPI.TessBaseAPI handle)
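Returns an array of all word confidences, terminated by -1. The number of confidences should correspond to the number of space-delimited words in GetUTF8Text.
handle - the TesseractAPI instance
int TessBaseAPIAdaptToWordStr(ITessAPI.TessBaseAPI handle, int mode, String wordstr)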
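TessBaseAPIAllWordConfidences, documented above, returns an array terminated by -1 rather than an explicit length. As a rough sketch of how such a sentinel-terminated array can be consumed, the helper below works on a plain int[] that stands in for the native memory; reading the values out of the returned IntByReference is not shown, and the class and method names are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;

class ConfidenceArraySketch {
    /** Collects values up to, but not including, the first -1 sentinel. */
    static List<Integer> untilSentinel(int[] raw) {
        List<Integer> confidences = new ArrayList<>();
        for (int value : raw) {
            if (value == -1) {
                break; // end-of-array marker
            }
            confidences.add(value);
        }
        return confidences;
    }

    public static void main(String[] args) {
        // Stand-in data; anything after the sentinel is ignored.
        int[] raw = {93, 87, -1, 5};
        System.out.println(untilSentinel(raw)); // prints [93, 87]
    }
}
```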
Applies the given word to the adaptive classifier if possible. Assumes that SetImage/SetRectangle have been used to set the image to the given word. The mode arg should be PSM_SINGLE_WORD or PSM_CIRCLE_WORD, as that will be used to control layout analysis. The currently set PageSegMode is preserved.
handle - the TesseractAPI instance
mode - tesseract page segment mode
wordstr - the word, which must be SPACE-DELIMITED UTF-8 (l i k e t h i s) so that the boundaries of the graphemes can be determined
void TessBaseAPIClear(ITessAPI.TessBaseAPI handle)
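The wordstr argument of TessBaseAPIAdaptToWordStr, documented above, must be space-delimited. One possible way to build such a string is sketched below using java.text.BreakIterator to split on user-perceived characters. Whether these boundaries match what Tesseract treats as graphemes is an assumption, and the class and method names are hypothetical.

```java
import java.text.BreakIterator;
import java.util.Locale;

class WordStrSketch {
    /** Inserts a single space between consecutive character boundaries. */
    static String spaceDelimit(String word) {
        BreakIterator it = BreakIterator.getCharacterInstance(Locale.ROOT);
        it.setText(word);
        StringBuilder sb = new StringBuilder();
        int start = it.first();
        for (int end = it.next(); end != BreakIterator.DONE; start = end, end = it.next()) {
            if (sb.length() > 0) {
                sb.append(' ');
            }
            sb.append(word, start, end);
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        System.out.println(spaceDelimit("like")); // prints l i k e
    }
}
```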
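Frees up recognition results and any stored image data, without actually freeing any recognition data that would be time-consuming to reload. Afterwards, you must call SetImage or TesseractRect before doing any Recognize or Get* operation.
handle - the TesseractAPI instance
void TessBaseAPIEnd(ITessAPI.TessBaseAPI handle)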
Closes down Tesseract and frees up all memory. End() is equivalent to destructing and reconstructing your TessBaseAPI. Once End() has been used, none of the other API functions may be used other than Init and anything declared above it in the class definition.
handle - the TesseractAPI instance
int TessBaseAPIIsValidWord(ITessAPI.TessBaseAPI handle, String word)
handle - the TesseractAPI instance
word - word value
int TessBaseAPIGetTextDirection(ITessAPI.TessBaseAPI handle, IntBuffer out_offset, FloatBuffer out_slope)
handle - the TesseractAPI instance
out_offset - offset
out_slope - slope
void TessBaseAPIClearPersistentCache(ITessAPI.TessBaseAPI handle)
Clears any library-level memory caches. Some expensive-to-load data structures are cached globally, surviving the Init() and End() of individual TessBaseAPI instances. This function allows the clearing of these caches.
handle - the TesseractAPI instance
int TessBaseAPIDetectOrientationScript(ITessAPI.TessBaseAPI handle, IntBuffer orient_deg, FloatBuffer orient_conf, com.sun.jna.ptr.PointerByReference script_name, FloatBuffer script_conf)
Detects the orientation of the input image and apparent script (alphabet). orient_deg is the detected clockwise rotation of the input image in degrees (0, 90, 180, 270); orient_conf is the confidence (15.0 is reasonably confident); script_name is an ASCII string, the name of the script, e.g. "Latin"; script_conf is the confidence level in the script.
String TessBaseAPIGetUnichar(ITessAPI.TessBaseAPI handle, int unichar_id)
handle - the TesseractAPI instance
unichar_id - the unichar id
void TessPageIteratorDelete(ITessAPI.TessPageIterator handle)
handle - the TessPageIterator instance
ITessAPI.TessPageIterator TessPageIteratorCopy(ITessAPI.TessPageIterator handle)
handle - the TessPageIterator instance
void TessPageIteratorBegin(ITessAPI.TessPageIterator handle)
handle - the TessPageIterator instance
int TessPageIteratorNext(ITessAPI.TessPageIterator handle, int level)
PTIsTextType(BlockType()) is false for non-text blocks.
handle - the TessPageIterator instance
level - tesseract page level
int TessPageIteratorIsAtBeginningOf(ITessAPI.TessPageIterator handle, int level)
handle - the TessPageIterator instance
level - tesseract page level
int TessPageIteratorIsAtFinalElement(ITessAPI.TessPageIterator handle, int level, int element)
handle - the TessPageIterator instance
level - tesseract page level
element - page iterator level
int TessPageIteratorBoundingBox(ITessAPI.TessPageIterator handle, int level, IntBuffer left, IntBuffer top, IntBuffer right, IntBuffer bottom)
handle - the TessPageIterator instance
level - tesseract page level
left - int buffer position
top - int buffer position
right - int buffer position
bottom - int buffer position
int TessPageIteratorBlockType(ITessAPI.TessPageIterator handle)
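TessPageIteratorBoundingBox, documented above, reports its result through four IntBuffer out-parameters. The sketch below shows one way to read those buffers back into an x, y, width, height box. The buffers are filled by hand as a stand-in for the native call, the convention that width is right minus left (and height is bottom minus top) is an assumption, and the class and method names are hypothetical.

```java
import java.nio.IntBuffer;
import java.util.Arrays;

class BoundingBoxSketch {
    /** Converts edge coordinates to {x, y, width, height}. */
    static int[] toXYWH(IntBuffer left, IntBuffer top, IntBuffer right, IntBuffer bottom) {
        int x = left.get(0);
        int y = top.get(0);
        // Assumption: right and bottom are edge coordinates, not sizes.
        return new int[] {x, y, right.get(0) - x, bottom.get(0) - y};
    }

    public static void main(String[] args) {
        IntBuffer left = IntBuffer.allocate(1);
        IntBuffer top = IntBuffer.allocate(1);
        IntBuffer right = IntBuffer.allocate(1);
        IntBuffer bottom = IntBuffer.allocate(1);
        // Stand-in for the native call, which would write into the buffers.
        left.put(0, 10);
        top.put(0, 20);
        right.put(0, 110);
        bottom.put(0, 50);
        System.out.println(Arrays.toString(toXYWH(left, top, right, bottom))); // prints [10, 20, 100, 30]
    }
}
```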
handle - the TessPageIterator instance
net.sourceforge.lept4j.Pix TessPageIteratorGetBinaryImage(ITessAPI.TessPageIterator handle, int level)
Returns a binary image of the current object at the given level. Use pixDestroy to delete the image after use. The following methods are used to generate the images:
RIL_BLOCK: mask the page image with the block polygon.
RIL_TEXTLINE: clip the rectangle of the line box from the page image. TODO(rays) fix this to generate and use a line polygon.
RIL_WORD: clip the rectangle of the word box from the page image.
RIL_SYMBOL: render the symbol outline to an image for cblobs (prior to recognition) or the bounding box otherwise.
A reconstruction of the original image (using xor to check for double representation) should be reasonably accurate, apart from removed noise, at the block level. Below the block level, the reconstruction will be missing images and line separators. At the symbol level, kerned characters will invade the bounding box if rendered after recognition, making an xor reconstruction inaccurate, but an or construction better. Before recognition, symbol-level reconstruction should be good, even with xor, since the images come from the connected components.
handle - the TessPageIterator instance
level - PageIteratorLevel
net.sourceforge.lept4j.Pix TessPageIteratorGetImage(ITessAPI.TessPageIterator handle, int level, int padding, net.sourceforge.lept4j.Pix original_image, IntBuffer left, IntBuffer top)
Returns an image of the current object at the given level. Use pixDestroy to delete the image after use.
handle - the TessPageIterator instance
level - PageIteratorLevel
padding -
original_image -
left -
top -
int TessPageIteratorBaseline(ITessAPI.TessPageIterator handle, int level, IntBuffer x1, IntBuffer y1, IntBuffer x2, IntBuffer y2)
handle - the TessPageIterator instance
level - PageIteratorLevel
x1 - int buffer position
y1 - int buffer position
x2 - int buffer position
y2 - int buffer position
void TessPageIteratorOrientation(ITessAPI.TessPageIterator handle, IntBuffer orientation, IntBuffer writing_direction, IntBuffer textline_order, FloatBuffer deskew_angle)
handle - the TessPageIterator instance
orientation - orientation value
writing_direction - writing direction value
textline_order - text line order
deskew_angle - deskew angle
void TessPageIteratorParagraphInfo(ITessAPI.TessPageIterator handle, IntBuffer justification, IntBuffer is_list_item, IntBuffer is_crown, IntBuffer first_line_indent)
handle - the TessPageIterator instance
justification - justification type
is_list_item - list item
is_crown - very first or continuation
first_line_indent - first line indentation
void TessResultIteratorDelete(ITessAPI.TessResultIterator handle)
handle - the TessResultIterator instance
ITessAPI.TessResultIterator TessResultIteratorCopy(ITessAPI.TessResultIterator handle)
handle - the TessResultIterator instance
ITessAPI.TessPageIterator TessResultIteratorGetPageIterator(ITessAPI.TessResultIterator handle)
handle - the TessResultIterator instance
ITessAPI.TessPageIterator TessResultIteratorGetPageIteratorConst(ITessAPI.TessResultIterator handle)
handle - the TessResultIterator instance
int TessResultIteratorNext(ITessAPI.TessResultIterator handle, int level)
com.sun.jna.Pointer TessResultIteratorGetUTF8Text(ITessAPI.TessResultIterator handle, int level)
handle - the TessResultIterator instance
level - tesseract page level
float TessResultIteratorConfidence(ITessAPI.TessResultIterator handle, int level)
handle - the TessResultIterator instance
level - tesseract page level
String TessResultIteratorWordRecognitionLanguage(ITessAPI.TessResultIterator handle)
String TessResultIteratorWordFontAttributes(ITessAPI.TessResultIterator handle, IntBuffer is_bold, IntBuffer is_italic, IntBuffer is_underlined, IntBuffer is_monospace, IntBuffer is_serif, IntBuffer is_smallcaps, IntBuffer pointsize, IntBuffer font_id)
Returns the font attributes of the current word. The returned value is rendered invalid by various members of TessBaseAPI, including Init, SetImage, End or deleting the TessBaseAPI. Pointsize is returned in printer's points (1/72 inch).
handle - the TessResultIterator instance
is_bold - font attribute
is_italic - font attribute
is_underlined - font attribute
is_monospace - font attribute
is_serif - font attribute
is_smallcaps - font attribute
pointsize - font attribute
font_id - font attribute
int TessResultIteratorWordIsFromDictionary(ITessAPI.TessResultIterator handle)
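TessResultIteratorWordFontAttributes, documented above, reports pointsize in printer's points (1/72 inch). A small sketch of converting that value to pixels follows. It assumes the resolution in use is the ppi value that was passed to TessBaseAPISetSourceResolution; the class and method names are hypothetical.

```java
class PointSizeSketch {
    /** Converts a size in points (1/72 inch) to pixels at the given resolution. */
    static double pointsToPixels(int pointSize, int ppi) {
        return pointSize * ppi / 72.0;
    }

    public static void main(String[] args) {
        // A 12-point font scanned at 300 ppi is 50 pixels tall.
        System.out.println(pointsToPixels(12, 300)); // prints 50.0
    }
}
```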
handle - the TessResultIterator instance
int TessResultIteratorWordIsNumeric(ITessAPI.TessResultIterator handle)
handle - the TessResultIterator instance
int TessResultIteratorSymbolIsSuperscript(ITessAPI.TessResultIterator handle)
handle - the TessResultIterator instance
int TessResultIteratorSymbolIsSubscript(ITessAPI.TessResultIterator handle)
handle - the TessResultIterator instance
int TessResultIteratorSymbolIsDropcap(ITessAPI.TessResultIterator handle)
handle - the TessResultIterator instance
ITessAPI.TessChoiceIterator TessResultIteratorGetChoiceIterator(ITessAPI.TessResultIterator handle)
void TessChoiceIteratorDelete(ITessAPI.TessChoiceIterator handle)
int TessChoiceIteratorNext(ITessAPI.TessChoiceIterator handle)
String TessChoiceIteratorGetUTF8Text(ITessAPI.TessChoiceIterator handle)
float TessChoiceIteratorConfidence(ITessAPI.TessChoiceIterator handle)
Copyright © 2017 Tess4J. All Rights Reserved.