int
Applies the given word to the adaptive classifier if possible.
static int
Applies the given word to the adaptive classifier if possible.
com.sun.jna.ptr.IntByReference
Returns an array of all word confidences, terminated by -1.
static com.sun.jna.ptr.IntByReference
Returns an array of all word confidences, terminated by -1.
Runs page layout analysis in the mode set by SetPageSegMode
.
Runs page layout analysis in the mode set by SetPageSegMode
.
void
Free up recognition results and any stored image data, without actually
freeing any recognition data that would be time-consuming to reload.
static void
Free up recognition results and any stored image data, without actually
freeing any recognition data that would be time-consuming to reload.
void
Call between pages or documents etc to free up memory and forget adaptive
data.
static void
Call between pages or documents etc to free up memory and forget adaptive
data.
void
Clear any library-level memory caches.
static void
Clear any library-level memory caches.
void
Disposes the TesseractAPI instance.
static void
Disposes the TesseractAPI instance.
int
Detect the orientation of the input image and apparent script (alphabet).
static int
Detect the orientation of the input image and apparent script (alphabet).
void
Close down tesseract and free up all memory.
static void
Close down tesseract and free up all memory.
com.sun.jna.Pointer
Make an XML-formatted string with Alto markup from the internal data
structures.
static com.sun.jna.Pointer
Make an XML-formatted string with Alto markup from the internal data
structures.
com.sun.jna.ptr.PointerByReference
Returns the available languages in the vector of STRINGs.
static com.sun.jna.ptr.PointerByReference
Returns the available languages in the vector of STRINGs.
int
Get the value of an internal bool parameter.
static int
Get the value of an internal bool parameter.
com.sun.jna.Pointer
The recognized text is returned as a char* which is coded as a UTF8 box
file and must be freed with the delete [] operator.
static com.sun.jna.Pointer
The recognized text is returned as a char* which is coded as a UTF8 box
file and must be freed with the delete [] operator.
net.sourceforge.lept4j.Boxa
Get the given level kind of components (block, textline, word etc.) as a
Leptonica-style Boxa
, Pixa
pair, in reading
order.
static net.sourceforge.lept4j.Boxa
Get the given level kind of components (block, textline, word etc.) as a
Leptonica-style Boxa
, Pixa
pair, in reading
order.
net.sourceforge.lept4j.Boxa
TessAPI.TessBaseAPIGetComponentImages1(ITessAPI.TessBaseAPI handle,
int level,
int text_only,
int raw_image,
int raw_padding,
com.sun.jna.ptr.PointerByReference pixa,
com.sun.jna.ptr.PointerByReference blockids,
com.sun.jna.ptr.PointerByReference paraids)
Get the given level kind of components (block, textline, word etc.) as a
Leptonica-style Boxa
, Pixa
pair, in reading
order.
static net.sourceforge.lept4j.Boxa
TessAPI1.TessBaseAPIGetComponentImages1(ITessAPI.TessBaseAPI handle,
int level,
int text_only,
int raw_image,
int raw_padding,
com.sun.jna.ptr.PointerByReference pixa,
com.sun.jna.ptr.PointerByReference blockids,
com.sun.jna.ptr.PointerByReference paraids)
Get the given level kind of components (block, textline, word etc.) as a
Leptonica-style Boxa
, Pixa
pair, in reading
order.
net.sourceforge.lept4j.Boxa
Gets the individual connected (text) components (created after pages
segmentation step, but before recognition) as a Leptonica-style
Boxa
, Pixa
pair, in reading order.
static net.sourceforge.lept4j.Boxa
Gets the individual connected (text) components (created after pages
segmentation step, but before recognition) as a Leptonica-style
Boxa
, Pixa
pair, in reading order.
int
Get the value of an internal double parameter.
static int
Get the value of an internal double parameter.
com.sun.jna.Pointer
Make a HTML-formatted string with hOCR markup from the internal data
structures.
static com.sun.jna.Pointer
Make a HTML-formatted string with hOCR markup from the internal data
structures.
Returns the languages string used in the last valid initialization.
Returns the languages string used in the last valid initialization.
net.sourceforge.lept4j.Pix
static net.sourceforge.lept4j.Pix
These functions are required for searchable PDF output.
These functions are required for searchable PDF output.
int
Get the value of an internal int parameter.
static int
Get the value of an internal int parameter.
Get a reading-order iterator to the results of LayoutAnalysis and/or
Recognize
.
Get a reading-order iterator to the results of LayoutAnalysis and/or
Recognize
.
com.sun.jna.ptr.PointerByReference
Returns the loaded languages in the vector of STRINGs.
static com.sun.jna.ptr.PointerByReference
Returns the loaded languages in the vector of STRINGs.
com.sun.jna.Pointer
Create a UTF8 box file for LSTM training from the internal data
structures.
static com.sun.jna.Pointer
Create a UTF8 box file for LSTM training from the internal data
structures.
Get a mutable iterator to the results of LayoutAnalysis and/or
Recognize
.
Get a mutable iterator to the results of LayoutAnalysis and/or
Recognize
.
int
Return the current page segmentation mode.
static int
Return the current page segmentation mode.
net.sourceforge.lept4j.Boxa
Get the result of page layout analysis as a Leptonica-style
Boxa
, Pixa
pair, in reading order.
static net.sourceforge.lept4j.Boxa
Get the result of page layout analysis as a Leptonica-style
Boxa
, Pixa
pair, in reading order.
int
static int
Get the value of an internal string parameter.
Get the value of an internal string parameter.
net.sourceforge.lept4j.Boxa
Get textlines and strips of image regions as a Leptonica-style
Boxa
, Pixa
pair, in reading order.
static net.sourceforge.lept4j.Boxa
Get textlines and strips of image regions as a Leptonica-style
Boxa
, Pixa
pair, in reading order.
int
static int
net.sourceforge.lept4j.Boxa
Get the textlines as a Leptonica-style Boxa
,
Pixa
pair, in reading order.
static net.sourceforge.lept4j.Boxa
Get the textlines as a Leptonica-style Boxa
,
Pixa
pair, in reading order.
net.sourceforge.lept4j.Boxa
TessAPI.TessBaseAPIGetTextlines1(ITessAPI.TessBaseAPI handle,
int raw_image,
int raw_padding,
com.sun.jna.ptr.PointerByReference pixa,
com.sun.jna.ptr.PointerByReference blockids,
com.sun.jna.ptr.PointerByReference paraids)
Get the textlines as a Leptonica-style Boxa
,
Pixa
pair, in reading order.
static net.sourceforge.lept4j.Boxa
TessAPI1.TessBaseAPIGetTextlines1(ITessAPI.TessBaseAPI handle,
int raw_image,
int raw_padding,
com.sun.jna.ptr.PointerByReference pixa,
com.sun.jna.ptr.PointerByReference blockids,
com.sun.jna.ptr.PointerByReference paraids)
Get the textlines as a Leptonica-style Boxa
,
Pixa
pair, in reading order.
net.sourceforge.lept4j.Pix
ONLY available after SetImage
if you have Leptonica
installed.
static net.sourceforge.lept4j.Pix
ONLY available after SetImage
if you have Leptonica
installed.
int
static int
com.sun.jna.Pointer
Make a TSV-formatted string from the internal data structures.
static com.sun.jna.Pointer
Make a TSV-formatted string from the internal data structures.
Gets the string of the specified unichar.
Gets the string of the specified unichar.
com.sun.jna.Pointer
The recognized text is returned as a char* which is coded as UNLV format
Latin-1 with specific reject and suspect codes and must be freed with the
delete [] operator.
static com.sun.jna.Pointer
The recognized text is returned as a char* which is coded as UNLV format
Latin-1 with specific reject and suspect codes and must be freed with the
delete [] operator.
com.sun.jna.Pointer
The recognized text is returned as a char* which is coded as UTF-8 and
must be freed with the delete [] operator.
static com.sun.jna.Pointer
The recognized text is returned as a char* which is coded as UTF-8 and
must be freed with the delete [] operator.
net.sourceforge.lept4j.Boxa
Get the words as a Leptonica-style Boxa
, Pixa
pair, in reading order.
static net.sourceforge.lept4j.Boxa
Get the words as a Leptonica-style Boxa
, Pixa
pair, in reading order.
com.sun.jna.Pointer
Create a UTF8 box file with WordStr strings from the internal data
structures.
static com.sun.jna.Pointer
Create a UTF8 box file with WordStr strings from the internal data
structures.
int
Instances are now mostly thread-safe and totally independent, but some
global parameters remain.
static int
Instances are now mostly thread-safe and totally independent, but some
global parameters remain.
int
static int
int
static int
int
TessAPI.TessBaseAPIInit4(ITessAPI.TessBaseAPI handle,
String datapath,
String language,
int oem,
com.sun.jna.ptr.PointerByReference configs,
int configs_size,
com.sun.jna.ptr.PointerByReference vars_vec,
com.sun.jna.ptr.PointerByReference vars_values,
com.ochafik.lang.jnaerator.runtime.NativeSize vars_vec_size,
int set_only_non_debug_params)
static int
TessAPI1.TessBaseAPIInit4(ITessAPI.TessBaseAPI handle,
String datapath,
String language,
int oem,
com.sun.jna.ptr.PointerByReference configs,
int configs_size,
com.sun.jna.ptr.PointerByReference vars_vec,
com.sun.jna.ptr.PointerByReference vars_values,
com.ochafik.lang.jnaerator.runtime.NativeSize vars_vec_size,
int set_only_non_debug_params)
int
TessAPI.TessBaseAPIInit5(ITessAPI.TessBaseAPI handle,
String data,
int data_size,
String language,
int oem,
com.sun.jna.ptr.PointerByReference configs,
int configs_size,
com.sun.jna.ptr.PointerByReference vars_vec,
com.sun.jna.ptr.PointerByReference vars_values,
com.ochafik.lang.jnaerator.runtime.NativeSize vars_vec_size,
int set_only_non_debug_params)
static int
TessAPI1.TessBaseAPIInit5(ITessAPI.TessBaseAPI handle,
String data,
int data_size,
String language,
int oem,
com.sun.jna.ptr.PointerByReference configs,
int configs_size,
com.sun.jna.ptr.PointerByReference vars_vec,
com.sun.jna.ptr.PointerByReference vars_values,
com.ochafik.lang.jnaerator.runtime.NativeSize vars_vec_size,
int set_only_non_debug_params)
void
Init only for page layout analysis.
static void
Init only for page layout analysis.
int
Check whether a word is valid according to Tesseract's language model.
static int
Check whether a word is valid according to Tesseract's language model.
int
Returns the average word confidence for Tesseract page result.
static int
Returns the average word confidence for Tesseract page result.
void
Print Tesseract parameters to the given file.
Note: Must not be the first method called after instance create.
static void
Print Tesseract parameters to the given file.
Note: Must not be the first method called after instance create.
int
static int
int
Recognizes all the pages in the named file, as a multi-page tiff or list
of filenames, or single image, and gets the appropriate kind of text
according to parameters: tessedit_create_boxfile
,
tessedit_make_boxes_from_boxes
,
tessedit_write_unlv
, tessedit_create_hocr
.
static int
Recognizes all the pages in the named file, as a multi-page tiff or list
of filenames, or single image, and gets the appropriate kind of text
according to parameters: tessedit_create_boxfile
,
tessedit_make_boxes_from_boxes
,
tessedit_write_unlv
, tessedit_create_hocr
.
void
Read a "config" file containing a set of param, value pairs.
static void
Read a "config" file containing a set of param, value pairs.
int
Recognize the image from SetAndThresholdImage
, generating
Tesseract internal structures.
static int
Recognize the image from SetAndThresholdImage
, generating
Tesseract internal structures.
com.sun.jna.Pointer
Recognize a rectangle from an image and return the result as a string.
static com.sun.jna.Pointer
Recognize a rectangle from an image and return the result as a string.
void
Provide an image for Tesseract to recognize.
static void
Provide an image for Tesseract to recognize.
void
Provide an image for Tesseract to recognize.
static void
Provide an image for Tesseract to recognize.
void
static void
void
Set the name of the input file.
static void
Set the name of the input file.
void
Set the name of the bonus output files.
static void
Set the name of the bonus output files.
void
Set the current page segmentation mode.
static void
Set the current page segmentation mode.
void
Restrict recognition to a sub-rectangle of the image.
static void
Restrict recognition to a sub-rectangle of the image.
void
Set the resolution of the source image in pixels per inch so font size
information can be calculated in results.
static void
Set the resolution of the source image in pixels per inch so font size
information can be calculated in results.
int
Set the value of an internal "parameter." Supply the name of the
parameter and the value as a string, just as you would in a config file.
static int
Set the value of an internal "parameter." Supply the name of the
parameter and the value as a string, just as you would in a config file.