Package | Description |
---|---|
net.sourceforge.tess4j |
Modifier and Type | Method and Description |
---|---|
static TessAPI1.TessBaseAPI |
TessAPI1.TessBaseAPICreate()
Creates an instance of the base class for all Tesseract APIs.
|
Modifier and Type | Method and Description |
---|---|
static int |
TessAPI1.TessBaseAPIAdaptToWordStr(TessAPI1.TessBaseAPI handle,
int mode,
java.lang.String wordstr)
Applies the given word to the adaptive classifier if possible.
|
static com.sun.jna.ptr.IntByReference |
TessAPI1.TessBaseAPIAllWordConfidences(TessAPI1.TessBaseAPI handle)
Returns all word confidences (between 0 and 100) in an array, terminated
by -1.
|
static TessAPI1.TessPageIterator |
TessAPI1.TessBaseAPIAnalyseLayout(TessAPI1.TessBaseAPI handle)
Runs page layout analysis in the mode set by SetPageSegMode.
|
static void |
TessAPI1.TessBaseAPIClear(TessAPI1.TessBaseAPI handle)
Frees recognition results and any stored image data, without freeing
recognition data that would be time-consuming to reload.
|
static void |
TessAPI1.TessBaseAPIClearAdaptiveClassifier(TessAPI1.TessBaseAPI handle)
Call between pages or documents to free memory and discard adaptive
data.
|
static void |
TessAPI1.TessBaseAPIDelete(TessAPI1.TessBaseAPI handle)
Disposes of the TesseractAPI instance.
|
static void |
TessAPI1.TessBaseAPIEnd(TessAPI1.TessBaseAPI handle)
Close down tesseract and free up all memory.
|
static com.sun.jna.ptr.PointerByReference |
TessAPI1.TessBaseAPIGetAvailableLanguagesAsVector(TessAPI1.TessBaseAPI handle)
Returns the available languages in the vector of STRINGs.
|
static int |
TessAPI1.TessBaseAPIGetBoolVariable(TessAPI1.TessBaseAPI handle,
java.lang.String name,
java.nio.IntBuffer value) |
static java.lang.String |
TessAPI1.TessBaseAPIGetBoxText(TessAPI1.TessBaseAPI handle,
int page_number)
The recognized text is returned as a string in the same format as a box
file used in training.
|
static int |
TessAPI1.TessBaseAPIGetDoubleVariable(TessAPI1.TessBaseAPI handle,
java.lang.String name,
java.nio.DoubleBuffer value) |
static java.lang.String |
TessAPI1.TessBaseAPIGetHOCRText(TessAPI1.TessBaseAPI handle,
int page_number)
Makes an HTML-formatted string with hOCR markup from the internal data
structures.
|
static java.lang.String |
TessAPI1.TessBaseAPIGetInitLanguagesAsString(TessAPI1.TessBaseAPI handle)
Returns the languages string used in the last valid initialization.
|
static int |
TessAPI1.TessBaseAPIGetIntVariable(TessAPI1.TessBaseAPI handle,
java.lang.String name,
java.nio.IntBuffer value)
Returns true (1) if the parameter was found among Tesseract parameters.
|
static TessAPI1.TessResultIterator |
TessAPI1.TessBaseAPIGetIterator(TessAPI1.TessBaseAPI handle)
Get a reading-order iterator to the results of LayoutAnalysis and/or
Recognize.
|
static com.sun.jna.ptr.PointerByReference |
TessAPI1.TessBaseAPIGetLoadedLanguagesAsVector(TessAPI1.TessBaseAPI handle)
Returns the loaded languages in the vector of STRINGs.
|
static int |
TessAPI1.TessBaseAPIGetPageSegMode(TessAPI1.TessBaseAPI handle)
Return the current page segmentation mode.
|
static java.lang.String |
TessAPI1.TessBaseAPIGetStringVariable(TessAPI1.TessBaseAPI handle,
java.lang.String name) |
static int |
TessAPI1.TessBaseAPIGetTextDirection(TessAPI1.TessBaseAPI handle,
java.nio.IntBuffer out_offset,
java.nio.FloatBuffer out_slope) |
static java.lang.String |
TessAPI1.TessBaseAPIGetUnichar(TessAPI1.TessBaseAPI handle,
int unichar_id)
This method returns the string form of the specified unichar.
|
static java.lang.String |
TessAPI1.TessBaseAPIGetUNLVText(TessAPI1.TessBaseAPI handle)
The recognized text is returned in UNLV format: Latin-1 with specific
reject and suspect codes. In the underlying C++ API the result is a char*
that must be freed with the delete [] operator.
|
static java.lang.String |
TessAPI1.TessBaseAPIGetUTF8Text(TessAPI1.TessBaseAPI handle)
The recognized text is returned encoded as UTF-8. In the underlying C++
API the result is a char* that must be freed with the delete [] operator.
|
static int |
TessAPI1.TessBaseAPIInit1(TessAPI1.TessBaseAPI handle,
java.lang.String datapath,
java.lang.String language,
int oem,
com.sun.jna.ptr.PointerByReference configs,
int configs_size)
Instances are now mostly thread-safe and totally independent, but some
global parameters remain.
|
static int |
TessAPI1.TessBaseAPIInit2(TessAPI1.TessBaseAPI handle,
java.lang.String datapath,
java.lang.String language,
int oem) |
static int |
TessAPI1.TessBaseAPIInit3(TessAPI1.TessBaseAPI handle,
java.lang.String datapath,
java.lang.String language) |
static void |
TessAPI1.TessBaseAPIInitForAnalysePage(TessAPI1.TessBaseAPI handle)
Init only for page layout analysis.
|
static int |
TessAPI1.TessBaseAPIInitLangMod(TessAPI1.TessBaseAPI handle,
java.lang.String datapath,
java.lang.String language)
Init only the lang model component of Tesseract.
|
static int |
TessAPI1.TessBaseAPIIsValidWord(TessAPI1.TessBaseAPI handle,
java.lang.String word)
Check whether a word is valid according to Tesseract's language model.
|
static int |
TessAPI1.TessBaseAPIMeanTextConf(TessAPI1.TessBaseAPI handle)
Returns the (average) confidence value between 0 and 100.
|
static void |
TessAPI1.TessBaseAPIPrintVariablesToFile(TessAPI1.TessBaseAPI handle,
java.lang.String filename)
Print Tesseract parameters to the given file.
Note: This must not be the first method called after the instance is created. |
static java.lang.String |
TessAPI1.TessBaseAPIProcessPages(TessAPI1.TessBaseAPI handle,
java.lang.String filename,
java.lang.String retry_config,
int timeout_millisec)
Recognizes all the pages in the named file, given as a multi-page TIFF, a
list of filenames, or a single image, and gets the appropriate kind of text
according to the parameters tessedit_create_boxfile,
tessedit_make_boxes_from_boxes, tessedit_write_unlv, and
tessedit_create_hocr. |
static void |
TessAPI1.TessBaseAPIReadConfigFile(TessAPI1.TessBaseAPI handle,
java.lang.String filename,
int init_only)
Read a "config" file containing a set of param, value pairs.
|
static int |
TessAPI1.TessBaseAPIRecognize(TessAPI1.TessBaseAPI handle,
TessAPI1.ETEXT_DESC monitor)
Recognize the image from SetAndThresholdImage, generating Tesseract
internal structures.
|
static int |
TessAPI1.TessBaseAPIRecognizeForChopTest(TessAPI1.TessBaseAPI handle,
TessAPI1.ETEXT_DESC monitor)
Variant on Recognize used for testing chopper.
|
static java.lang.String |
TessAPI1.TessBaseAPIRect(TessAPI1.TessBaseAPI handle,
java.nio.ByteBuffer imagedata,
int bytes_per_pixel,
int bytes_per_line,
int left,
int top,
int width,
int height)
Recognize a rectangle from an image and return the result as a string.
|
static void |
TessAPI1.TessBaseAPISetImage(TessAPI1.TessBaseAPI handle,
java.nio.ByteBuffer imagedata,
int width,
int height,
int bytes_per_pixel,
int bytes_per_line)
Provide an image for Tesseract to recognize.
|
static void |
TessAPI1.TessBaseAPISetInputName(TessAPI1.TessBaseAPI handle,
java.lang.String name)
Set the name of the input file.
|
static void |
TessAPI1.TessBaseAPISetOutputName(TessAPI1.TessBaseAPI handle,
java.lang.String name)
Set the name of the bonus output files.
|
static void |
TessAPI1.TessBaseAPISetPageSegMode(TessAPI1.TessBaseAPI handle,
int mode)
Set the current page segmentation mode.
|
static void |
TessAPI1.TessBaseAPISetRectangle(TessAPI1.TessBaseAPI handle,
int left,
int top,
int width,
int height)
Restrict recognition to a sub-rectangle of the image.
|
static void |
TessAPI1.TessBaseAPISetSourceResolution(TessAPI1.TessBaseAPI handle,
int ppi)
Set the resolution of the source image in pixels per inch so font size
information can be calculated in results.
|
static int |
TessAPI1.TessBaseAPISetVariable(TessAPI1.TessBaseAPI handle,
java.lang.String name,
java.lang.String value)
Set the value of an internal "parameter." Supply the name of the
parameter and the value as a string, just as you would in a config file.
|
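
The `TessBaseAPISetImage` entry above takes a raw `java.nio.ByteBuffer` together with `width`, `height`, `bytes_per_pixel`, and `bytes_per_line`, but the table does not show how those values relate to one another. The following is a minimal sketch, using only the JDK, of one way to derive them for an 8-bit grayscale image. The class `GrayImageArgs` and its field names are hypothetical and not part of Tess4J. The sketch assumes tightly packed rows (so `bytes_per_line` equals `width * bytes_per_pixel`), and the choice of a direct buffer and of the luma weights are likewise assumptions rather than requirements stated by this page.

```java
import java.awt.image.BufferedImage;
import java.nio.ByteBuffer;

/**
 * Hypothetical helper (not part of Tess4J): converts a BufferedImage into
 * the values that the TessBaseAPISetImage signature asks for.
 */
final class GrayImageArgs {
    final ByteBuffer imagedata;
    final int width;
    final int height;
    final int bytesPerPixel;
    final int bytesPerLine;

    private GrayImageArgs(ByteBuffer imagedata, int width, int height,
                          int bytesPerPixel, int bytesPerLine) {
        this.imagedata = imagedata;
        this.width = width;
        this.height = height;
        this.bytesPerPixel = bytesPerPixel;
        this.bytesPerLine = bytesPerLine;
    }

    /** Packs the image as 8-bit grayscale, one byte per pixel, row by row. */
    static GrayImageArgs from(BufferedImage source) {
        int width = source.getWidth();
        int height = source.getHeight();
        int bytesPerPixel = 1;                    // 8-bit gray
        int bytesPerLine = width * bytesPerPixel; // assumption: no row padding

        // Assumption: a direct buffer is a convenient choice for native calls.
        ByteBuffer data = ByteBuffer.allocateDirect(bytesPerLine * height);

        for (int y = 0; y < height; y++) {
            for (int x = 0; x < width; x++) {
                int rgb = source.getRGB(x, y);
                int r = (rgb >> 16) & 0xFF;
                int g = (rgb >> 8) & 0xFF;
                int b = rgb & 0xFF;
                // Integer approximation of a common luma weighting.
                int luma = (r * 299 + g * 587 + b * 114) / 1000;
                // Absolute put: the buffer position stays at 0.
                data.put(y * bytesPerLine + x, (byte) luma);
            }
        }
        return new GrayImageArgs(data, width, height, bytesPerPixel, bytesPerLine);
    }
}
```

The fields correspond, in order, to the `imagedata`, `width`, `height`, `bytes_per_pixel`, and `bytes_per_line` parameters listed for `TessBaseAPISetImage`. The `handle` argument would come from `TessBaseAPICreate()` after one of the `TessBaseAPIInit` variants has succeeded.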