Package net.sourceforge.tess4j
Interface ITesseract
- All Known Implementing Classes:
Tesseract
,Tesseract1
public interface ITesseract
An interface represents common OCR methods.
-
Nested Class Summary
Modifier and TypeInterfaceDescriptionstatic enum
Rendered formats supported by Tesseract. -
Field Summary
-
Method Summary
Modifier and TypeMethodDescriptionvoid
createDocuments
(String[] filenames, String[] outputbases, List<ITesseract.RenderedFormat> formats) Creates documents for given renderers.void
createDocuments
(String filename, String outputbase, List<ITesseract.RenderedFormat> formats) Creates documents for given renderers.createDocumentsWithResults
(BufferedImage[] bis, String[] filenames, String[] outputbases, List<ITesseract.RenderedFormat> formats, int pageIteratorLevel) Creates documents with OCR results for given renderers at specified page iterator level.createDocumentsWithResults
(BufferedImage bi, String filename, String outputbase, List<ITesseract.RenderedFormat> formats, int pageIteratorLevel) Creates documents with OCR result for given renderers at specified page iterator level.createDocumentsWithResults
(String[] filenames, String[] outputbases, List<ITesseract.RenderedFormat> formats, int pageIteratorLevel) Creates documents with OCR results for given renderers at specified page iterator level.createDocumentsWithResults
(String filename, String outputbase, List<ITesseract.RenderedFormat> formats, int pageIteratorLevel) Creates documents with OCR result for given renderers at specified page iterator level.doOCR
(int xsize, int ysize, ByteBuffer buf, Rectangle rect, int bpp) Performs OCR operation.doOCR
(int xsize, int ysize, ByteBuffer buf, String filename, Rectangle rect, int bpp) Performs OCR operation.doOCR
(BufferedImage bi) Performs OCR operation.doOCR
(BufferedImage bi, Rectangle rect) Performs OCR operation.Performs OCR operation.Performs OCR operation.Performs OCR operation.Performs OCR operation.getSegmentedRegions
(BufferedImage bi, int pageIteratorLevel) Gets segmented regions at specified page iterator level.getWords
(BufferedImage bi, int pageIteratorLevel) Gets recognized words at specified page iterator level.void
setConfigs
(List<String> configs) Sets configs to be passed to Tesseract'sInit
method.void
setDatapath
(String datapath) Sets tessdata path.void
setLanguage
(String language) Sets language for OCR.void
setOcrEngineMode
(int ocrEngineMode) Sets OCR engine mode.void
setPageSegMode
(int mode) Sets page segmentation mode.void
setTessVariable
(String key, String value) Deprecated.void
setVariable
(String key, String value) Sets the value of Tesseract's internal parameter.
-
Field Details
-
htmlBeginTag
- See Also:
-
htmlEndTag
- See Also:
-
-
Method Details
-
doOCR
Performs OCR operation.- Parameters:
imageFile
- an image file- Returns:
- the recognized text
- Throws:
TesseractException
-
doOCR
Performs OCR operation.- Parameters:
imageFile
- an image filerect
- the bounding rectangle defines the region of the image to be recognized. A rectangle of zero dimension ornull
indicates the whole image.- Returns:
- the recognized text
- Throws:
TesseractException
-
doOCR
Performs OCR operation.- Parameters:
bi
- a buffered image- Returns:
- the recognized text
- Throws:
TesseractException
-
doOCR
Performs OCR operation.- Parameters:
bi
- a buffered imagerect
- the bounding rectangle defines the region of the image to be recognized. A rectangle of zero dimension ornull
indicates the whole image.- Returns:
- the recognized text
- Throws:
TesseractException
-
doOCR
Performs OCR operation.- Parameters:
imageList
- a list ofIIOImage
objectsrect
- the bounding rectangle defines the region of the image to be recognized. A rectangle of zero dimension ornull
indicates the whole image.- Returns:
- the recognized text
- Throws:
TesseractException
-
doOCR
Performs OCR operation.- Parameters:
imageList
- a list ofIIOImage
objectsfilename
- input file name. Needed only for training and reading a UNLV zone file.rect
- the bounding rectangle defines the region of the image to be recognized. A rectangle of zero dimension ornull
indicates the whole image.- Returns:
- the recognized text
- Throws:
TesseractException
-
doOCR
String doOCR(int xsize, int ysize, ByteBuffer buf, Rectangle rect, int bpp) throws TesseractException Performs OCR operation. UseSetImage
, (optionally)SetRectangle
, and one or more of theGet*Text
functions.- Parameters:
xsize
- width of imageysize
- height of imagebuf
- pixel datarect
- the bounding rectangle defines the region of the image to be recognized. A rectangle of zero dimension ornull
indicates the whole image.bpp
- bits per pixel, represents the bit depth of the image, with 1 for binary bitmap, 8 for gray, and 24 for color RGB.- Returns:
- the recognized text
- Throws:
TesseractException
-
doOCR
String doOCR(int xsize, int ysize, ByteBuffer buf, String filename, Rectangle rect, int bpp) throws TesseractException Performs OCR operation. UseSetImage
, (optionally)SetRectangle
, and one or more of theGet*Text
functions.- Parameters:
xsize
- width of imageysize
- height of imagebuf
- pixel datafilename
- input file name. Needed only for training and reading a UNLV zone file.rect
- the bounding rectangle defines the region of the image to be recognized. A rectangle of zero dimension ornull
indicates the whole image.bpp
- bits per pixel, represents the bit depth of the image, with 1 for binary bitmap, 8 for gray, and 24 for color RGB.- Returns:
- the recognized text
- Throws:
TesseractException
-
setDatapath
Sets tessdata path.- Parameters:
datapath
- the tessdata path to set
-
setLanguage
Sets language for OCR.- Parameters:
language
- the language code, which follows ISO 639-3 standard.
-
setOcrEngineMode
void setOcrEngineMode(int ocrEngineMode) Sets OCR engine mode.- Parameters:
ocrEngineMode
- the OcrEngineMode to set
-
setPageSegMode
void setPageSegMode(int mode) Sets page segmentation mode.- Parameters:
mode
- the page segmentation mode to set
-
setTessVariable
Deprecated.UsesetVariable(java.lang.String,java.lang.String)
instead.Sets the value of Tesseract's internal parameter.- Parameters:
key
- variable name, e.g.,tessedit_create_hocr
,tessedit_char_whitelist
, etc.value
- value for corresponding variable, e.g., "1", "0", "0123456789", etc.
-
setVariable
Sets the value of Tesseract's internal parameter.- Parameters:
key
- variable name, e.g.,tessedit_create_hocr
,tessedit_char_whitelist
, etc.value
- value for corresponding variable, e.g., "1", "0", "0123456789", etc.
-
setConfigs
Sets configs to be passed to Tesseract'sInit
method.- Parameters:
configs
- list of config filenames, e.g., "digits", "bazaar", "quiet"
-
createDocuments
void createDocuments(String filename, String outputbase, List<ITesseract.RenderedFormat> formats) throws TesseractException Creates documents for given renderers.- Parameters:
filename
- input imageoutputbase
- output filename without extensionformats
- types of renderers- Throws:
TesseractException
-
createDocuments
void createDocuments(String[] filenames, String[] outputbases, List<ITesseract.RenderedFormat> formats) throws TesseractException Creates documents for given renderers.- Parameters:
filenames
- array of input filesoutputbases
- array of output filenames without extensionformats
- types of renderers- Throws:
TesseractException
-
createDocumentsWithResults
OCRResult createDocumentsWithResults(BufferedImage bi, String filename, String outputbase, List<ITesseract.RenderedFormat> formats, int pageIteratorLevel) throws TesseractException Creates documents with OCR result for given renderers at specified page iterator level.- Parameters:
bi
- input buffered imagefilename
- filename (optional)outputbase
- output filenames without extensionformats
- types of rendererpageIteratorLevel
- TessPageIteratorLevel enum- Returns:
- OCR result
- Throws:
TesseractException
-
createDocumentsWithResults
List<OCRResult> createDocumentsWithResults(BufferedImage[] bis, String[] filenames, String[] outputbases, List<ITesseract.RenderedFormat> formats, int pageIteratorLevel) throws TesseractException Creates documents with OCR results for given renderers at specified page iterator level.- Parameters:
bis
- array of input buffered imagesfilenames
- array of filenamesoutputbases
- array of output filenames without extensionformats
- types of rendererpageIteratorLevel
- TessPageIteratorLevel enum- Returns:
- list of OCR results
- Throws:
TesseractException
-
createDocumentsWithResults
OCRResult createDocumentsWithResults(String filename, String outputbase, List<ITesseract.RenderedFormat> formats, int pageIteratorLevel) throws TesseractException Creates documents with OCR result for given renderers at specified page iterator level.- Parameters:
filename
- input fileoutputbase
- output filenames without extensionformats
- types of rendererpageIteratorLevel
- TessPageIteratorLevel enum- Returns:
- OCR result
- Throws:
TesseractException
-
createDocumentsWithResults
List<OCRResult> createDocumentsWithResults(String[] filenames, String[] outputbases, List<ITesseract.RenderedFormat> formats, int pageIteratorLevel) throws TesseractException Creates documents with OCR results for given renderers at specified page iterator level.- Parameters:
filenames
- array of input filesoutputbases
- array of output filenames without extensionformats
- types of rendererpageIteratorLevel
- TessPageIteratorLevel enum- Returns:
- list of OCR results
- Throws:
TesseractException
-
getSegmentedRegions
List<Rectangle> getSegmentedRegions(BufferedImage bi, int pageIteratorLevel) throws TesseractException Gets segmented regions at specified page iterator level.- Parameters:
bi
- input buffered imagepageIteratorLevel
- TessPageIteratorLevel enum- Returns:
- list of
Rectangle
- Throws:
TesseractException
-
getWords
Gets recognized words at specified page iterator level.- Parameters:
bi
- input buffered imagepageIteratorLevel
- TessPageIteratorLevel enum- Returns:
- list of
Word
-
setVariable(java.lang.String,java.lang.String)
instead.