The following methods break TesseractRect into pieces, so you can get hold of the thresholded image, get the text in different formats, get bounding boxes, confidences etc.
int * tesseract::TessBaseAPI::AllWordConfidences | ( | ) | [inherited] |
Returns all word confidences (between 0 and 100) in an array, terminated by -1. The calling function must delete [] after use. The number of confidences should correspond to the number of space- delimited words in GetUTF8Text.
void tesseract::TessBaseAPI::Clear | ( | ) | [inherited] |
Free up recognition results and any stored image data, without actually freeing any recognition data that would be time-consuming to reload. Afterwards, you must call SetImage or TesseractRect before doing any Recognize or Get* operation.
Estimates the Orientation And Script of the image.
void tesseract::TessBaseAPI::DumpPGM | ( | const char * | filename | ) | [inherited] |
Dump the internal binary image to a PGM file.
void tesseract::TessBaseAPI::End | ( | ) | [inherited] |
Close down tesseract and free up all memory. End() is equivalent to destructing and reconstructing your TessBaseAPI. Once End() has been used, none of the other API functions may be used other than Init and anything declared above it in the class definition.
char * tesseract::TessBaseAPI::GetBoxText | ( | int | page_number | ) | [inherited] |
The recognized text is returned as a char* which is coded in the same format as a box file used in training. Returned string must be freed with the delete [] operator. Constructs coordinates in the original image - not just the rectangle. page_number is a 0-base page index that will appear in the box file.
const Dawg * tesseract::TessBaseAPI::GetDawg | ( | int | i | ) | const [inherited] |
Return the pointer to the i-th dawg loaded into tesseract_ object.
void tesseract::TessBaseAPI::GetFeatures | ( | INT_FEATURE_ARRAY | int_features, | |
int * | num_features | |||
) | [inherited] |
This method returns the features associated with the input image.
char * tesseract::TessBaseAPI::GetHOCRText | ( | int | page_id | ) | [inherited] |
Make a HTML-formatted string with hOCR markup from the internal data structures. STL removed from original patch submission and refactored by rays. page_id is 1-based and will appear in the output.
const char * tesseract::TessBaseAPI::GetLastInitLanguage | ( | ) | const [inherited] |
Return the language used in the last valid initialization.
Boxa * tesseract::TessBaseAPI::GetRegions | ( | Pixa ** | pixa | ) | [inherited] |
Get the result of page layout analysis as a leptonica-style Boxa, Pixa pair, in reading order. Can be called before or after Recognize.
bool tesseract::TessBaseAPI::GetTextDirection | ( | int * | out_offset, | |
float * | out_slope | |||
) | [inherited] |
Boxa * tesseract::TessBaseAPI::GetTextlines | ( | Pixa ** | pixa, | |
int ** | blockids | |||
) | [inherited] |
Get the textlines as a leptonica-style Boxa, Pixa pair, in reading order. Can be called before or after Recognize. If blockids is not NULL, the block-id of each line is also returned as an array of one element per line. delete [] after use.
Pix * tesseract::TessBaseAPI::GetThresholdedImage | ( | ) | [inherited] |
Get a copy of the internal thresholded image from Tesseract. Caller takes ownership of the Pix and must pixDestroy it. May be called any time after SetImage, or after TesseractRect.
char * tesseract::TessBaseAPI::GetUNLVText | ( | ) | [inherited] |
The recognized text is returned as a char* which is coded as UNLV format Latin-1 with specific reject and suspect codes and must be freed with the delete [] operator.
char * tesseract::TessBaseAPI::GetUTF8Text | ( | ) | [inherited] |
The recognized text is returned as a char* which is coded as UTF8 and must be freed with the delete [] operator.
Boxa * tesseract::TessBaseAPI::GetWords | ( | Pixa ** | pixa | ) | [inherited] |
Get the words as a leptonica-style Boxa, Pixa pair, in reading order. Can be called before or after Recognize.
int tesseract::TessBaseAPI::IsValidWord | ( | const char * | word | ) | [inherited] |
Check whether a word is valid according to Tesseract's language model
int tesseract::TessBaseAPI::MeanTextConf | ( | ) | [inherited] |
Returns the (average) confidence value between 0 and 100.
int tesseract::TessBaseAPI::NumDawgs | ( | ) | const [inherited] |
Return the number of dawgs loaded into tesseract_ object.
int tesseract::TessBaseAPI::Recognize | ( | ETEXT_STRUCT * | monitor | ) | [inherited] |
Recognize the image from SetAndThresholdImage, generating Tesseract internal structures. Returns 0 on success. Optional. The Get*Text functions below will call Recognize if needed. After Recognize, the output is kept internally until the next SetImage.
int tesseract::TessBaseAPI::RecognizeForChopTest | ( | struct ETEXT_STRUCT * | monitor | ) | [inherited] |
Methods to retrieve information after SetAndThresholdImage(), Recognize() or TesseractRect(). (Recognize is called implicitly if needed.) Variant on Recognize used for testing chopper.
void tesseract::TessBaseAPI::SetDictFunc | ( | DictFunc | f | ) | [inherited] |
Set the letter_is_okay function to point somewhere else.
void tesseract::TessBaseAPI::SetImage | ( | const Pix * | pix | ) | [inherited] |
Provide an image for Tesseract to recognize. As with SetImage above, Tesseract doesn't take a copy or ownership or pixDestroy the image, so it must persist until after Recognize. Pix vs raw, which to use? Use Pix where possible. A future version of Tesseract may choose to use Pix as its internal representation and discard IMAGE altogether. Because of that, an implementation that sources and targets Pix may end up with less copies than an implementation that does not.
void tesseract::TessBaseAPI::SetImage | ( | const unsigned char * | imagedata, | |
int | width, | |||
int | height, | |||
int | bytes_per_pixel, | |||
int | bytes_per_line | |||
) | [inherited] |
Provide an image for Tesseract to recognize. Format is as TesseractRect above. Does not copy the image buffer, or take ownership. The source image may be destroyed after Recognize is called, either explicitly or implicitly via one of the Get*Text functions. SetImage clears all recognition results, and sets the rectangle to the full image, so it may be followed immediately by a GetUTF8Text, and it will automatically perform recognition.
void tesseract::TessBaseAPI::SetRectangle | ( | int | left, | |
int | top, | |||
int | width, | |||
int | height | |||
) | [inherited] |
Restrict recognition to a sub-rectangle of the image. Call after SetImage. Each SetRectangle clears the recogntion results so multiple rectangles can be recognized with the same image.
void tesseract::TessBaseAPI::SetThresholder | ( | ImageThresholder * | thresholder | ) | [inline, inherited] |
In extreme cases only, usually with a subclass of Thresholder, it is possible to provide a different Thresholder. The Thresholder may be preloaded with an image, settings etc, or they may be set after. Note that Tesseract takes ownership of the Thresholder and will delete it when it it is replaced or the API is destructed.