image processing to improve tesseract OCR accuracy

tesseract

asked 2014-10-29 02:08:12 -0600

Deepak Kumar
69 ●3 ●8 ●12

updated 2020-11-02 10:48:49 -0600

sturkmen

6772 ●3 ●48 ●79 https://github.com/stu...

I've been using tesseract to convert screenshot image into text. The size of image is small, and I'm looking for tips on what sort of image processing/image enhancement might improve the results. I've noticed that text in the image looks find and perfect to read through eye but when i pass it to tesseract it is unable to find text from image.

edit retag flag offensive close merge delete

Comments

I think you would be better to put it in the tesseract forum itself.

StevenPuttemans ( 2014-10-29 03:19:24 -0600 )edit

If you are using open-source OCR engine, it may be the case that it does not implement an "omni-font" OCR algorithm (note: not to be confused with a similarly-named commercial OCR vendor.) * An "omni-font" OCR algorithm is able to recognize Latin alphabets from many typefaces, including unforeseen ones without having specifically being trained against that typeface. Without an "omni-font" algorithm, one will sometimes have to run the OCR training algorithm using the same document text that one would like to recognize. *(Posting as a comment since I do not know concretely whether Tesseract OCR is omni-font or not.)

rwong ( 2014-10-29 16:43:48 -0600 )edit

why not try the online service, i used this online ocr service, it's accuracy well and free to use.

JonyGreen ( 2015-09-07 00:46:25 -0600 )edit

add a comment

//convert first your image to float to improve precision... img.convertTo(imgTmp, CV_32F); GaussianBlur(imgTmp, imgResult, cv::Size(0, 0), 3); addWeighted(imgTmp, 1.5, imgResult, -0.5, 0, imgResult); // convert back to 8bits gray scale imgResult.convertTo(imgResult, CV_8U);

Comments

my image is not visible after applying these functions. i dont know how to post image in this website. if u give me your mail id i can send u.

thanks

Deepak Kumar ( 2014-11-03 00:18:35 -0600 )edit

add a comment

image processing to improve tesseract OCR accuracy

Comments

1 answer

Comments

Links

Question Tools

Stats

Related questions

image processing to improve tesseract OCR accuracy edit

Comments

1 answer

Comments

Links

Question Tools

Stats

Related questions

image processing to improve tesseract OCR accuracy