image processing to improve tesseract OCR accuracy
I've been using tesseract to convert screenshot image into text. The size of image is small, and I'm looking for tips on what sort of image processing/image enhancement might improve the results. I've noticed that text in the image looks find and perfect to read through eye but when i pass it to tesseract it is unable to find text from image.
I think you would be better to put it in the tesseract forum itself.
If you are using open-source OCR engine, it may be the case that it does not implement an "omni-font" OCR algorithm (note: not to be confused with a similarly-named commercial OCR vendor.) * An "omni-font" OCR algorithm is able to recognize Latin alphabets from many typefaces, including unforeseen ones without having specifically being trained against that typeface. Without an "omni-font" algorithm, one will sometimes have to run the OCR training algorithm using the same document text that one would like to recognize. *(Posting as a comment since I do not know concretely whether Tesseract OCR is omni-font or not.)
why not try the online service, i used this online ocr service, it's accuracy well and free to use.