admin管理员组文章数量:1023596
I want to recognize a hadwritten digit in a binary image.
I planned on using tesseract ocr, but I could never get the accuracy above 50%. Here is a part of the code I used:
plt.imshow(roi,cmap='gray')
plt.axis('off')
plt.show()
text = pytesseract.image_to_string(roi, config='--psm 10')
print(text)
And the image drawn: digit_4
The text was incorrect most of the time - in the case above, it was '+'. Other incorrect answers included '4.', 'UL', and 'A'. I originally had
text = pytesseract.image_to_string(roi, config='--psm 10 digits')
but removed the digits setting after seeing half of the text come out as blank.
- How can I improve the accuracy?
- Why does the ocr output multiple characters when it is set to recognize a single character?
(I am using version 4.1.1 pytesseract)
I want to recognize a hadwritten digit in a binary image.
I planned on using tesseract ocr, but I could never get the accuracy above 50%. Here is a part of the code I used:
plt.imshow(roi,cmap='gray')
plt.axis('off')
plt.show()
text = pytesseract.image_to_string(roi, config='--psm 10')
print(text)
And the image drawn: digit_4
The text was incorrect most of the time - in the case above, it was '+'. Other incorrect answers included '4.', 'UL', and 'A'. I originally had
text = pytesseract.image_to_string(roi, config='--psm 10 digits')
but removed the digits setting after seeing half of the text come out as blank.
- How can I improve the accuracy?
- Why does the ocr output multiple characters when it is set to recognize a single character?
(I am using version 4.1.1 pytesseract)
本文标签: pythonPytesseract OCR not recognizing digits in clean binary imageStack Overflow
版权声明:本文标题:python - Pytesseract OCR not recognizing digits in clean binary image - Stack Overflow 内容由热心网友自发贡献,该文观点仅代表作者本人, 转载请联系作者并注明出处:http://it.en369.cn/questions/1745585652a2157578.html, 本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如发现本站有涉嫌抄袭侵权/违法违规的内容,一经查实,本站将立刻删除。
发表评论