OCR has a tendency to misread the '<' between first and middle names. #58

Nimbus76 · 2021-05-11T18:39:38Z

read_mrz()/tesseract tends to interpret the '<' between the first and middle names as a 'K'.

I have tried multiple scans of varying quality from several passports, and this anomaly occurs more often than not. Sometimes it also interprets the '<' as an 'X'.

Every other field has been reliable.

canklot · 2022-04-19T15:57:51Z

Are you using the legacy mode with tesseract?

RanaOsamaAsif · 2022-10-29T10:17:00Z

I'm facing the same issue with names. Is there any way to fix or improve this behavior?

konstantint · 2022-10-29T13:57:36Z

@RanaOsamaAsif Try both the legacy and new Tesseract models. In my experience the legacy model was more robust with respect to this particular issue.
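
One way to act on this suggestion is to OCR the same image twice, once per model, and compare the name fields character by character. The sketch below covers only the comparison step and is not part of PassportEye: `merge_name_readings` is a hypothetical helper, and it assumes you already have the raw MRZ name field from each run (for example by calling `read_mrz` once with default settings and once with Tesseract's legacy engine flag `--oem 0` passed through, assuming your installation supports that). It prefers '<' only where the two readings disagree and the other character is 'K' or 'X', so a genuine 'K' that both models agree on is left alone.

```python
def merge_name_readings(legacy: str, lstm: str) -> str:
    """Merge two OCR readings of the same MRZ name field.

    Where the readings differ and one has '<' while the other has 'K' or 'X',
    keep '<'. Every other position is taken from the legacy reading.
    """
    if len(legacy) != len(lstm):
        # Misaligned readings cannot be compared position by position,
        # so fall back to the legacy reading unchanged.
        return legacy

    merged = []
    for a, b in zip(legacy, lstm):
        if a != b and "<" in (a, b) and (a in "KX" or b in "KX"):
            merged.append("<")
        else:
            merged.append(a)
    return "".join(merged)


# Hypothetical readings of the same name field from the two models.
legacy_reading = "DOE<<JOHN<ADAM<<<<"
lstm_reading = "DOE<<JOHNKADAM<<<<"
print(merge_name_readings(legacy_reading, lstm_reading))
```

This is a heuristic, not a guarantee: if both models misread the same '<', the error passes through, so the result is best treated as a hint for manual review.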
