Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Test cases #3

Open
falktan opened this issue Apr 15, 2021 · 4 comments
Open

Test cases #3

falktan opened this issue Apr 15, 2021 · 4 comments
Assignees
Labels
enhancement New feature or request

Comments

@falktan
Copy link
Owner

falktan commented Apr 15, 2021

A small collection of sample pictures to test the quality of the OCR algorithm would be helpful.
I.e. there should be about 15 pictures with different lighting conditions and different text size and quality with more or less distracting background.
The repo https://github.com/falktan/ovip-supplementary is probably a good place for these pictures.

In addition, it would be helpful to have a convenient way to test the algorithm on those pictures.

@falktan falktan added the good first issue Good for newcomers label Apr 15, 2021
@falktan
Copy link
Owner Author

falktan commented Apr 17, 2021

It might also be a good idea to search for a reasonable collection of sample images that is available publicly.

@falktan falktan removed the good first issue Good for newcomers label Apr 17, 2021
@falktan
Copy link
Owner Author

falktan commented Apr 17, 2021

This paper references a number of sources for "out in the wild" text (as a bonus it is an interesting read about how to use LSTM for OCR).
https://arxiv.org/pdf/1507.05717.pdf

@falktan falktan self-assigned this Apr 24, 2021
@falktan falktan added the enhancement New feature or request label Apr 24, 2021
@falktan
Copy link
Owner Author

falktan commented Apr 25, 2021

Started work here:
https://github.com/falktan/ocrjs

@falktan
Copy link
Owner Author

falktan commented Jul 26, 2021

Ideas how to improve tesseract see:
https://tesseract-ocr.github.io/tessdoc/ImproveQuality.html

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant