This is some supplementary material for OViP, which is here: https://github.com/falktan/ovip
When opening the app, it allows to take a photo.
It then applies optical character recognition (OCR) to make the text available for usage.
Any text can be marked and be copied. If it is a URL also an option to directly open it in the browser is provided.
Similarily, if it is a phone number it can directly be called.
In this section OViP is compared to Google Lens. The main advantage of OViP over Google Lens is, that it does not need to send data to any third party and that it is open source. In addition it also works offline.
To compare the performance when recognizing text, a couple of samples were taken and the results compared. Not too surprisingly Google Lens performs much better.
Here are screenshots of how Google Lens performed. It recognized all examples correctly
All further images listed here were correctly recognized by Google Lens, so to avoid too many images they will not be added here. Instead we focus on what OViP did or did not recognize.
Here are a couple of images that OViP did recognize correctly. All of those were fairly simple tasks:
Here is an example of an image that was not recognized correctly
In the following image, OViP did not recognize the text at all (below 0.5 threshold):
So in summary, OVIP recognized 3 of those 5 examples correctly. It recognized one partially and one not at all. Google Lens did recognize all of those images correctly. Hence, there is clearly room for improvement on the algorithms used for OViP. On the upside, at least for images taken under good conditions, OViP already proved to work.