Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Auto page segmentation failing with one line image input #4370

Open
erjoalgo opened this issue Dec 16, 2024 · 0 comments
Open

Auto page segmentation failing with one line image input #4370

erjoalgo opened this issue Dec 16, 2024 · 0 comments

Comments

@erjoalgo
Copy link

Current Behavior

last-ocr
I'm attaching a image of a text that appears to be clear and should be easily readable, but fails with the default options:

tesseract /home/ealfonso/pictures/auto-scrots/last-ocr.png -
Estimating resolution as 1528
Empty page!!
Estimating resolution as 1528
Empty page!!

However, the recognition does work if I manually specify a certain value for page segmentation:

tesseract --psm 9 /home/ealfonso/pictures/auto-scrots/last-ocr.png -
FRS6HR45KS4

Expected Behavior

Ideally there would be no need to specify page segmentation and this would work out of the box.

Suggested Fix

No response

tesseract -v

No response

Operating System

Debian 12 Bookworm

Other Operating System

No response

uname -a

Linux phantom 6.1.0-21-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.90-1 (2024-05-03) x86_64 GNU/Linux

Compiler

No response

CPU

No response

Virtualization / Containers

No response

Other Information

tesseract --version
tesseract 5.3.0
 leptonica-1.82.0
  libgif 5.2.1 : libjpeg 6b (libjpeg-turbo 2.1.2) : libpng 1.6.39 : libtiff 4.5.0 : zlib 1.2.13 : libwebp 1.2.4 : libopenjp2 2.5.0
 Found AVX2
 Found AVX
 Found FMA
 Found SSE4.1
 Found OpenMP 201511
 Found libarchive 3.6.2 zlib/1.2.13 liblzma/5.4.1 bz2lib/1.0.8 liblz4/1.9.4 libzstd/1.5.4
 Found libcurl/7.88.1 OpenSSL/3.0.13 zlib/1.2.13 brotli/1.0.9 zstd/1.5.4 libidn2/2.3.3 libpsl/0.21.2 (+libidn2/2.3.3) libssh2/1.10.0 nghttp2/1.52.0 librtmp/2.3 OpenLDAP/2.5.13
@amitdo amitdo changed the title Failing Test Case, wrong page segmentation Auto page segmentation failing with one line image input Dec 22, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants