Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

EAST detect some texture as a text! #358

Open
hahmad2008 opened this issue Aug 25, 2020 · 2 comments
Open

EAST detect some texture as a text! #358

hahmad2008 opened this issue Aug 25, 2020 · 2 comments

Comments

@hahmad2008
Copy link

I am running the demo using the pre-trained model

python eval.py --test_data_path=/tmp/images/ --gpu_list=0 --checkpoint_path=/tmp/east_icdar2015_resnet_v1_50_rbox/ \ --output_dir=/tmp/

However, EAST detect some features as text, is there any reason behind that? Is there any configuration to tune or change?

Samples:

Screen Shot 2020-08-24 at 9 41 49 PM

Screen Shot 2020-08-24 at 10 53 06 PM

Screen Shot 2020-08-24 at 9 46 43 PM

@CarMiranda
Copy link

No, there is no "reason". EAST is just another deep learning model, so, as of right now, we cannot explain its outputs. And, again, as any other deep learning model, it can misbehave.

There are multiple things you can do:

  • To improve accuracy in your particular data, you can tune the threshold parameters; maybe they are too low? This is not truly a recommendation since it does not improve your model...
  • To improve overall robustness, train on more data, preferably adversarial to your case. Here, it predicts texture as text, so you can train on images with text and texture to force it to learn these kind of texture are not text. This is also known as hard example mining.
  • To understand what is going on, try to interpret intermediate results (the score map f_score and geometry map f_geometry). Are intermediate results similar between these textures and text? If so, then the model has not learned to discriminate between them, and needs more training.

@francotheengineer
Copy link

@CarMiranda answer is correct.

You could also try using larger Backbones, more UNet layers, deeper box/score maps heads.
Other mods could be changing activation functions, different Batch Norm layers. The list is endless.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants