Skip to content

tnt305/Scene-Text-Recognition

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ABOUT SCENE TEXT RECOGNITION

Scene Text Recognition is an application of image processing algorithms and character recognition to identify text appearing in natural images. This problem has many practical applications, such as:

  • Text processing in images: Recognizing text in documents, books, newspapers, signs, etc.
  • Information retrieval: Identifying text in images on the web to extract necessary information.
  • Automation: Recognizing text in images to automate tasks such as order processing, payments, etc.

Additionally, Scene Text Recognition plays a crucial role in various fields including:

  • Optical Character Recognition (OCR): Converting scanned documents or images containing text into editable and searchable formats.
  • Augmented Reality (AR): Overlaying digital information or translations onto real-world scenes containing text.
  • Document Analysis: Analyzing and categorizing documents based on the text content for indexing or archival purposes.
  • Assistive Technologies: Helping visually impaired individuals by converting text from images into audio or braille formats.
  • Surveillance and Security: Extracting text from surveillance camera footage for identification or monitoring purposes.

In this project, we will experiment the task on ICDAR2003 dataset. You can download it via this link or you can have it on the main page of the competition

=====

A Scene Text Recognition program typically consists of two main stages:

1. Text Detection (Detector): Identifying the location of text in the image. In this project, we use YOLOv8s as the detection model 2. Text Recognition (Recognizer): Recognizing the text at the identified locations. Assuming that YOLOv8s possible to capture full area of text regions, thereso, we only need a light-weighted Recognizer [CRNN](https://arxiv.org/pdf/1507.05717.pdf)

Getting Started

  1. Install requirements.txt

  2. Detection folder as initial step:

%cd detection
preprocessing.py
detection.py
  1. Recognition is a bit more complicated but you can follow the recognition.py for keep track what to do.

If run correctly, the results should output something like this:

  • Detection:

- Recognition

About

Scene Text Recognition on ICDAR2003 dataset

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published