GitHub - tnt305/Scene-Text-Recognition: Scene Text Recognition on ICDAR2003 dataset

ABOUT SCENE TEXT RECOGNITION

Scene Text Recognition is an application of image processing algorithms and character recognition to identify text appearing in natural images. This problem has many practical applications, such as:

Text processing in images: Recognizing text in documents, books, newspapers, signs, etc.
Information retrieval: Identifying text in images on the web to extract necessary information.
Automation: Recognizing text in images to automate tasks such as order processing, payments, etc.

Additionally, Scene Text Recognition plays a crucial role in various fields including:

Optical Character Recognition (OCR): Converting scanned documents or images containing text into editable and searchable formats.
Augmented Reality (AR): Overlaying digital information or translations onto real-world scenes containing text.
Document Analysis: Analyzing and categorizing documents based on the text content for indexing or archival purposes.
Assistive Technologies: Helping visually impaired individuals by converting text from images into audio or braille formats.
Surveillance and Security: Extracting text from surveillance camera footage for identification or monitoring purposes.

In this project, we experiment with this task on the ICDAR2003 dataset. You can download it via this link, or get it from the main page of the competition.

=====

A Scene Text Recognition program typically consists of two main stages:

1. Text Detection (Detector): Identifying the location of text in the image. In this project, we use YOLOv8s as the detection model.
2. Text Recognition (Recognizer): Recognizing the text at the identified locations. Assuming that YOLOv8s can capture the full area of each text region, we only need a lightweight Recognizer, [CRNN](https://arxiv.org/pdf/1507.05717.pdf).
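To make the hand-off between the two stages concrete, here is a minimal sketch in plain Python. It is not the code from this repository: `crop`, `read_scene_text`, and the `detect` / `recognize` callables are hypothetical names, and the image is a nested list of pixel values so the example has no dependencies. In practice `detect` would wrap the YOLOv8s model and `recognize` would wrap the CRNN.

```python
from typing import Callable, List, Sequence, Tuple

Box = Tuple[int, int, int, int]  # (x1, y1, x2, y2) in pixels, x2/y2 exclusive
Image = List[List[int]]          # grayscale image as rows of pixel values


def crop(image: Image, box: Box) -> Image:
    """Cut the region described by `box` out of `image`, clamped to its bounds."""
    height = len(image)
    width = len(image[0]) if height else 0
    x1, y1, x2, y2 = box
    x1, x2 = max(0, x1), min(width, x2)
    y1, y2 = max(0, y1), min(height, y2)
    return [row[x1:x2] for row in image[y1:y2]]


def read_scene_text(
    image: Image,
    detect: Callable[[Image], Sequence[Box]],   # hypothetical detector wrapper
    recognize: Callable[[Image], str],          # hypothetical recognizer wrapper
) -> List[Tuple[Box, str]]:
    """Stage 1: find text boxes. Stage 2: read the text inside each crop."""
    results = []
    for box in detect(image):
        patch = crop(image, box)
        if patch and patch[0]:  # skip boxes that fall entirely outside the image
            results.append((box, recognize(patch)))
    return results
```

Because the recognizer only ever sees the cropped patch, its quality depends directly on how tightly the detector's boxes cover the text, which is the assumption stated in step 2.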

Getting Started

Install the dependencies listed in requirements.txt.
Start in the detection folder and run the scripts in this order:

%cd detection
preprocessing.py
detection.py
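YOLO-style detectors are trained on label files where each line is `class x_center y_center width height`, with all four box values normalized to the image size. The sketch below shows that conversion for a box given as top-left corner plus width and height in pixels. It is an illustration only: the assumption that the dataset annotations arrive in that pixel form, and the helper names `to_yolo_box` and `yolo_label_line`, are not taken from preprocessing.py.

```python
from typing import Tuple


def to_yolo_box(
    x: float, y: float, w: float, h: float, img_w: int, img_h: int
) -> Tuple[float, float, float, float]:
    """Convert a top-left pixel box (x, y, w, h) to normalized (cx, cy, w, h)."""
    if img_w <= 0 or img_h <= 0:
        raise ValueError("image dimensions must be positive")
    cx = (x + w / 2) / img_w   # box center, as a fraction of image width
    cy = (y + h / 2) / img_h   # box center, as a fraction of image height
    return cx, cy, w / img_w, h / img_h


def yolo_label_line(class_id: int, box: Tuple[float, float, float, float]) -> str:
    """Format one object as a line of a YOLO label file."""
    return f"{class_id} " + " ".join(f"{value:.6f}" for value in box)
```

For a single-class text detector, every line would use the same `class_id` (for example 0).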

Recognition is a bit more involved, but you can follow recognition.py to keep track of what to do.
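One part of recognition that is easy to get wrong is turning the model's per-frame scores into a string. The CRNN paper linked above uses a CTC transcription layer, so a greedy CTC decoder is sketched here in plain Python. Whether recognition.py decodes this way is an assumption, and `ctc_greedy_decode`, `BLANK`, and the index-to-character mapping are hypothetical choices made for the example.

```python
from typing import List, Sequence

BLANK = 0  # assumed: class index 0 is the CTC blank, index i > 0 is charset[i - 1]


def ctc_greedy_decode(scores: Sequence[Sequence[float]], charset: str) -> str:
    """Greedy CTC decoding: argmax per frame, merge repeats, then drop blanks.

    `scores` has one row per time step and one column per class
    (blank plus every character in `charset`).
    """
    # Pick the highest-scoring class at every time step.
    best: List[int] = [
        max(range(len(frame)), key=frame.__getitem__) for frame in scores
    ]
    chars: List[str] = []
    previous = BLANK
    for index in best:
        # Emit a character only when it is not blank and not a repeat of the
        # previous frame; a blank between two equal characters keeps both.
        if index != BLANK and index != previous:
            chars.append(charset[index - 1])
        previous = index
    return "".join(chars)
```

The blank class is what lets the model output doubled letters: the frame sequence `a a blank a` decodes to `aa`, while `a a a` decodes to `a`.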

If everything runs correctly, the output should look something like this:

Detection:

Recognition:
