-
Notifications
You must be signed in to change notification settings - Fork 4
/
DESCRIPTION
21 lines (21 loc) · 992 Bytes
/
DESCRIPTION
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
Package: Rtesseract
Title: Interface to the tesseract OCR system
Description: This provides a flexible Optical Character Recognition (OCR)
facility via the tesseract C++ library.
This allows us to read text from images. It also allows us to analyze
the results and possible errors in the recognition.
We can do data analysis on the errors, if we know the truth, and
explore how we may improve the recognition.
It also provides some functionality from the leptonica library for
performing image processing. This allows us, for example, to detect
lines in an image, important for interpreting tables.
Version: 0.6-0
Author: Duncan Temple Lang, Matt Espe
Maintainer: Duncan Temple Lang <[email protected]>
License: Apache License
SystemRequirements: tesseract_3.0.3 release candidate or higher
https://code.google.com/p/tesseract-ocr/
tesseract requires leptonica.
Imports: methods, grDevices, utils
Suggests: png, tiff, jpeg, Aspell, XML, RJSONIO
Others: rjpeg