summaryrefslogtreecommitdiff
path: root/graphics/claraocr/DESCR
blob: 9248fce3fed6c7865b89b3ee83883222cd28abf0 (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
Clara OCR is a free (GPL) Optical Character Recognition (OCR) program
for systems that support the C library and the X windows system (e.g.
most flavours of Unix). The development platform of Clara OCR is
32-bit Intel running GNU/Linux.

Clara OCR is intended for large scale digitalization projects. It
features a powerful GUI and a web interface for cooperative
digitalization of books. Clara OCR development started in 1999 and
is approaching production quality.

Features:

	Converts pbm/pgm image files to text (ISO-8859)
	Can process scans in batch for large documents
	Can run from the command-line
	Is relatively easy to train

Non-features:

	Is not "omnifont"; you must train it for each document
	Does not scan the images
	Does not support unicode
	Cannot read handwriting