author | wiz <wiz@pkgsrc.org> | 2007-05-18 06:39:27 +0000 |
---|---|---|
committer | wiz <wiz@pkgsrc.org> | 2007-05-18 06:39:27 +0000 |
commit | 165f5f9cd5b10594e99432e011e41b841eb0ca7f | |
tree | a705c8f6b7adc55c4b7c0be896b56cd6c86a9c90 /graphics/tesseract/DESCR | |
parent | 06c962c6956b0965aa1ed667d76eb084b7819a4e | |
Initial import of tesseract-1.04b from pkgsrc-wip (packaged by heinz@
and myself):
This code is a raw OCR engine. It has NO PAGE LAYOUT ANALYSIS, NO
OUTPUT FORMATTING, and NO UI. It can only process an image of a
single column and create text from it. It can detect fixed pitch
vs proportional text. Having said that, in 1995, this engine was
in the top 3 in terms of character accuracy, and it compiles and
runs on both Linux and Windows. Another current limitation is that
it only recognizes English and its character set is only US-ASCII.
Training code IS included in the open source release however, and
will be included in a future release.
Diffstat (limited to 'graphics/tesseract/DESCR')
-rw-r--r-- | graphics/tesseract/DESCR | 9 |
1 files changed, 9 insertions, 0 deletions
diff --git a/graphics/tesseract/DESCR b/graphics/tesseract/DESCR
new file mode 100644
index 00000000000..dcc8fb3daad
--- /dev/null
+++ b/graphics/tesseract/DESCR
@@ -0,0 +1,9 @@
+This code is a raw OCR engine. It has NO PAGE LAYOUT ANALYSIS, NO
+OUTPUT FORMATTING, and NO UI. It can only process an image of a
+single column and create text from it. It can detect fixed pitch
+vs proportional text. Having said that, in 1995, this engine was
+in the top 3 in terms of character accuracy, and it compiles and
+runs on both Linux and Windows. Another current limitation is that
+it only recognizes English and its character set is only US-ASCII.
+Training code IS included in the open source release however, and
+will be included in a future release.