Initial import of tesseract-1.04b from pkgsrc-wip (packaged by heinz@
authorwiz <wiz>
Fri, 18 May 2007 06:39:27 +0000 (06:39 +0000)
committerwiz <wiz>
Fri, 18 May 2007 06:39:27 +0000 (06:39 +0000)
commit01179846f462a1005d011fa539882b092eb0df0c
tree48f2305147f44874306bce20222b9e58968c737b
parentfa32da61d944a074ae7b36082f69b9312dd0d9c8
Initial import of tesseract-1.04b from pkgsrc-wip (packaged by heinz@
and myself):

This code is a raw OCR engine. It has NO PAGE LAYOUT ANALYSIS, NO
OUTPUT FORMATTING, and NO UI. It can only process an image of a
single column and create text from it. It can detect fixed pitch
vs proportional text.  Having said that, in 1995, this engine was
in the top 3 in terms of character accuracy, and it compiles and
runs on both Linux and Windows. Another current limitation is that
it only recognizes English and its character set is only US-ASCII.
Training code IS included in the open source release however, and
will be included in a future release.
graphics/tesseract/DESCR [new file with mode: 0644]
graphics/tesseract/Makefile [new file with mode: 0644]
graphics/tesseract/PLIST [new file with mode: 0644]
graphics/tesseract/distinfo [new file with mode: 0644]
graphics/tesseract/files/tesseract.sh [new file with mode: 0644]
graphics/tesseract/patches/patch-ae [new file with mode: 0644]
graphics/tesseract/patches/patch-ag [new file with mode: 0644]
graphics/tesseract/patches/patch-ah [new file with mode: 0644]
graphics/tesseract/patches/patch-ai [new file with mode: 0644]