Scanning and editing text with OCR
|
Conclusion
You can scan documents and extract text from them without much fuss with shell tools. The sample script provided here already gives a functional result. With a bit of shell know-how, you can expand upon this script and customize it to your taste, for example, with the Unpaper [7] tool, which is also available in the repositories.
Infos
- Sane: http://www.sane-project.org/
- PDFtk: http://www.pdflabs.com/tools/pdftk-the-pdf-toolkit
- Recode: http://recode.progiciels-bpi.ca/index.html
- GNU enscript: http://www.markkurossi.com/genscript/
- a2ps: http://www.inf.enst.fr/~demaille/a2ps/
- Ghostscript: http://www.ghostscript.com/
- Unpaper: http://unpaper.berlios.de
« Previous 1 2 3 Next »
Buy this article as PDF
Express-Checkout as PDF
Pages: 4
Price $0.99
(incl. VAT)
(incl. VAT)