Disclaimer

This information HAS errors and is made available WITHOUT ANY WARRANTY OF ANY KIND and without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. It is not permissible to be read by anyone who has ever met a lawyer or attorney. Use is confined to Engineers with more than 370 course hours of engineering.
If you see an error contact:
+1(785) 841 3089
inform@xtronics.com

OCR


Scanning for OCR

At this time (2016), scanning from within the GUI programs does not support duplex scanning. xsane supports duplex scanning fairly well; set the number of pages to some high number so the scan runs until the feeder is empty.
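For scripted scanning outside a GUI, a batch scan can also be driven from the command line. The following is a minimal sketch, assuming SANE's scanimage tool is installed and that the scanner backend calls its duplex feeder source "ADF Duplex" (an assumption; source names vary by backend, so check the options your scanner reports). The function name, the file prefix and the 300 dpi resolution are illustrative choices, not settings from this page. The sketch only builds and prints the command, it does not start a scan.

```python
import shlex


def duplex_scan_command(prefix="page", source="ADF Duplex", resolution=300):
    """Build a scanimage command line for a duplex batch scan.

    All defaults are illustrative assumptions:
    - source: backend-specific name of the duplex feeder
    - resolution: dpi value passed to the backend
    """
    return [
        "scanimage",
        f"--source={source}",          # backend-specific option (assumed name)
        f"--resolution={resolution}",  # backend-specific option
        "--format=tiff",               # write TIFF files
        f"--batch={prefix}%04d.tif",   # batch mode: one numbered file per page side
    ]


if __name__ == "__main__":
    # Print the command so it can be reviewed before running it by hand.
    print(shlex.join(duplex_scan_command(prefix="book")))
```

In batch mode scanimage keeps scanning until the feeder reports it is empty, which plays the same role as setting a high page count in xsane.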

Cutting the binding off a book with a guillotine paper cutter and feeding the loose pages through a snap-scan scanner works pretty well. Having searchable text increases the value of a book.

OCR engines

GUI front end to OCR engines

Clear-scan vs Searchable Image

There are two approaches: put a text layer under a bitmap (searchable image), or make a real document with fonts and pictures where needed (clear-scan), hopefully as an ODT file. Editable clear-scan to an ODT file is the goal we need to seek. This gives us an editable file in an open standard, so it will not lose compatibility over time.
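The searchable-image approach can be sketched with the Tesseract command-line tool, which writes a PDF holding the page bitmap plus a hidden text layer when given the `pdf` output config. This is only one possible engine, used here as an assumption; the function name and the file names are hypothetical. The sketch builds the command without running it.

```python
import shlex
from pathlib import Path


def searchable_pdf_command(image, lang="eng"):
    """Build a tesseract command that turns one scanned page into a searchable PDF.

    `image` is a hypothetical input file name; `lang` is the Tesseract
    language code (assumes that language data is installed).
    """
    image = Path(image)
    out_base = image.with_suffix("")  # tesseract appends ".pdf" to this base name
    return ["tesseract", str(image), str(out_base), "-l", lang, "pdf"]


if __name__ == "__main__":
    # Print the command for review; run it by hand once tesseract is installed.
    print(shlex.join(searchable_pdf_command("page0001.tif")))
```

The result keeps the original scan as the visible page, so recognition errors affect only searching and copying, not what the reader sees. It does not produce the editable clear-scan document described above.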

As of 2016 there isn't a good clear-scan solution. One known issue is that diagrams containing text confuse the software. A perfect solution might take some AI.


Convert PDF to ODT

pdfocr ??

