A graphical ocr solution for GNU/Linux based on Python, Qt4 and tessaract OCR
Lector can help you to scan your tons of paper and create text document!
Lector lets you select areas on which you want to do OCR (Optical Character Recognition). Then you can run tesseract-ocr
simply clicking a button. The resulting text can be proofread, formatted and edited directly in Lector.
* scanning (available only on Linux)
* OCR via tesseract (with support for more than 30 languages!)
* selection areas for OCR
* rotating of images
* text editing
* text formatting including Cap operations
* white space display
* Personal word list based on dictionary
* export (via Save as) to odt (default), txt, html, pdf
* translations Linux Installation
Requires Python, Python Image Library (PIL), and QT4.
There is no installer at the moment. Just download package, uncompress it and run:
If you downloaded source and you have installed required dependencies, or if you downloaded win32 package( tested on Windows XP SP3), run:lector.exe
This version expect that tesseract
is installed on computer and in it in PATH. No installation of Python, QT4 is required.
In some cases you need to reboot computer after installing tesseract
, so Lector will recognize it.
Scanning is not implemented on Windows (yet).