After reading this article about Google's open-source OCR, I had a funny idea: how about using OCR to store scanned documents efficiently, without losing the original scanned image?

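You OCR the image, then store the text, the layout, and the difference between the original image and the document reconstructed from the OCR data. If the reconstruction is close to the scan, that difference should be mostly empty and compress well, and adding it back to the reconstruction gives you the original image exactly.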
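Here is a rough sketch of the "store the difference" part, just to make the idea concrete. It assumes the scan and the page re-rendered from the OCR data are both available as 8-bit grayscale pixel buffers of the same size. The OCR and rendering steps are left out entirely, and the function names `encode_residual` and `decode_original` are made up for this example.

```python
import zlib


def encode_residual(original: bytes, reconstructed: bytes) -> bytes:
    """Compress the per-pixel difference between the scan and the re-rendered page."""
    if len(original) != len(reconstructed):
        raise ValueError("both images must have the same number of pixels")
    # Difference modulo 256, so every value still fits in one byte.
    diff = bytes((o - r) % 256 for o, r in zip(original, reconstructed))
    return zlib.compress(diff, 9)


def decode_original(reconstructed: bytes, residual: bytes) -> bytes:
    """Rebuild the original scan from the re-rendered page plus the stored residual."""
    diff = zlib.decompress(residual)
    if len(diff) != len(reconstructed):
        raise ValueError("residual does not match the reconstructed image size")
    # Adding the difference back (modulo 256) undoes the subtraction exactly.
    return bytes((r + d) % 256 for r, d in zip(reconstructed, diff))


if __name__ == "__main__":
    # Toy 4x4 "page": 255 is white paper, 0 is ink. Purely illustrative data.
    reconstructed = bytes([255, 0, 0, 255] * 4)
    # The "scan" is the same page with two slightly noisy pixels.
    scan = bytearray(reconstructed)
    scan[1] = 12
    scan[10] = 240
    scan = bytes(scan)

    residual = encode_residual(scan, reconstructed)
    assert decode_original(reconstructed, residual) == scan
```

Whether this actually saves space depends on how closely the re-rendered page matches the scan: fonts, skew, and scanner noise all end up in the difference.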

I wonder whether that would work?

-Richard

Tags Software, Geek Documents
