Linked by Thom Holwerda on Wed 23rd May 2007 23:45 UTC, submitted by Austin
Linux "The one 'hole' in my workflow has been OCR. For years, people have been able to scan a document and have it converted into real text. One of my old printers even came with OCR software included - for Windows of course. But when I've really needed OCR, I've just assumed that there were no high quality packages available for Linux. Recently I decided to find out for myself (a complete OCR virgin) what is available, how to use it, and what the results are like. I installed every free OCR package I could find, and systematically tested them. They all work very differently, so I tried to design a simple test for my specific needs."
Thread beginning with comment 243051
To view parent comment, click here.
To read all comments associated with this story, please click here.
RE[2]: OCR is not a problem
by Temcat on Fri 25th May 2007 09:38 UTC in reply to "RE: OCR is not a problem"
Temcat
Member since:
2005-10-18

If we had found one F/OSS tool that was close to doing the job, we would have gladly payed for extra work.

What do you think about this?

http://www.abbyy.com/sdk/?param=59956

ABBYY are the makers of FineReader, so this is serious stuff. Yes, this is only a backend (SDK), but writing a GUI frontend for a full-featured backend is easier than recreating essential backend features requiring tremendous expertise.

Reply Parent Bookmark Score: 2

RE[3]: OCR is not a problem
by Havin_it on Fri 25th May 2007 11:46 in reply to "RE[2]: OCR is not a problem"
Havin_it Member since:
2006-03-10

I don't see anything on that page to suggest that the FineReader Engine is available under a F/OSS license. I think if you develop anything that uses it, you're gonna be paying them for the privilege. In the OP's case, that means paying a licensing fee *and* hiring a coder to put a nice GUI on it.

Reply Parent Bookmark Score: 1

RE[4]: OCR is not a problem
by Temcat on Sat 26th May 2007 11:24 in reply to "RE[3]: OCR is not a problem"
Temcat Member since:
2005-10-18

Yeah, it's not FOSS, but you have the API, so you don't need the source. And it may turn out cheaper in the end. But if a FOSS license, not *nix compatibility, were an absolute requirement (why?), then of course FineReader Engine would not be suitable.

Reply Parent Bookmark Score: 1