Linked by Thom Holwerda on Thu 31st Aug 2006 18:07 UTC, submitted by diegocg
Google has announced the release of the source code of Tesseract, an old OCR program. "In a nutshell, we are all about making information available to users, and when this information is in a paper document, OCR is the process by which we can convert the pages of this document into text that can then be used for indexing."
License
by KugelKurt on Fri 1st Sep 2006 20:11 UTC in reply to "Good job, Google."
KugelKurt (member since 2005-07-06)

It's licensed under the Apache License 2.0. See http://tesseract-ocr.cvs.sourceforge.net/tesseract-ocr/tesseract/RE...

Score: 1