Learning on the fly: a font-free approach toward multilingual OCR

Authors:
Andrew Kae;David A. Smith;Erik Learned-Miller
Affiliations:
University of Massachusetts Amherst, Department of Computer Science, 140 Governors Drive, 01003-9264, Amherst, MA, USA;University of Massachusetts Amherst, Department of Computer Science, 140 Governors Drive, 01003-9264, Amherst, MA, USA;University of Massachusetts Amherst, Department of Computer Science, 140 Governors Drive, 01003-9264, Amherst, MA, USA
Venue:
International Journal on Document Analysis and Recognition - Special issue - Selected and extended papers from ICDAR2009
Year:
2011

Citing 0
Cited 1

Real time mono-vision based customizable virtual keyboard using finger tip speed analysis

HCI'13 Proceedings of the 15th international conference on Human-Computer Interaction: interaction modalities and techniques - Volume Part IV

Quantified Score

Hi-index	0.00

Visualization

Abstract

Despite ubiquitous claims that optical character recognition (OCR) is a “solved problem,” many categories of documents continue to break modern OCR software such as documents with moderate degradation or unusual fonts. Many approaches rely on pre-computed or stored character models, but these are vulnerable to cases when the font of a particular document was not part of the training set or when there is so much noise in a document that the font model becomes weak. To address these difficult cases, we present a form of iterative contextual modeling that learns character models directly from the document it is trying to recognize. We use these learned models both to segment the characters and to recognize them in an incremental, iterative process. We present results comparable with those of a commercial OCR system on a subset of characters from a difficult test document in both English and Greek.