On the Recognition of Printed Characters of Any Font and Size
IEEE Transactions on Pattern Analysis and Machine Intelligence
Integrating diverse knowledge sources in text recognition
ACM Transactions on Information Systems (TOIS)
History of OCR, Optical Character Recognition
History of OCR, Optical Character Recognition
DL '97 Proceedings of the second ACM international conference on Digital libraries
Twenty Years of Document Image Analysis in PAMI
IEEE Transactions on Pattern Analysis and Machine Intelligence
Robust document image understanding technologies
Proceedings of the 1st ACM workshop on Hardcopy document processing
A Scale Space Approach for Automatically Segmenting Words from Historical Handwritten Documents
IEEE Transactions on Pattern Analysis and Machine Intelligence
A case study on logging visual activities: chess game
TAINN'05 Proceedings of the 14th Turkish conference on Artificial Intelligence and Neural Networks
Hi-index | 0.15 |
By applying semantic analysis to images of extended passages of text, several volumes of a chess encyclopedia have been read with high accuracy. Although carefully proofread, the books were poorly printed and posed a severe challenge to conventional page-layout analysis and character-recognition methods. An experimental page-reader system performed strictly top-down layout analysis for identification of columns, lines, words, and characters. This proceeded rapidly and reliably thanks to a recently developed skew-estimation technique. Resegmentation of broken, touching, and dirty characters was handled in an efficient and integrated manner by a heuristic search operating on isolated words. By analyzing the syntax of game descriptions and applying the rules of chess, the error rate was reduced by a factor of 30 from what was achievable through shape analysis alone. Several computer vision systems integration issues suggested by this experience are discussed.