Techniques for automatically correcting words in text
ACM Computing Surveys (CSUR)
A Lexicon Driven Approach to Handwritten Word Recognition for Real-Time Applications
IEEE Transactions on Pattern Analysis and Machine Intelligence
IEEE Transactions on Pattern Analysis and Machine Intelligence
A Database for Handwritten Text Recognition Research
IEEE Transactions on Pattern Analysis and Machine Intelligence
Automatic Rule Acquisition for Spelling Correction
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Word reordering and a dynamic programming beam search algorithm for statistical machine translation
Computational Linguistics
Stochastic Error-Correcting Parsing for OCR Post-Processing
ICPR '00 Proceedings of the International Conference on Pattern Recognition - Volume 4
Offline Recognition of Unconstrained Handwritten Texts Using HMMs and Statistical Language Models
IEEE Transactions on Pattern Analysis and Machine Intelligence
HMM-based word alignment in statistical translation
COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 2
Statistical phrase-based translation
NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
A generative probabilistic OCR model for NLP applications
NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
Hidden Markov Models Combining Discrete Symbols and Continuous Attributes in Handwriting Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence
A maximum entropy word aligner for Arabic-English machine translation
HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Inner-outer bracket models for word alignment using hidden blocks
HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
A Maximum Likelihood Approach to Continuous Speech Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence
Efficient OCR post-processing combining language, hypothesis and error models
SSPR&SPR'10 Proceedings of the 2010 joint IAPR international conference on Structural, syntactic, and statistical pattern recognition
Improving on-line handwritten recognition in interactive machine translation
Pattern Recognition
Hi-index | 0.01 |
We propose a method for increasing word recognition accuracies by correcting the output of a handwriting recognition system. We treat the handwriting recognizer as a black box, such that there is no access to its internals. This enables us to keep our algorithm general and independent of any particular system. We use a novel method for correcting the output based on a ''phrase-based'' system in contrast to traditional source-channel models. We report the accuracies of two in-house handwritten word recognizers before and after the correction. We achieve highly encouraging results for a large synthetically generated dataset. We also report results for a commercially available OCR on real data.