Lexicon Reduction in an HMM-Framework Based on Quantized Feature Vectors

Authors:
Guido Kaufmann;Horst Bunke;M. Hadorn
Affiliations:
-;-;-
Venue:
ICDAR '97 Proceedings of the 4th International Conference on Document Analysis and Recognition
Year:
1997

Citing 0
Cited 6

Sentence lipreading using hidden Markov model with integrated grammar

Hidden Markov models
A Lexicon Reduction Strategy in the Context of Handwritten Medical Forms

ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
Lexicon reduction using dots for off-line Farsi/Arabic handwritten word recognition

Pattern Recognition Letters
Topic based language models for OCR correction

Proceedings of the second workshop on Analytics for noisy unstructured text data
W-TSV: Weighted topological signature vector for lexicon reduction in handwritten Arabic documents

Pattern Recognition
Effect of delayed strokes on the recognition of online Farsi handwriting

Pattern Recognition Letters

Quantified Score

Hi-index	0.00

Visualization

Abstract

For many applications in cursive script recognition the vocabulary is restricted to a small and fixed set of words. In a recognition approach such as Hidden Markov Models, for each of these words a model is constructed and trained. In the recognition task an unknown pattern is matched with all of these models to find the most likely class. In our paper, we describe a method for reducing the size of vocabulary depending on the actual input. In contrast to many other techniques, our approach is not using any topological features. The reduction system is directly based on the quantized feature vectors which are used as input for the HMMs. Thus very little additional work is required for lexicon reduction. The proposed approach was successfully tested on two different systems with small lexicons. In both cases, the lexicon could be reduced to 25% of its original size without increasing in the error rate.