Keyword Spotting in Poorly Printed Documents using Pseudo 2-D Hidden Markov Models

Authors:
S. S. Kuo;O. E. Agazzi
Affiliations:
-;-
Venue:
IEEE Transactions on Pattern Analysis and Machine Intelligence
Year:
1994

Citing 0
Cited 42

Document Image Decoding by Heuristic Search

IEEE Transactions on Pattern Analysis and Machine Intelligence
Supervised Template Estimation for Document Image Decoding

IEEE Transactions on Pattern Analysis and Machine Intelligence
Alternatives to Variable Duration HMM in Handwriting Recognition

IEEE Transactions on Pattern Analysis and Machine Intelligence
Holistic Verification of Handwritten Phrases

IEEE Transactions on Pattern Analysis and Machine Intelligence
Twenty Years of Document Image Analysis in PAMI

IEEE Transactions on Pattern Analysis and Machine Intelligence
A Statistical Approach for Phrase Location and Recognition within a Text Line: An Application to Street Name Recognition

IEEE Transactions on Pattern Analysis and Machine Intelligence
A 2-D HMM method for offline handwritten character recognition

Hidden Markov models
Information Retrieval from Documents: A Survey

Information Retrieval
Word Spotting in Bitmapped Fax Documents

Information Retrieval
Elastic image matching is NP-complete

Pattern Recognition Letters
A Novel Algorithm for Handwritten Chinese Character Recognition

ICMI '00 Proceedings of the Third International Conference on Advances in Multimodal Interfaces
Integration MBHMM and Neural Network for Totally Unconstrained Handwritten Numerals Recognition

ICMI '00 Proceedings of the Third International Conference on Advances in Multimodal Interfaces
Word Searching in Document Images Using Word Portion Matching

DAS '02 Proceedings of the 5th International Workshop on Document Analysis Systems V
Facial Expression Recognition with Pseudo-3D Hidden Markov Models

Proceedings of the 23rd DAGM-Symposium on Pattern Recognition
Using Hierarchical Shape Models to Spot Keywords in Cursive Handwriting Data

CVPR '98 Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
Word Searching in CCITT Group 4 Compressed Document Images

ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 1
Indexing and Retrieval of On-line Handwritten Documents

ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 2
Information Retrieval in Document Image Databases

IEEE Transactions on Knowledge and Data Engineering
Hardware Acceleration of Hidden Markov Model Decoding for Person Detection

Proceedings of the conference on Design, Automation and Test in Europe - Volume 3
A Probabilistic Model of Face Mapping with Local Transformations and Its Application to Person Recognition

IEEE Transactions on Pattern Analysis and Machine Intelligence
Hidden Markov models with states depending on observations

Pattern Recognition Letters
Structural Information Implant in a Context Based Segmentation-Free HMM Handwritten Word Recognition System for Latin and Bangla Script

ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
Deformation Models for Image Recognition

IEEE Transactions on Pattern Analysis and Machine Intelligence
Text search for medieval manuscript images

Pattern Recognition
A probabilistic model for face transformation with application to person identification

EURASIP Journal on Applied Signal Processing
Semantic content analysis and annotation of histological images

Computers in Biology and Medicine
Language Independent Word Spotting in Scanned Documents

ICADL 08 Proceedings of the 11th International Conference on Asian Digital Libraries: Universal and Ubiquitous Access to Information
A new approach for face recognition by sketches in photos

Signal Processing
HAH manuscripts: A holistic paradigm for classifying and retrieving historical Arabic handwritten documents

Expert Systems with Applications: An International Journal
Handwritten word-spotting using hidden Markov models and universal vocabularies

Pattern Recognition
Facial expression recognition using embedded hidden Markov model

SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
Inference and parameter estimation on hierarchical belief networks for image segmentation

Neurocomputing
Unsupervised writer adaptation of whole-word HMMs with application to word-spotting

Pattern Recognition Letters
A Bayesian approach to audio-visual speaker identification

AVBPA'03 Proceedings of the 4th international conference on Audio- and video-based biometric person authentication
Teeth recognition based on multiple attempts in mobile device

Journal of Network and Computer Applications
The fast and the flexible: extended pseudo two-dimensional warping for face recognition

IbPRIA'11 Proceedings of the 5th Iberian conference on Pattern recognition and image analysis
Real time face detection and recognition system using haar-like feature/HMM in ubiquitous network environments

ICCSA'05 Proceedings of the 2005 international conference on Computational Science and its Applications - Volume Part I
Lexicon-free handwritten word spotting using character HMMs

Pattern Recognition Letters
Performance evaluation of face recognition based on PCA, LDA, ICA and hidden markov model

ICDEM'10 Proceedings of the Second international conference on Data Engineering and Management
A non-rigid appearance model for shape description and recognition

Pattern Recognition
Image warping for face recognition: From local optimality towards global optimization

Pattern Recognition
Keyword spotting in unconstrained handwritten Chinese documents using contextual word model

Image and Vision Computing

Quantified Score

Hi-index	0.15

Visualization

Abstract

An algorithm for robust machine recognition of keywords embedded in a poorly printed document is presented. For each keyword, two statistical models, called pseudo 2-D hidden Markov models, are created for representing the actual keyword and all the other extraneous words, respectively. Dynamic programming is then used for matching an unknown input word with the two models and for making a maximum likelihood decision. Although the models are pseudo 2-D in the sense that they are not fully connected 2-D networks, they are shown to be general enough in characterizing printed words efficiently. These models facilitate a nice "elastic matching" property in both horizontal and vertical directions, which makes the recognizer not only independent of size and slant but also tolerant of highly deformed and noisy words. The system is evaluated on a synthetically created database that contains about 26000 words. Currently, the authors achieve a recognition accuracy of 99% when words in testing and training sets are of the same font size, and 96% when they are in different sizes. In the latter case, the conventional 1-D HMM achieves only a 70% accuracy rate.