Statistical methods for speech recognition
Statistical methods for speech recognition
Multimodal error correction for speech user interfaces
ACM Transactions on Computer-Human Interaction (TOCHI)
Computational Complexity of Problems on Probabilistic Grammars and Transducers
ICGI '00 Proceedings of the 5th International Colloquium on Grammatical Inference: Algorithms and Applications
Probabilistic Finite-State Machines-Part II
IEEE Transactions on Pattern Analysis and Machine Intelligence
Probabilistic Finite-State Machines-Part I
IEEE Transactions on Pattern Analysis and Machine Intelligence
Towards Restoring Historic Documents Degraded Over Time
DIAL '06 Proceedings of the Second International Conference on Document Image Analysis for Libraries
Word graph based speech rcognition error correction by handwriting input
Proceedings of the 8th international conference on Multimodal interfaces
Computer Assisted Transcription for Ancient Text Images
ICIAR '07 Proceedings of the 4th international conference on Image Analysis and Recognition
A Novel Connectionist System for Unconstrained Handwriting Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence
Graph-based partial hypothesis fusion for pen-aided speech input
IEEE Transactions on Audio, Speech, and Language Processing - Special issue on multimodal processing in speech-based interactions
Markov models for offline handwriting recognition: a survey
International Journal on Document Analysis and Recognition
Bi-modal handwritten text recognition (BiHTR) ICPR 2010 contest report
ICPR'10 Proceedings of the 20th International conference on Recognizing patterns in signals, speech, images, and videos
Improving Offline Handwritten Text Recognition with Hybrid HMM/ANN Models
IEEE Transactions on Pattern Analysis and Machine Intelligence
Multimodal Interactive Pattern Recognition and Applications
Multimodal Interactive Pattern Recognition and Applications
ICDAR 2011 - French Handwriting Recognition Competition
ICDAR '11 Proceedings of the 2011 International Conference on Document Analysis and Recognition
Improving on-line handwritten recognition in interactive machine translation
Pattern Recognition
Hi-index | 0.10 |
The transcription of historical documents is one of the most interesting tasks in which Handwritten Text Recognition can be applied, due to its interest in humanities research. One alternative for transcribing the ancient manuscripts is the use of speech dictation by using Automatic Speech Recognition techniques. In the two alternatives similar models (Hidden Markov Models and n-grams) and decoding processes (Viterbi decoding) are employed, which allows a possible combination of the two modalities with little difficulties. In this work, we explore the possibility of using recognition results of one modality to restrict the decoding process of the other modality, and apply this process iteratively. Results of these multimodal iterative alternatives are significantly better than the baseline uni-modal systems and better than the non-iterative alternatives.