Multimodal interactive transcription of text images

  • Authors:
  • Alejandro H. Toselli, Verónica Romero, Moisés Pastor, Enrique Vidal

  • Affiliations:
  • Instituto Tecnológico de Informática, Universidad Politécnica de Valencia, Camino de Vera s/n, 46071 Valencia, Spain (all authors)

  • Venue:
  • Pattern Recognition
  • Year:
  • 2010

Abstract

To date, automatic handwriting recognition systems are far from perfect, and heavy human intervention is often required to check and correct their results. This "post-editing" process is both inefficient and uncomfortable for the user. An example is the transcription of historical documents: state-of-the-art handwritten text recognition technology is not capable of performing this task automatically, and expensive work by expert paleographers is needed to obtain correct transcriptions. As an alternative to fully manual transcription and to post-editing, a multimodal interactive approach is proposed here in which user feedback is provided by means of touchscreen pen strokes and/or more traditional keyboard and mouse operation. User feedback directly allows the system to improve its accuracy, while multimodality increases ergonomics and user acceptability. Multimodal interaction is approached in such a way that both the main and the feedback data streams help each other to optimize overall performance and usability. Empirical tests on three cursive handwriting tasks suggest that, with this approach, considerable amounts of user effort can be saved with respect to both pure manual work and non-interactive post-editing.
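
The interaction described in the abstract can be summarized as a prefix-based loop: the system proposes a transcription, the user corrects the first erroneous word (by pen stroke or keyboard), and the validated prefix is fed back so the recognizer only has to re-decode the remaining suffix. The following Python sketch simulates that loop with a toy recognizer; `recognize_suffix` and `simulate_interactive_transcription` are illustrative names, not part of the paper's software, and a real system would decode line images rather than consult a reference string.

    # Minimal sketch (assumed, not the authors' implementation) of a
    # prefix-constrained interactive transcription loop.

    def simulate_interactive_transcription(reference, recognize_suffix):
        """Run the interactive protocol on one text line.

        reference        : correct transcription (list of words), used here only
                           to simulate the user, who fixes the first wrong word.
        recognize_suffix : callable(prefix_words) -> hypothesized continuation.
        Returns the number of word-level corrections the user had to make.
        """
        prefix = []
        corrections = 0
        while len(prefix) < len(reference):
            hypothesis = prefix + recognize_suffix(prefix)
            # First position where the hypothesis disagrees with the user.
            err = next((i for i, (h, r) in enumerate(zip(hypothesis, reference))
                        if h != r), None)
            if err is None and len(hypothesis) >= len(reference):
                break                          # user accepts the full hypothesis
            err = err if err is not None else len(hypothesis)
            # The corrected word and everything before it become the new
            # validated prefix, which is fed back to the recognizer.
            prefix = reference[:err + 1]
            corrections += 1
        return corrections

    if __name__ == "__main__":
        reference = "the quick brown fox jumps over the lazy dog".split()

        def recognize_suffix(prefix):
            # Toy recognizer: misreads "brown" as "braun" until the validated
            # prefix provides enough context (mimicking how prefix information
            # can resolve downstream ambiguity in a real decoder).
            out = []
            for i in range(len(prefix), len(reference)):
                word = reference[i]
                if word == "brown" and "quick" not in prefix:
                    word = "braun"
                out.append(word)
            return out

        print("corrections needed:",
              simulate_interactive_transcription(reference, recognize_suffix))

In this toy run a single correction suffices, whereas plain post-editing would require fixing every misrecognized word by hand; this is the kind of effort saving the empirical tests quantify.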