Balancing error and supervision effort in interactive-predictive handwriting recognition

Authors:
Nicolás Serrano;Albert Sanchis;Alfons Juan
Affiliations:
Universitat Politècnica de València, València, Spain;Universitat Politècnica de València, València, Spain;Universitat Politècnica de València, València, Spain
Venue:
Proceedings of the 15th international conference on Intelligent user interfaces
Year:
2010

Citing 4
Cited 7

Text line segmentation of historical documents: a survey

International Journal on Document Analysis and Recognition
Hidden Markov model-based ensemble methods for offline handwritten text line recognition

Pattern Recognition
The GERMANA Database

ICDAR '09 Proceedings of the 2009 10th International Conference on Document Analysis and Recognition
Adaptation from partially supervised handwritten text transcriptions

Proceedings of the 2009 international conference on Multimodal interfaces

Interactive layout analysis and transcription systems for historic handwritten documents

Proceedings of the 10th ACM symposium on Document engineering
Rejection threshold estimation for an unknown language model in an OCR task

SSPR&SPR'10 Proceedings of the 2010 joint IAPR international conference on Structural, syntactic, and statistical pattern recognition
Active learning strategies for handwritten text transcription

International Conference on Multimodal Interfaces and the Workshop on Machine Learning for Multimodal Interaction
Confidence measures for error discrimination in an interactive predictive parsing framework

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Language identification for interactive handwriting transcription of multilingual documents

IbPRIA'11 Proceedings of the 5th Iberian conference on Pattern recognition and image analysis
A prototype for interactive speech transcription balancing error and supervision effort

Proceedings of the 2012 ACM international conference on Intelligent User Interfaces
Effective balancing error and user effort in interactive handwriting recognition

Pattern Recognition Letters

Quantified Score

Hi-index	0.00

Visualization

Abstract

An effective approach to transcribe handwritten text documents is to follow an interactive-predictive paradigm in which both, the system is guided by the user, and the user is assisted by the system to complete the transcription task as efficiently as possible. This approach has been recently implemented in a system prototype called GIDOC, in which standard speech technology is adapted to handwritten text (line) images: HMM-based text image modeling, n-gram language modeling, and also confidence measures on recognized words. Confidence measures are used to assist the user in locating possible transcription errors, and thus validate system output after only supervising those (few) words for which the system is not highly confident. However, a certain degree of supervision is required for proper model adaptation from partially supervised transcriptions. Here, we propose a simple yet effective method to find an optimal balance between recognition error and supervision effort.