Speech interaction in a multimodal tool for handwritten text transcription

  • Authors:
  • Maria José Castro-Bleda;Salvador España-Boquera;David Llorens;Andrés Marzal;Federico Prat;Juan Miguel Vilar;Francisco Zamora-Martinez

  • Affiliations:
  • Universitat Politècnica de València, Valencia, Spain;Universitat Politècnica de València, Valencia, Spain;Universitat Jaume I, Castelló, Spain;Universitat Jaume I, Castelló, Spain;Universitat Jaume I, Castelló, Spain;Universitat Jaume I, Castelló, Spain;Universidad CEU-Cardenal Herrera, Valencia, Spain

  • Venue:
  • ICMI '11 Proceedings of the 13th international conference on multimodal interfaces
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

STATE is a multimodal tool for document processing and text transcription. Its graphical front-end can be easily connected to different text recognition back-ends. New features and improvements are presented in this work: the interactive correction of one word in the transcribed line has been improved to reestimate the entire transcription line using the user feedback and speech input has been integrated in the multimodal interface enabling the user to also utter the word to be corrected, giving the user the possibility to use the interface according to her preferences or the task at hand. Thus, at the current version of STATE, the user can type, write on the screen with a stylus, or utter the incorrectly recognized word, and then, the system uses the user feedback in any of the proposed modalities to reestimate the transcribed line so as to hopefully correct other errors which could be caused by the mistaken word the user has corrected.