ASR post-correction for spoken dialogue systems based on semantic, syntactic, lexical and contextual information

Authors:
Ramón López-Cózar;Zoraida Callejas
Affiliations:
Department of Languages and Computer Systems, Computer Science Faculty, University of Granada, 18071 Granada, Spain;Department of Languages and Computer Systems, Computer Science Faculty, University of Granada, 18071 Granada, Spain
Venue:
Speech Communication
Year:
2008

Citing 15
Cited 3

Fundamentals of speech recognition

Fundamentals of speech recognition
Spontaneous speech dialogue system TOSBURG II and its evaluation

ISSD-93 Selected papers presented at the international symposium on Spoken dialogue
Field trial evaluations of two different information inquiry systems

Speech Communication - Special issue on interactive voice technology for telecommunication applications (IVITA '96)
PADIS—an automatic telephone switchboard and directory information system

Speech Communication - Special issue on interactive voice technology for telecommunication applications (IVITA '96)
On natural language call routing

Speech Communication - Special issue on interactive voice technology for telecommunication applications
Multimodal error correction for speech user interfaces

ACM Transactions on Computer-Human Interaction (TOCHI)
Adaptations in spoken corrections: implications for models of conversational speech

Speech Communication - Dialogue and prosody
Assessment of dialogue systems by means of a new simulation technique

Speech Communication
A method for correcting errors in speech recognition using the statistical features of character co-occurrence

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Characterizing and recognizing spoken corrections in human-computer dialogue

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Dialogue management in the Mercury flight reservation system

ANLP/NAACL-ConvSyst '00 Proceedings of the 2000 ANLP/NAACL Workshop on Conversational systems - Volume 3
SmartKom: Foundations of Multimodal Dialogue Systems (Cognitive Technologies)

SmartKom: Foundations of Multimodal Dialogue Systems (Cognitive Technologies)
Advanced Man-Machine Interaction: Fundamentals and Implementation (Signals and Communication Technology)

Advanced Man-Machine Interaction: Fundamentals and Implementation (Signals and Communication Technology)
A class based language model for speech recognition

ICASSP '96 Proceedings of the Acoustics, Speech, and Signal Processing, 1996. on Conference Proceedings., 1996 IEEE International Conference - Volume 01
Testing the performance of spoken dialogue systems by means of an artificially simulated user

Artificial Intelligence Review

Mobile texting: can post-ASR correction solve the issues? an experimental study on gain vs. costs

Proceedings of the 2012 ACM international conference on Intelligent User Interfaces
On the dynamic adaptation of language models based on dialogue information

Expert Systems with Applications: An International Journal
A domain-independent statistical methodology for dialog management in spoken dialog systems

Computer Speech and Language

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper proposes a technique to correct speech recognition errors in spoken dialogue systems that presents two main novel contributions. On the one hand, it considers several contexts where a speech recognition result can be corrected. A threshold learnt in the training is used to decide whether the correction must be carried out in the context associated with the current prompt type of a dialogue system, or in another context. On the other hand, the technique deals with the confidence scores of the words employed in the corrections. The correction is carried out at two levels: statistical and linguistic. At the first level the technique employs syntactic-semantic and lexical models, both contextual, to decide whether a recognition result is correct. According to this decision the recognition result may be changed. At the second level the technique employs basic linguistic knowledge to decide about the grammatical correctness of the outcome of the first level. According to this decision the outcome may be changed as well. Experimental results indicate that the technique enhances a dialogue system's word accuracy, speech understanding, implicit recovery and task completion rates by 8.5%, 16.54%, 4% and 44.17%, respectively.