An architecture for more realistic conversational systems
Proceedings of the 6th international conference on Intelligent user interfaces
SIGDIAL '09 Proceedings of the SIGDIAL 2009 Conference: The 10th Annual Meeting of the Special Interest Group on Discourse and Dialogue
TELIDA: a package for manipulation and visualization of timed linguistic data
SIGDIAL '09 Proceedings of the SIGDIAL 2009 Conference: The 10th Annual Meeting of the Special Interest Group on Discourse and Dialogue
SIGDIAL '10 Proceedings of the 11th Annual Meeting of the Special Interest Group on Discourse and Dialogue
Stability and accuracy in incremental speech recognition
SIGDIAL '11 Proceedings of the SIGDIAL 2011 Conference
SIGDIAL '11 Proceedings of the SIGDIAL 2011 Conference
Voice typing: a new speech interaction model for dictation on touchscreen devices
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
EACL '12 Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics
SDCTD '12 NAACL-HLT Workshop on Future Directions and Needs in the Spoken Dialog Community: Tools and Data
A temporal simulator for developing turn-taking methods for spoken dialogue systems
SIGDIAL '12 Proceedings of the 13th Annual Meeting of the Special Interest Group on Discourse and Dialogue
Integrating incremental speech recognition and POMDP-based dialogue systems
SIGDIAL '12 Proceedings of the 13th Annual Meeting of the Special Interest Group on Discourse and Dialogue
Situated incremental natural language understanding using Markov Logic Networks
Computer Speech and Language
Hi-index | 0.00 |
In incremental spoken dialogue systems, partial hypotheses about what was said are required even while the utterance is still ongoing. We define measures for evaluating the quality of incremental ASR components with respect to the relative correctness of the partial hypotheses compared to hypotheses that can optimize over the complete input, the timing of hypothesis formation relative to the portion of the input they are about, and hypothesis stability, defined as the number of times they are revised. We show that simple incremental post-processing can improve stability dramatically, at the cost of timeliness (from 90 % of edits of hypotheses being spurious down to 10 % at a lag of 320 ms). The measures are not independent, and we show how system designers can find a desired operating point for their ASR. To our knowledge, we are the first to suggest and examine a variety of measures for assessing incremental ASR and improve performance on this basis.