This paper presents a combination of out-of-vocabulary (OOV) word modeling and rejection techniques intended to accept utterances that embed a keyword and to reject utterances that contain only non-keywords. The goal of this research is to develop a robust, task-independent Spanish keyword spotter and a method for optimizing confidence thresholds for a particular context. To model OOV words, we employed both word and sub-word units as fillers, combined with n-gram language models. We also introduce a methodology for optimizing confidence thresholds to control the tradeoffs between acceptance, confirmation, and rejection of utterances. Our experiments are based on a Mexican Spanish auto-attendant system using the SpeechWorks recognizer release 6.5 Second Edition, with which we achieved an error reduction of 8.9% compared with the baseline system. Most of the error reduction is attributed to better keyword detection in utterances that contain both keywords and OOV words.
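To illustrate the accept/confirm/reject tradeoff, the following is a minimal sketch and not the method used in the paper. It assumes two confidence thresholds (`low` and `high`), a small set of labeled confidence scores, and a grid search that minimizes a total cost. The cost values, the sample data, and all function names are hypothetical and do not come from the abstract or from the SpeechWorks recognizer.

```python
from itertools import product

# Hypothetical costs (assumptions; the abstract specifies none).
COST_FALSE_ACCEPT = 5.0  # wrong hypothesis accepted without confirmation
COST_FALSE_REJECT = 2.0  # correct hypothesis rejected
COST_CONFIRM = 1.0       # any utterance that triggers a confirmation prompt


def decide(score, low, high):
    """Map a confidence score to an action using two thresholds."""
    if score >= high:
        return "accept"
    if score >= low:
        return "confirm"
    return "reject"


def total_cost(samples, low, high):
    """Sum the cost of the actions taken on (score, is_correct) pairs."""
    cost = 0.0
    for score, is_correct in samples:
        action = decide(score, low, high)
        if action == "confirm":
            cost += COST_CONFIRM
        elif action == "accept" and not is_correct:
            cost += COST_FALSE_ACCEPT
        elif action == "reject" and is_correct:
            cost += COST_FALSE_REJECT
    return cost


def best_thresholds(samples, grid):
    """Exhaustively search threshold pairs with low <= high."""
    pairs = [(lo, hi) for lo, hi in product(grid, grid) if lo <= hi]
    return min(pairs, key=lambda p: total_cost(samples, p[0], p[1]))


# Made-up development data: (confidence score, hypothesis was correct).
samples = [
    (0.95, True), (0.90, True), (0.70, True), (0.65, False),
    (0.60, True), (0.55, False), (0.30, False), (0.20, False),
]
grid = [i / 10 for i in range(11)]
low, high = best_thresholds(samples, grid)
print(low, high, total_cost(samples, low, high))
```

Raising the false-accept cost in this sketch widens the confirmation band, while raising the confirmation cost narrows it, which is the kind of context-specific tradeoff the threshold optimization is meant to control.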