Speech Communication - Special issue on interactive voice technology for telecommunication applications (IVITA '96)
The anatomy of a large-scale hypertextual Web search engine
WWW7 Proceedings of the seventh international conference on World Wide Web 7
Communications of the ACM
The STL Tutorial and Reference Guide: C++ Programming with the Standard Template Library
The STL Tutorial and Reference Guide: C++ Programming with the Standard Template Library
Multilingual Text-to-Speech Synthesis
Multilingual Text-to-Speech Synthesis
Automata: Theoretic Aspects of Formal Power Series
Automata: Theoretic Aspects of Formal Power Series
Spoken query processing for interactive information retrieval
Data & Knowledge Engineering
A Rational Design for a Weighted Finite-State Transducer Library
WIA '97 Revised Papers from the Second International Workshop on Implementing Automata
Finite-state transducers in language and speech processing
Computational Linguistics
Search Vox: leveraging multimodal refinement and partial knowledge for mobile voice search
Proceedings of the 21st annual ACM symposium on User interface software and technology
A scalable method for voice search to nationwide business listings
ICASSP '09 Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing
A general weighted grammar library
CIAA'04 Proceedings of the 9th international conference on Implementation and Application of Automata
IEEE Transactions on Audio, Speech, and Language Processing
Hi-index | 0.00 |
As the ubiquitous access to vast and remote information sources from portable devices becomes commonplace, the need from users to perform searches in keyboard-unfriendly situations grows substantially, thus triggering the increased demand of voice search sessions. This paper proposes a methodology that addresses different dimensions of scalability of mixed-initiative voice search in automatic spoken dialog systems. The strategy is based on splitting the complexity of the fully-constrained grammar (one that tightly covers the entire hypothesis space) into a fixed/low complexity phonotactic grammar followed by an index mechanism that dynamically assembles a second-pass grammar that consists of only a handful of hypotheses. The experimental analysis demonstrates different dimensions of scalability achieved by the proposed method using actual Whitepages-residential data.