Index-based incremental language model for scalable directory assistance

Authors:
Antonio Moreno-Daniel;Jay Wilpon;B. H. Juang
Affiliations:
Georgia Institute of Technology, Atlanta, GA, USA;AT&T Labs Research, Florham Park, NJ, USA;Georgia Institute of Technology, Atlanta, GA, USA
Venue:
Speech Communication
Year:
2012

Citing 13
Cited 0

How may I help you?

Speech Communication - Special issue on interactive voice technology for telecommunication applications (IVITA '96)
The anatomy of a large-scale hypertextual Web search engine

WWW7 Proceedings of the seventh international conference on World Wide Web 7
Trie memory

Communications of the ACM
The STL Tutorial and Reference Guide: C++ Programming with the Standard Template Library

The STL Tutorial and Reference Guide: C++ Programming with the Standard Template Library
Multilingual Text-to-Speech Synthesis

Multilingual Text-to-Speech Synthesis
Automata: Theoretic Aspects of Formal Power Series

Automata: Theoretic Aspects of Formal Power Series
Spoken query processing for interactive information retrieval

Data & Knowledge Engineering
A Rational Design for a Weighted Finite-State Transducer Library

WIA '97 Revised Papers from the Second International Workshop on Implementing Automata
Finite-state transducers in language and speech processing

Computational Linguistics
Search Vox: leveraging multimodal refinement and partial knowledge for mobile voice search

Proceedings of the 21st annual ACM symposium on User interface software and technology
A scalable method for voice search to nationwide business listings

ICASSP '09 Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing
A general weighted grammar library

CIAA'04 Proceedings of the 9th international conference on Implementation and Application of Automata
Efficient WFST-Based One-Pass Decoding With On-The-Fly Hypothesis Rescoring in Extremely Large Vocabulary Continuous Speech Recognition

IEEE Transactions on Audio, Speech, and Language Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

As the ubiquitous access to vast and remote information sources from portable devices becomes commonplace, the need from users to perform searches in keyboard-unfriendly situations grows substantially, thus triggering the increased demand of voice search sessions. This paper proposes a methodology that addresses different dimensions of scalability of mixed-initiative voice search in automatic spoken dialog systems. The strategy is based on splitting the complexity of the fully-constrained grammar (one that tightly covers the entire hypothesis space) into a fixed/low complexity phonotactic grammar followed by an index mechanism that dynamically assembles a second-pass grammar that consists of only a handful of hypotheses. The experimental analysis demonstrates different dimensions of scalability achieved by the proposed method using actual Whitepages-residential data.