Position specific posterior lattices for indexing speech

Authors:
Ciprian Chelba;Alex Acero
Affiliations:
Microsoft Research, Microsoft Corporation, Redmond, WA;Microsoft Research, Microsoft Corporation, Redmond, WA
Venue:
ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
Year:
2005

Abstract

The paper presents the Position Specific Posterior Lattice (PSPL), a novel representation of automatic speech recognition (ASR) lattices that naturally lends itself to efficient indexing of position information and subsequent relevance ranking of spoken documents using proximity.

In experiments performed on a collection of lecture recordings (MIT iCampus data), the spoken document ranking accuracy improved by 20% relative over the commonly used baseline of indexing the 1-best output from an automatic speech recognizer. The Mean Average Precision (MAP) increased from 0.53 when using 1-best output to 0.62 when using the new lattice representation. The reference used for evaluation is the output of a standard retrieval engine working on the manual transcription of the speech collection.

Albeit lossy, the PSPL is also much more compact than the ASR 3-gram lattice from which it is computed, which translates into a reduced inverted index size as well, at virtually no degradation in word-error-rate performance. Since new paths are introduced in the lattice, the ORACLE accuracy increases over the original ASR lattice.
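
For readers unfamiliar with the metric quoted above, the following is a minimal sketch of how Mean Average Precision is commonly computed. It is not the authors' evaluation code. The function names, the document identifiers, and the choice to treat the documents returned on the manual transcription as the relevant set for each query are assumptions made for illustration.

```python
from typing import Dict, List, Set


def average_precision(ranked: List[str], relevant: Set[str]) -> float:
    """Average precision of one ranked list against a set of relevant doc ids."""
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, doc_id in enumerate(ranked, start=1):
        if doc_id in relevant:
            hits += 1
            # Precision at this rank, counted only at ranks holding a relevant doc.
            precision_sum += hits / rank
    return precision_sum / len(relevant)


def mean_average_precision(
    runs: Dict[str, List[str]], refs: Dict[str, Set[str]]
) -> float:
    """Mean of per-query average precision over every query in refs."""
    if not refs:
        return 0.0
    total = sum(average_precision(runs.get(q, []), rel) for q, rel in refs.items())
    return total / len(refs)


# Hypothetical data: refs stands in for results obtained on manual transcripts,
# runs for results obtained on ASR output (1-best or lattice-based).
refs = {"q1": {"a", "c"}, "q2": {"y"}}
runs = {"q1": ["a", "b", "c"], "q2": ["x", "y"]}
score = mean_average_precision(runs, refs)
```

Under this definition, a system is rewarded both for retrieving the reference documents and for ranking them early, which is why a representation that keeps more than the 1-best hypothesis can raise the score.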