Perspectives on Information Retrieval and Speech

Authors:
James Allan
Affiliations:
-
Venue:
Information Retrieval Techniques for Speech Applications [this book is based on the workshop “Information Retrieval Techniques for Speech Applications”, held as part of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in New Orleans, USA, in September 2001].
Year:
2001

Citing 2
Cited 14

Word sense disambiguation for large text databases

Word sense disambiguation for large text databases
Topic Detection and Tracking: Event-Based Information Organization

Topic Detection and Tracking: Event-Based Information Organization

ACM SIGIR 2001 workshop "Information Retrieval Techniques for Speech Applications"

ACM SIGIR Forum
Vocal Access to a Newspaper Archive: Assessing the Limitations of Current Voice Information Access Technology

Journal of Intelligent Information Systems
From Multimedia Retrieval to Knowledge Management

Computer
Cross-Language Access to Recorded Speech in the MALACH Project

TSD '02 Proceedings of the 5th International Conference on Text, Speech and Dialogue
A speech interface for open-domain question-answering

ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 2
Automatic analysis of call-center conversations

Proceedings of the 14th ACM international conference on Information and knowledge management
Written versus spoken queries: A qualitative and quantitative comparative analysis

Journal of the American Society for Information Science and Technology - Research Articles
Natural language processing for information retrieval: the time is ripe (again)

Proceedings of the ACM first Ph.D. workshop in CIKM
A critical assessment of spoken utterance retrieval through approximate lattice representations

MIR '08 Proceedings of the 1st ACM international conference on Multimedia information retrieval
A Soundex-Based Approach for Spoken Document Retrieval

MICAI '08 Proceedings of the 7th Mexican International Conference on Artificial Intelligence: Advances in Artificial Intelligence
Search of spoken documents retrieves well recognized transcripts

ECIR'07 Proceedings of the 29th European conference on IR research
Multimedia content with a speech track: ACM multimedia 2010 workshop on searching spontaneous conversational speech

Proceedings of the international conference on Multimedia
CLEF-2005 CL-SR at maryland: document and query expansion using side collections and thesauri

CLEF'05 Proceedings of the 6th international conference on Cross-Language Evalution Forum: accessing Multilingual Information Repositories
Spoken Content Retrieval: A Survey of Techniques and Technologies

Foundations and Trends in Information Retrieval

Quantified Score

Hi-index	0.00

Visualization

Abstract

Several years of research have suggested that the accuracy of spoken document retrieval systems is not adversely affected by speech recognition errors. Even with error rates of around 40%, the effectiveness of an IR system falls less than 10%. The paper hypothesizes that this robust behavior is the result of repetition of important words in the text--meaning that losing one or two occurrences is not crippling-- and the result of additional related words providing a greater context-- meaning that those words will match even if the seemingly critical word is misrecognized. This hypothesis is supported by examples from TREC's SDR track, the TDT evaluation, and some work showing the impact of recognition errors on spoken queries.