Word sense disambiguation for large text databases
Word sense disambiguation for large text databases
Topic Detection and Tracking: Event-Based Information Organization
Topic Detection and Tracking: Event-Based Information Organization
Journal of Intelligent Information Systems
Cross-Language Access to Recorded Speech in the MALACH Project
TSD '02 Proceedings of the 5th International Conference on Text, Speech and Dialogue
A speech interface for open-domain question-answering
ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 2
Automatic analysis of call-center conversations
Proceedings of the 14th ACM international conference on Information and knowledge management
Written versus spoken queries: A qualitative and quantitative comparative analysis
Journal of the American Society for Information Science and Technology - Research Articles
Natural language processing for information retrieval: the time is ripe (again)
Proceedings of the ACM first Ph.D. workshop in CIKM
A critical assessment of spoken utterance retrieval through approximate lattice representations
MIR '08 Proceedings of the 1st ACM international conference on Multimedia information retrieval
A Soundex-Based Approach for Spoken Document Retrieval
MICAI '08 Proceedings of the 7th Mexican International Conference on Artificial Intelligence: Advances in Artificial Intelligence
Search of spoken documents retrieves well recognized transcripts
ECIR'07 Proceedings of the 29th European conference on IR research
Proceedings of the international conference on Multimedia
CLEF-2005 CL-SR at maryland: document and query expansion using side collections and thesauri
CLEF'05 Proceedings of the 6th international conference on Cross-Language Evalution Forum: accessing Multilingual Information Repositories
Spoken Content Retrieval: A Survey of Techniques and Technologies
Foundations and Trends in Information Retrieval
Hi-index | 0.00 |
Several years of research have suggested that the accuracy of spoken document retrieval systems is not adversely affected by speech recognition errors. Even with error rates of around 40%, the effectiveness of an IR system falls less than 10%. The paper hypothesizes that this robust behavior is the result of repetition of important words in the text--meaning that losing one or two occurrences is not crippling-- and the result of additional related words providing a greater context-- meaning that those words will match even if the seemingly critical word is misrecognized. This hypothesis is supported by examples from TREC's SDR track, the TDT evaluation, and some work showing the impact of recognition errors on spoken queries.