Improving the robustness to recognition errors in speech input question answering

Authors:
Hideki Tsutsui;Toshihiko Manabe;Mika Fukui;Tetsuya Sakai;Hiroko Fujii;Koji Urata
Affiliations:
Knowledge Media Laboratory, Corporate R&D Center, TOSHIBA Corp., Kawasaki, Japan;Knowledge Media Laboratory, Corporate R&D Center, TOSHIBA Corp., Kawasaki, Japan;Knowledge Media Laboratory, Corporate R&D Center, TOSHIBA Corp., Kawasaki, Japan;Knowledge Media Laboratory, Corporate R&D Center, TOSHIBA Corp., Kawasaki, Japan;Knowledge Media Laboratory, Corporate R&D Center, TOSHIBA Corp., Kawasaki, Japan;Knowledge Media Laboratory, Corporate R&D Center, TOSHIBA Corp., Kawasaki, Japan
Venue:
AIRS'06 Proceedings of the Third Asia conference on Information Retrieval Technology
Year:
2006

Citing 3
Cited 0

Speech-Driven Text Retrieval: Using Target IR Collections for Statistical Language Model Adaptation in Speech Recognition

Information Retrieval Techniques for Speech Applications [this book is based on the workshop “Information Retrieval Techniques for Speech Applications”, held as part of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in New Orleans, USA, in September 2001].
Dialog navigator: a spoken dialog Q-A system based on large text knowledge base

ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 2
Overview of the CLEF 2004 multilingual question answering track

CLEF'04 Proceedings of the 5th conference on Cross-Language Evaluation Forum: multilingual Information Access for Text, Speech and Images

Quantified Score

Hi-index	0.00

Visualization

Abstract

In our previous work, we developed a prototype of a speech-input help system for home appliances such as digital cameras and microwave ovens. Given a factoid question, the system performs textual question answering using the manuals as the knowledge source. Whereas, given a HOW question, it retrieves and plays a demonstration video. However, our first prototype suffered from speech recognition errors, especially when the Japanese interrogative phrases in factoid questions were misrecognized. We therefore propose a method for solving this problem, which complements a speech query transcript with an interrogative phrase selected from a pre-determined list. The selection process first narrows down candidate phrases based on co-occurrences within the manual text, and then computes the similarity between each candidate and the query transcript in terms of pronunciation. Our method improves the Mean Reciprocal Rank of top three answers from 0.429 to 0.597 for factoid questions.