Managing out-of-grammar utterances by topic estimation with domain extensibility in multi-domain spoken dialogue systems

  • Authors:
  • Kazunori Komatani;Satoshi Ikeda;Tetsuya Ogata;Hiroshi G. Okuno

  • Affiliations:
  • Graduate School of Informatics, Kyoto University, Yoshida-Hommachi, Sakyo, Kyoto 606-8501, Japan;Graduate School of Informatics, Kyoto University, Yoshida-Hommachi, Sakyo, Kyoto 606-8501, Japan;Graduate School of Informatics, Kyoto University, Yoshida-Hommachi, Sakyo, Kyoto 606-8501, Japan;Graduate School of Informatics, Kyoto University, Yoshida-Hommachi, Sakyo, Kyoto 606-8501, Japan

  • Venue:
  • Speech Communication
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Spoken dialogue systems must inevitably deal with out-of-grammar utterances. We address this problem in multi-domain spoken dialogue systems, which deal with more tasks than a single-domain system. We defined a topic by augmenting a domain about which users want to find more information, and we developed a method of recovering out-of-grammar utterances based on topic estimation, i.e., by providing a help message in the estimated domain. Moreover, domain extensibility, that is, the ability to add new domains to the system, should be inherently retained in multi-domain systems. To estimate domains without sacrificing extensibility, we collected documents from the Web as training data. Since the data contained a certain amount of noise, we used latent semantic mapping (LSM), which enables robust topic estimation by removing the effects of noise from the data. Experimental results showed that our method improved topic estimation accuracy by 23.2 points for data including out-of-grammar utterances.