Multilingual spoken language corpus development for communication research

  • Authors:
  • Toshiyuki Takezawa

  • Affiliations:
  • ATR Spoken Language Communication Research Laboratories, National Institute of Information and Communications Technology, Kyoto, Japan

  • Venue:
  • ISCSLP'06 Proceedings of the 5th international conference on Chinese Spoken Language Processing
  • Year:
  • 2006

Quantified Score

Hi-index 0.02

Visualization

Abstract

A multilingual spoken language corpus is indispensable for spoken language communication research such as speech-to-speech translation. To promote multilingual spoken language research and development, unified structure and annotation, such as tagging, is indispensable for both speech and natural language processing. We describe our experience with multilingual spoken language corpus development at our research institution, focusing in particular on speech recognition and natural language processing for speech translation of travel conversations.