A trainable approach for multi-lingual speech-to-speech translation system

  • Authors:
  • Y. Gao;J. Sorensen;H. Erdogan;R. Sarikaya;F. Liu;M. Picheny;B. Zhou;Z. Diao

  • Affiliations:
  • IBM T. J. Watson Research Center;IBM T. J. Watson Research Center;IBM T. J. Watson Research Center;IBM T. J. Watson Research Center;IBM T. J. Watson Research Center;IBM T. J. Watson Research Center;Univ. of Colorado at Boulder;Texas A&M University

  • Venue:
  • HLT '02 Proceedings of the second international conference on Human Language Technology Research
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents a statistical speech-to-speech machine translation (MT) system for limited domain applications using a cascaded approach. This architecture allows for the creation of multilingual applications. In this paper, the system architecture and its components, including the speech recognition, parsing, information extraction, translation, natural language generation (NLG) and text-to-speech (TTS) components are described. We have implemented the described system for translating speech between Mandarin and English language pair in an air travel application domain. We are current porting the system to the military domain. Encouraging experimental results have been observed and are presented.