A trainable approach for multi-lingual speech-to-speech translation system

Authors:
Y. Gao;J. Sorensen;H. Erdogan;R. Sarikaya;F. Liu;M. Picheny;B. Zhou;Z. Diao
Affiliations:
IBM T. J. Watson Research Center;IBM T. J. Watson Research Center;IBM T. J. Watson Research Center;IBM T. J. Watson Research Center;IBM T. J. Watson Research Center;IBM T. J. Watson Research Center;Univ. of Colorado at Boulder;Texas A&M University
Venue:
HLT '02 Proceedings of the second international conference on Human Language Technology Research
Year:
2002

Citing 4
Cited 1

Natural language parsing as statistical pattern recognition

Natural language parsing as statistical pattern recognition
A maximum entropy approach to natural language processing

Computational Linguistics
Trainable methods for surface natural language generation

NAACL 2000 Proceedings of the 1st North American chapter of the Association for Computational Linguistics conference
Phrase splicing and variable substitution using the IBM trainable speech synthesis system

ICASSP '99 Proceedings of the Acoustics, Speech, and Signal Processing, 1999. on 1999 IEEE International Conference - Volume 01

High-quality speech-to-speech translation for computer-aided language learning

ACM Transactions on Speech and Language Processing (TSLP)

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper presents a statistical speech-to-speech machine translation (MT) system for limited domain applications using a cascaded approach. This architecture allows for the creation of multilingual applications. In this paper, the system architecture and its components, including the speech recognition, parsing, information extraction, translation, natural language generation (NLG) and text-to-speech (TTS) components are described. We have implemented the described system for translating speech between Mandarin and English language pair in an air travel application domain. We are current porting the system to the military domain. Encouraging experimental results have been observed and are presented.