Distributed speech translation technologies for multiparty multilingual communication

  • Authors:
  • Sakriani Sakti; Michael Paul; Andrew Finch; Xinhui Hu; Jinfu Ni; Noriyuki Kimura; Shigeki Matsuda; Chiori Hori; Yutaka Ashikari; Hisashi Kawai; Hideki Kashioka; Eiichiro Sumita; Satoshi Nakamura

  • Affiliations:
  • Nara Institute of Science and Technology, Japan (S. Sakti; S. Nakamura, jointly affiliated); National Institute of Information and Communications Technology, Japan (all other authors)

  • Venue:
  • ACM Transactions on Speech and Language Processing (TSLP)
  • Year:
  • 2012

Abstract

Developing a multilingual speech translation system requires efforts in constructing automatic speech recognition (ASR), machine translation (MT), and text-to-speech synthesis (TTS) components for all possible source and target languages. If the numerous ASR, MT, and TTS systems for different language pairs developed independently in different parts of the world could be connected, multilingual speech translation systems for a multitude of language pairs could be achieved. Yet, there is currently no common, flexible framework that can provide an entire speech translation process by bringing together heterogeneous speech translation components. In this article we therefore propose a distributed architecture framework for multilingual speech translation in which all speech translation components are provided on distributed servers and cooperate over a network. This framework can facilitate the connection of different components and functions. To show the overall mechanism, we first present our state-of-the-art technologies for multilingual ASR, MT, and TTS components, and then describe how to combine those systems into the proposed network-based framework. The client applications are implemented on a handheld mobile terminal device, and all data exchanges among client users and spoken language technology servers are managed through a Web protocol. To support multiparty communication, an additional communication server is provided for simultaneously distributing the speech translation results from one user to multiple users. Field testing shows that the system is capable of realizing multiparty multilingual speech translation for real-time and location-independent communication.
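The abstract describes a pipeline in which ASR, MT, and TTS components developed independently are registered on distributed servers and chained over a network. A minimal sketch of that idea follows; the registry layout, component keys, and stub functions standing in for remote servers are illustrative assumptions, not the paper's actual Web protocol or message formats.

```python
class ServiceRegistry:
    """Maps (component, source language, target language) to a server callable.

    In the proposed framework each callable would be a remote spoken
    language technology server reached over a Web protocol; here plain
    functions stand in so the chaining logic is visible.
    """

    def __init__(self):
        self._services = {}

    def register(self, component, src, tgt, fn):
        self._services[(component, src, tgt)] = fn

    def lookup(self, component, src, tgt):
        return self._services[(component, src, tgt)]


def speech_translate(registry, utterance, src, tgt):
    """Chain ASR -> MT -> TTS, each component possibly on a different server."""
    asr = registry.lookup("ASR", src, src)   # speech -> text, source language
    mt = registry.lookup("MT", src, tgt)     # text -> text, cross-language
    tts = registry.lookup("TTS", tgt, tgt)   # text -> speech, target language
    return tts(mt(asr(utterance)))


# Stub components standing in for heterogeneous remote servers.
registry = ServiceRegistry()
registry.register("ASR", "ja", "ja", lambda audio: f"text({audio})")
registry.register("MT", "ja", "en", lambda text: f"en({text})")
registry.register("TTS", "en", "en", lambda text: f"speech({text})")

print(speech_translate(registry, "konnichiwa.wav", "ja", "en"))
# -> speech(en(text(konnichiwa.wav)))
```

Because components are looked up by language pair rather than hard-wired, adding a new language direction only requires registering its ASR, MT, and TTS servers, which mirrors the flexibility the framework is designed to provide. Multiparty distribution, as in the paper's communication server, would amount to fanning the final result out to several subscribed clients.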