Simultaneous translation of lectures and speeches

Authors:
Christian Fügen;Alex Waibel;Muntsin Kolss
Affiliations:
International Center for Advanced Communication Technologies (InterACT), Fakultät für Informatik, Universität Karlsruhe (TH), Karlsruhe, Germany 76131 and Mobile Technologies LLC, P ...;International Center for Advanced Communication Technologies (InterACT), Fakultät für Informatik, Universität Karlsruhe (TH), Karlsruhe, Germany 76131 and International Center for A ...;International Center for Advanced Communication Technologies (InterACT), Fakultät für Informatik, Universität Karlsruhe (TH), Karlsruhe, Germany 76131
Venue:
Machine Translation
Year:
2007

Citing 28
Cited 4

Hidden Markov models, maximum mutual information estimation, and the speech recognition problem

Hidden Markov models, maximum mutual information estimation, and the speech recognition problem
Testing the correlation of word error rate and perplexity

Speech Communication
A systematic comparison of various statistical alignment models

Computational Linguistics
Speaker Normalization Based on Frequency Warping

ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97)-Volume 2 - Volume 2
The Karlsruhe-Verbmobil Speech Recognition Engine

ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97) -Volume 1 - Volume 1
Lecture and Presentation Tracking in an Intelligent Meeting Room

ICMI '02 Proceedings of the 4th IEEE International Conference on Multimodal Interfaces
The mathematics of statistical machine translation: parameter estimation

Computational Linguistics - Special issue on using large corpora: II
HMM-based word alignment in statistical translation

COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 2
LingWear: a mobile tourist information system

HLT '01 Proceedings of the first international conference on Human language technology research
BLEU: a method for automatic evaluation of machine translation

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Getting more mileage from web text sources for conversational speech language modeling using class-dependent mixtures

NAACL-Short '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology: companion volume of the Proceedings of HLT-NAACL 2003--short papers - Volume 2
Minimum error rate training in statistical machine translation

ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
Structural event detection for rich transcription of speech

Structural event detection for rich transcription of speech
Recent Progress in Corpus-Based Spontaneous Speech Recognition

IEICE - Transactions on Information and Systems
JANUS: a speech-to-speech translation system using connectionist and symbolic processing strategies

ICASSP '91 Proceedings of the Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference
Accessibility, transcription, and access everywhere

IBM Systems Journal
A parametric approach to vocal tract length normalization

ICASSP '96 Proceedings of the Acoustics, Speech, and Signal Processing, 1996. on Conference Proceedings., 1996 IEEE International Conference - Volume 01
The bucket box intersection (BBI) algorithm for fast approximative evaluation of diagonal mixture Gaussians

ICASSP '96 Proceedings of the Acoustics, Speech, and Signal Processing, 1996. on Conference Proceedings., 1996 IEEE International Conference - Volume 02
Automatic evaluation of machine translation quality using n-gram co-occurrence statistics

HLT '02 Proceedings of the second international conference on Human Language Technology Research
Computers in the Human Interaction Loop

Computers in the Human Interaction Loop
Phrasetable smoothing for statistical machine translation

EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Text segmentation criteria for statistical machine translation

FinTAL'06 Proceedings of the 5th international conference on Advances in Natural Language Processing
The AMI meeting corpus: a pre-announcement

MLMI'05 Proceedings of the Second international conference on Machine Learning for Multimodal Interaction
Classroom lecture recognition

PROPOR'06 Proceedings of the 7th international conference on Computational Processing of the Portuguese Language
The ISL RT-06S speech-to-text system

MLMI'06 Proceedings of the Third international conference on Machine Learning for Multimodal Interaction
The IBM rich transcription spring 2006 speech-to-text system for lecture meetings

MLMI'06 Proceedings of the Third international conference on Machine Learning for Multimodal Interaction
The LIMSI RT06s lecture transcription system

MLMI'06 Proceedings of the Third international conference on Machine Learning for Multimodal Interaction
System Combination for Machine Translation of Spoken and Written Language

IEEE Transactions on Audio, Speech, and Language Processing

Enhancing bilingual electronic group meeting comprehension with round-trip translations

International Journal of Information Systems and Change Management
Panning for EBMT gold, or "Remembering not to forget"

Machine Translation
David Bellos (ed): Is that a fish in your ear: translation and the meaning of everything

Machine Translation
An Exploratory Study of How Technology Supports Communication in Multilingual Groups

International Journal of e-Collaboration

Quantified Score

Hi-index	0.00

Visualization

Abstract

With increasing globalization, communication across language and cultural boundaries is becoming an essential requirement of doing business, delivering education, and providing public services. Due to the considerable cost of human translation services, only a small fraction of text documents and an even smaller percentage of spoken encounters, such as international meetings and conferences, are translated, with most resorting to the use of a common language (e.g. English) or not taking place at all. Technology may provide a potentially revolutionary way out if real-time, domain-independent, simultaneous speech translation can be realized. In this paper, we present a simultaneous speech translation system based on statistical recognition and translation technology. We discuss the technology, various system improvements and propose mechanisms for user-friendly delivery of the result. Over extensive component and end-to-end system evaluations and comparisons with human translation performance, we conclude that machines can already deliver comprehensible simultaneous translation output. Moreover, while machine performance is affected by recognition errors (and thus can be improved), human performance is limited by the cognitive challenge of performing the task in real time.