Hidden Markov models, maximum mutual information estimation, and the speech recognition problem
Hidden Markov models, maximum mutual information estimation, and the speech recognition problem
Testing the correlation of word error rate and perplexity
Speech Communication
A systematic comparison of various statistical alignment models
Computational Linguistics
Speaker Normalization Based on Frequency Warping
ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97)-Volume 2 - Volume 2
The Karlsruhe-Verbmobil Speech Recognition Engine
ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97) -Volume 1 - Volume 1
Lecture and Presentation Tracking in an Intelligent Meeting Room
ICMI '02 Proceedings of the 4th IEEE International Conference on Multimodal Interfaces
The mathematics of statistical machine translation: parameter estimation
Computational Linguistics - Special issue on using large corpora: II
HMM-based word alignment in statistical translation
COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 2
LingWear: a mobile tourist information system
HLT '01 Proceedings of the first international conference on Human language technology research
BLEU: a method for automatic evaluation of machine translation
ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
NAACL-Short '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology: companion volume of the Proceedings of HLT-NAACL 2003--short papers - Volume 2
Minimum error rate training in statistical machine translation
ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
Structural event detection for rich transcription of speech
Structural event detection for rich transcription of speech
Recent Progress in Corpus-Based Spontaneous Speech Recognition
IEICE - Transactions on Information and Systems
JANUS: a speech-to-speech translation system using connectionist and symbolic processing strategies
ICASSP '91 Proceedings of the Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference
Accessibility, transcription, and access everywhere
IBM Systems Journal
A parametric approach to vocal tract length normalization
ICASSP '96 Proceedings of the Acoustics, Speech, and Signal Processing, 1996. on Conference Proceedings., 1996 IEEE International Conference - Volume 01
ICASSP '96 Proceedings of the Acoustics, Speech, and Signal Processing, 1996. on Conference Proceedings., 1996 IEEE International Conference - Volume 02
Automatic evaluation of machine translation quality using n-gram co-occurrence statistics
HLT '02 Proceedings of the second international conference on Human Language Technology Research
Computers in the Human Interaction Loop
Computers in the Human Interaction Loop
Phrasetable smoothing for statistical machine translation
EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Text segmentation criteria for statistical machine translation
FinTAL'06 Proceedings of the 5th international conference on Advances in Natural Language Processing
The AMI meeting corpus: a pre-announcement
MLMI'05 Proceedings of the Second international conference on Machine Learning for Multimodal Interaction
PROPOR'06 Proceedings of the 7th international conference on Computational Processing of the Portuguese Language
The ISL RT-06S speech-to-text system
MLMI'06 Proceedings of the Third international conference on Machine Learning for Multimodal Interaction
The IBM rich transcription spring 2006 speech-to-text system for lecture meetings
MLMI'06 Proceedings of the Third international conference on Machine Learning for Multimodal Interaction
The LIMSI RT06s lecture transcription system
MLMI'06 Proceedings of the Third international conference on Machine Learning for Multimodal Interaction
System Combination for Machine Translation of Spoken and Written Language
IEEE Transactions on Audio, Speech, and Language Processing
Enhancing bilingual electronic group meeting comprehension with round-trip translations
International Journal of Information Systems and Change Management
Panning for EBMT gold, or "Remembering not to forget"
Machine Translation
An Exploratory Study of How Technology Supports Communication in Multilingual Groups
International Journal of e-Collaboration
Hi-index | 0.00 |
With increasing globalization, communication across language and cultural boundaries is becoming an essential requirement of doing business, delivering education, and providing public services. Due to the considerable cost of human translation services, only a small fraction of text documents and an even smaller percentage of spoken encounters, such as international meetings and conferences, are translated, with most resorting to the use of a common language (e.g. English) or not taking place at all. Technology may provide a potentially revolutionary way out if real-time, domain-independent, simultaneous speech translation can be realized. In this paper, we present a simultaneous speech translation system based on statistical recognition and translation technology. We discuss the technology, various system improvements and propose mechanisms for user-friendly delivery of the result. Over extensive component and end-to-end system evaluations and comparisons with human translation performance, we conclude that machines can already deliver comprehensible simultaneous translation output. Moreover, while machine performance is affected by recognition errors (and thus can be improved), human performance is limited by the cognitive challenge of performing the task in real time.