Improving Generalization with Active Learning
Machine Learning - Special issue on structured connectionist systems
Minimum error rate training in statistical machine translation
ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
Word-Level Confidence Estimation for Machine Translation
Computational Linguistics
Contributions to research on machine translation
Contributions to research on machine translation
Hierarchical sampling for active learning
Proceedings of the 25th international conference on Machine learning
NRC's PORTAGE system for WMT 2007
StatMT '07 Proceedings of the Second Workshop on Statistical Machine Translation
Localization of difficult-to-translate phrases
StatMT '07 Proceedings of the Second Workshop on Statistical Machine Translation
Learning performance of a machine translation system: a statistical and computational analysis
StatMT '08 Proceedings of the Third Workshop on Statistical Machine Translation
Active learning for multilingual statistical machine translation
ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1 - Volume 1
Bucking the trend: large-scale cost-focused active learning for statistical machine translation
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Active semi-supervised learning for improving word alignment
ALNLP '10 Proceedings of the NAACL HLT 2010 Workshop on Active Learning for Natural Language Processing
A semi-supervised batch-mode active learning strategy for improved statistical machine translation
CoNLL '10 Proceedings of the Fourteenth Conference on Computational Natural Language Learning
Discriminative sample selection for statistical machine translation
EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
An active learning scenario for interactive machine translation
ICMI '11 Proceedings of the 13th international conference on multimodal interfaces
Instance selection for machine translation using feature decay algorithms
WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
Does more data always yield better translations?
EACL '12 Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics
Active learning for interactive machine translation
EACL '12 Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics
Batch-mode semi-supervised active learning for statistical machine translation
Computer Speech and Language
Cost-sensitive active learning for computer-assisted translation
Pattern Recognition Letters
Hi-index | 0.00 |
Statistical machine translation (SMT) models need large bilingual corpora for training, which are unavailable for some language pairs. This paper provides the first serious experimental study of active learning for SMT. We use active learning to improve the quality of a phrase-based SMT system, and show significant improvements in translation compared to a random sentence selection baseline, when test and training data are taken from the same or different domains. Experimental results are shown in a simulated setting using three language pairs, and in a realistic situation for Bangla-English, a language pair with limited translation resources.