Fast translation rule matching for syntax-based statistical machine translation

Authors:
Hui Zhang;Min Zhang;Haizhou Li;Chew Lim Tan
Affiliations:
Institute for Infocomm Research and National University of Singapore;Institute for Infocomm Research;Institute for Infocomm Research;National University of Singapore
Venue:
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
Year:
2009

Citing 13
Cited 3

An efficient augmented-context-free parsing algorithm

Computational Linguistics
A systematic comparison of various statistical alignment models

Computational Linguistics
Decoding complexity in word-replacement translation models

Computational Linguistics
A maximum-entropy-inspired parser

NAACL 2000 Proceedings of the 1st North American chapter of the Association for Computational Linguistics conference
Parsing with treebank grammars: empirical bounds, theoretical models, and the structure of the Penn Treebank

ACL '01 Proceedings of the 39th Annual Meeting on Association for Computational Linguistics
BLEU: a method for automatic evaluation of machine translation

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Minimum error rate training in statistical machine translation

ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
Tree-to-string alignment template for statistical machine translation

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Moses: open source toolkit for statistical machine translation

ACL '07 Proceedings of the 45th Annual Meeting of the ACL on Interactive Poster and Demonstration Sessions
Grammar comparison study for translational equivalence modeling and statistical machine translation

COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
Forest-based translation rule extraction

EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Better k-best parsing

Parsing '05 Proceedings of the Ninth International Workshop on Parsing Technology
Forest-based tree sequence to string translation model

ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1 - Volume 1

Convolution kernel over packed parse forest

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Non-isomorphic forest pair translation

EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
Efficient retrieval of tree translation examples for syntax-based machine translation

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

In a linguistically-motivated syntax-based translation system, the entire translation process is normally carried out in two steps, translation rule matching and target sentence decoding using the matched rules. Both steps are very time-consuming due to the tremendous number of translation rules, the exhaustive search in translation rule matching and the complex nature of the translation task itself. In this paper, we propose a hyper-tree-based fast algorithm for translation rule matching. Experimental results on the NIST MT-2003 Chinese-English translation task show that our algorithm is at least 19 times faster in rule matching and is able to help to save 57% of overall translation time over previous methods when using large fragment translation rules.