Data-Oriented Translation (DOT), based on Data-Oriented Parsing (DOP), is a language-independent MT engine that exploits parsed, aligned bitexts to produce very high-quality translations. However, data acquisition is a serious bottleneck, because DOT requires parsed sentences aligned at both the sentential and sub-structural levels. Manual sub-structural alignment is time-consuming and error-prone, and it requires considerable knowledge of both the source and target languages and of how they are related. Automating this process is essential for carrying out the large-scale translation experiments needed to assess the full potential of DOT.

We present a novel algorithm that automatically induces sub-structural alignments between context-free phrase-structure trees quickly and consistently, requiring little or no knowledge of the language pair. We present results from a number of experiments which indicate that our method provides a serious alternative to manual alignment.
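To make "aligned at both sentential and sub-structural levels" concrete, the following is a minimal sketch of how one aligned tree pair might be represented. It illustrates only the kind of data DOT consumes, not the alignment algorithm itself, which the abstract does not specify. The names `Node`, `aligned_yields`, and `is_one_to_one`, the example sentence pair, and the one-link-per-node check are all assumptions made for this illustration.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class Node:
    """A node in a context-free phrase-structure tree (hypothetical type)."""
    label: str
    children: List["Node"] = field(default_factory=list)
    word: Optional[str] = None  # set only on nodes that directly cover a word

    def leaves(self) -> List[str]:
        """Return the words covered by this node, left to right."""
        if not self.children:
            return [self.word] if self.word is not None else []
        words: List[str] = []
        for child in self.children:
            words.extend(child.leaves())
        return words


# A sub-structural link pairs one source node with one target node.
Link = Tuple[Node, Node]


def aligned_yields(links: List[Link]) -> List[Tuple[str, str]]:
    """For each link, return the (source string, target string) it covers."""
    return [(" ".join(s.leaves()), " ".join(t.leaves())) for s, t in links]


def is_one_to_one(links: List[Link]) -> bool:
    """Assumed constraint: each node takes part in at most one link."""
    src_ids = [id(s) for s, _ in links]
    tgt_ids = [id(t) for _, t in links]
    return len(set(src_ids)) == len(src_ids) and len(set(tgt_ids)) == len(tgt_ids)


# Illustrative English-French pair (invented for this sketch).
en_np, en_vp = Node("NP", word="John"), Node("VP", word="swims")
fr_np, fr_vp = Node("NP", word="Jean"), Node("VP", word="nage")
en_s = Node("S", children=[en_np, en_vp])
fr_s = Node("S", children=[fr_np, fr_vp])

# The S-S link is the sentential alignment; the others are sub-structural.
links: List[Link] = [(en_s, fr_s), (en_np, fr_np), (en_vp, fr_vp)]

for src, tgt in aligned_yields(links):
    print(f"{src!r} <-> {tgt!r}")
```

Each link marks a pair of subtrees as translationally equivalent. Producing such links by hand for every tree pair in a corpus is the manual effort that the abstract identifies as the bottleneck.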