Automatic acquisition of hierarchical transduction models for machine translation

Authors:
Hiyan Alshawi;Srinivas Bangalore;Shona Douglas
Affiliations:
AT&T Labs Research, Florham Park, NJ;AT&T Labs Research, Florham Park, NJ;AT&T Labs Research, Florham Park, NJ
Venue:
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Year:
1998


Abstract

We describe a method for the fully automatic learning of hierarchical finite-state translation models. The input to the method is a set of transcribed speech utterances and their corresponding human translations, and the output is a set of head transducers, i.e., statistical lexical head-outward transducers. A word-alignment function and a head-ranking function are first obtained, and then counts are generated for hypothesized state transitions of head transducers whose lexical translations and word-order changes are consistent with the alignment. The method has been applied to create an English-Spanish translation model for a speech translation application, achieving word accuracy of over 75% as measured by a string-distance comparison to three reference translations.
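The abstract does not spell out how the string-distance word accuracy is computed. As a minimal sketch, assuming accuracy is one minus the word-level edit distance divided by the reference length, and assuming the best score over the available references is reported, the measure could look like the following. The function names and the example sentences are hypothetical and are not taken from the paper.

```python
def edit_distance(hyp, ref):
    """Word-level Levenshtein distance between two token lists."""
    # prev[j] holds the distance between the hypothesis prefix seen so far
    # and the first j reference words.
    prev = list(range(len(ref) + 1))
    for i, h in enumerate(hyp, start=1):
        curr = [i]
        for j, r in enumerate(ref, start=1):
            cost = 0 if h == r else 1
            curr.append(min(prev[j] + 1,          # drop a hypothesis word
                            curr[j - 1] + 1,      # add a missing reference word
                            prev[j - 1] + cost))  # match or substitute
        prev = curr
    return prev[-1]


def word_accuracy(hypothesis, references):
    """Best accuracy over all references (assumed scoring rule).

    Accuracy against one reference is assumed to be
    1 - edit_distance / reference_length, floored at 0.
    """
    hyp = hypothesis.split()
    best = 0.0
    for reference in references:
        ref = reference.split()
        if not ref:
            continue  # skip empty references to avoid dividing by zero
        score = 1.0 - edit_distance(hyp, ref) / len(ref)
        best = max(best, score)
    return best


# Hypothetical system output scored against three hypothetical references.
references = [
    "quiero un vuelo para Boston",
    "deseo un vuelo a Boston",
    "quiero volar a Boston",
]
accuracy = word_accuracy("quiero un vuelo a Boston", references)
```

Taking the maximum over references is one plausible way to use several human translations; averaging the per-reference scores would be an equally simple alternative and would generally give lower numbers.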