GREAT: open source software for statistical machine translation

Authors:
Jorge González;Francisco Casacuberta
Affiliations:
Departamento de Sistemas Informáticos y Computación, Instituto Tecnológico de Informática, Universitat Politècnica de València, València, Spain;Departamento de Sistemas Informáticos y Computación, Instituto Tecnológico de Informática, Universitat Politècnica de València, València, Spain
Venue:
Machine Translation
Year:
2011

Citing 24
Cited 0

Bilingual Sentence Alignment: Balancing Robustness and Accuracy

Machine Translation
The EuTrans Spoken Language Translation System

Machine Translation
Defense of the ansatz for dynamical hierarchies

Artificial Life
Applications of Finite-State Transducers in Natural Language Processing

CIAA '00 Revised Papers from the 5th International Conference on Implementation and Application of Automata
Translation with Finite-State Devices

AMTA '98 Proceedings of the Third Conference of the Association for Machine Translation in the Americas on Machine Translation and the Information Soup
A systematic comparison of various statistical alignment models

Computational Linguistics
Finite-State Speech-to-Speech Translation

ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97) -Volume 1 - Volume 1
Stochastic Finite-State Models for Spoken Language MachineTranslation

Machine Translation
Probabilistic Finite-State Machines-Part II

IEEE Transactions on Pattern Analysis and Machine Intelligence
Discriminative training and maximum entropy models for statistical machine translation

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
BLEU: a method for automatic evaluation of machine translation

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Machine Translation with Inferred Stochastic Finite-State Transducers

Computational Linguistics
A phrase-based, joint probability model for statistical machine translation

EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
A weighted finite state transducer translation template model for statistical machine translation

Natural Language Engineering
Learning finite-state models for machine translation

Machine Learning
N-gram-based Machine Translation

Computational Linguistics
ON THE STATISTICAL ESTIMATION OF STOCHASTIC FINITE-STATE TRANSDUCERS IN MACHINE TRANSLATION

Applied Artificial Intelligence
Joining linguistic and statistical methods for Spanish-to-Basque speech translation

Speech Communication
Moses: open source toolkit for statistical machine translation

ACL '07 Proceedings of the 45th Annual Meeting of the ACL on Interactive Poster and Demonstration Sessions
Phrasetable smoothing for statistical machine translation

EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Joshua: an open source toolkit for parsing-based machine translation

StatMT '09 Proceedings of the Fourth Workshop on Statistical Machine Translation
Novel reordering approaches in phrase-based statistical machine translation

ParaText '05 Proceedings of the ACL Workshop on Building and Using Parallel Texts
GREAT: a finite-state machine translation toolkit implementing a grammatical inference approach for transducer inference (GIATI)

CLAGI '09 Proceedings of the EACL 2009 Workshop on Computational Linguistic Aspects of Grammatical Inference
Statistical Machine Translation

Statistical Machine Translation

Quantified Score

Hi-index	0.04

Visualization

Abstract

In this article, the first public release of GREAT as an open-source, statistical machine translation (SMT) software toolkit is described. GREAT is based on a bilingual language modelling approach for SMT, which is so far implemented for n-gram models based on the framework of stochastic finite-state transducers. The use of finite-state models is motivated by their simplicity, their versatility, and the fact that they present a lower computational cost, if compared with other more expressive models. Moreover, if translation is assumed to be a subsequential process, finite-state models are enough for modelling the existing relations between a source and a target language. GREAT includes some characteristics usually present in state-of-the-art SMT, such as phrase-based translation models or a log-linear framework for local features. Experimental results on a well-known corpus such as Europarl are reported in order to validate this software. A competitive translation quality is achieved, yet using both a lower number of model parameters and a lower response time than the widely-used, state-of-the-art SMT system Moses.