Learning finite-state models for machine translation

Authors:
Francisco Casacuberta;Enrique Vidal
Affiliations:
Departamento de Sistemas Informáticos y Computación, Instituto Tecnológico de Informática, Universidad Politécnica de Valencia, Valencia, Spain 46071;Departamento de Sistemas Informáticos y Computación, Instituto Tecnológico de Informática, Universidad Politécnica de Valencia, Valencia, Spain 46071
Venue:
Machine Learning
Year:
2007

Citing 35
Cited 11

A statistical approach to machine translation

Computational Linguistics
Statistical methods for speech recognition

Statistical methods for speech recognition
Efficient Error-Correcting Viterbi Parsing

IEEE Transactions on Pattern Analysis and Machine Intelligence
The EuTrans Spoken Language Translation System

Machine Translation
Defense of the ansatz for dynamical hierarchies

Artificial Life
Language Simplification through Error-Correcting and Grammatical Inference Techniques

Machine Learning
Learning Subsequential Transducers for Pattern Recognition Interpretation Tasks

IEEE Transactions on Pattern Analysis and Machine Intelligence
Improve the Learning of Subsequential Transducers by Using Alignments and Dictionaries

ICGI '00 Proceedings of the 5th International Colloquium on Grammatical Inference: Algorithms and Applications
Inference of Finite-State Transducers by Using Regular Grammars and Morphisms

ICGI '00 Proceedings of the 5th International Colloquium on Grammatical Inference: Algorithms and Applications
Application of OSTIA to Machine Translation Tasks

ICGI '94 Proceedings of the Second International Colloquium on Grammatical Inference and Applications
Using domain information during the learning of a subsequential transducer

ICG! '96 Proceedings of the 3rd International Colloquium on Grammatical Inference: Learning Syntax from Sentences
Phrase-Based Statistical Machine Translation

KI '02 Proceedings of the 25th Annual German Conference on AI: Advances in Artificial Intelligence
Computational Complexity of Problems on Probabilistic Grammars and Transducers

ICGI '00 Proceedings of the 5th International Colloquium on Grammatical Inference: Algorithms and Applications
A systematic comparison of various statistical alignment models

Computational Linguistics
Word reordering and a dynamic programming beam search algorithm for statistical machine translation

Computational Linguistics
Finite-State Speech-to-Speech Translation

ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97) -Volume 1 - Volume 1
Stochastic Finite-State Models for Spoken Language MachineTranslation

Machine Translation
Learning dependency translation models as collections of finite-state head transducers

Computational Linguistics - Special issue on finite-state methods in NLP
The mathematics of statistical machine translation: parameter estimation

Computational Linguistics - Special issue on using large corpora: II
Decoding complexity in word-replacement translation models

Computational Linguistics
Text and speech translation by means of subsequential transducers

Natural Language Engineering
Natural languages and the Chomsky hierarchy

EACL '85 Proceedings of the second conference on European chapter of the Association for Computational Linguistics
Efficient search for interactive statistical machine translation

EACL '03 Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics - Volume 1
Probabilistic Finite-State Machines-Part II

IEEE Transactions on Pattern Analysis and Machine Intelligence
BLEU: a method for automatic evaluation of machine translation

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Greedy decoding for statistical machine translation in almost linear time

NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
A weighted finite state transducer implementation of the alignment template model for statistical machine translation

NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
The Alignment Template Approach to Statistical Machine Translation

Computational Linguistics
Machine Translation with Inferred Stochastic Finite-State Transducers

Computational Linguistics
TransType: a computer-aided translation typing system

NAACL-ANLP-EMTS '00 Proceedings of the 2000 NAACL-ANLP Workshop on Embedded machine translation systems - Volume 5
A phrase-based, joint probability model for statistical machine translation

EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
A weighted finite state transducer translation template model for statistical machine translation

Natural Language Engineering
Stochastic inversion transduction grammars with application to segmentation, bracketing, and alignment of parallel corpora

IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2
Inference of finite-state transducers from regular languages

Pattern Recognition
Phrase-based alignment models for statistical machine translation

IbPRIA'05 Proceedings of the Second Iberian conference on Pattern Recognition and Image Analysis - Volume Part II

Combining automatic acquisition of knowledge with machine learning approaches for multilingual temporal recognition and normalization

Information Sciences: an International Journal
Introducing Additional Input Information into Interactive Machine Translation Systems

MLMI '08 Proceedings of the 5th international workshop on Machine Learning for Multimodal Interaction
ON THE STATISTICAL ESTIMATION OF STOCHASTIC FINITE-STATE TRANSDUCERS IN MACHINE TRANSLATION

Applied Artificial Intelligence
Joining linguistic and statistical methods for Spanish-to-Basque speech translation

Speech Communication
Statistical approaches to computer-assisted translation

Computational Linguistics
Improving interactive machine translation via mouse actions

EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Interactive pattern recognition

MLMI'07 Proceedings of the 4th international conference on Machine learning for multimodal interaction
GREAT: open source software for statistical machine translation

Machine Translation
Hierarchical finite-state models for speech translation using categorization of phrases

CICLing'10 Proceedings of the 11th international conference on Computational Linguistics and Intelligent Text Processing
Stochastic K-TSS bi-languages for machine translation

FSMNLP '11 Proceedings of the 9th International Workshop on Finite State Methods and Natural Language Processing
Cost-sensitive active learning for computer-assisted translation

Pattern Recognition Letters

Quantified Score

Hi-index	0.00

Visualization

Abstract

In formal language theory, finite-state transducers are well-know models for simple "input-output" mappings between two languages. Even if more powerful, recursive models can be used to account for more complex mappings, it has been argued that the input-output relations underlying most usual natural language pairs can essentially be modeled by finite-state devices. Moreover, the relative simplicity of these mappings has recently led to the development of techniques for learning finite-state transducers from a training set of input-output sentence pairs of the languages considered. In the last years, these techniques have lead to the development of a number of machine translation systems. Under the statistical statement of machine translation, we overview here how modeling, learning and search problems can be solved by using stochastic finite-state transducers. We also review the results achieved by the systems we have developed under this paradigm. As a main conclusion of this review we argue that, as task complexity and training data scarcity increase, those systems which rely more on statistical techniques tend produce the best results.