Pattern matching-based system for machine translation (MT)

Authors:
George Tambouratzis;Sokratis Sofianopoulos;Vassiliki Spilioti;Marina Vassiliou;Olga Yannoutsou;Stella Markantonatou
Affiliations:
Institute for Language and Speech Processing, Athens, Greece;Institute for Language and Speech Processing, Athens, Greece;Institute for Language and Speech Processing, Athens, Greece;Institute for Language and Speech Processing, Athens, Greece;Institute for Language and Speech Processing, Athens, Greece;Institute for Language and Speech Processing, Athens, Greece
Venue:
SETN'06 Proceedings of the 4th Helenic conference on Advances in Artificial Intelligence
Year:
2006

Citing 5
Cited 2

A framework of a mechanical translation between Japanese and English by analogy principle

Proc. of the international NATO symposium on Artificial and human intelligence
A statistical approach to machine translation

Computational Linguistics
Translating with Scarce Resources

Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on Innovative Applications of Artificial Intelligence
Example retrieval from a translation memory

Natural Language Engineering
BLEU: a method for automatic evaluation of machine translation

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics

METIS-II: low resource machine translation

Machine Translation
Multi-objective optimisation of real-valued parameters of a hybrid MT system using Genetic Algorithms

Pattern Recognition Letters

Quantified Score

Hi-index	0.00

Visualization

Abstract

The innovative feature of the system presented in this paper is the use of pattern-matching techniques to retrieve translations resulting in a flexible, language-independent approach, which employs a limited amount of explicit a priori linguistic knowledge. Furthermore, while all state-of-the-art corpus-based approaches to Machine Translation (MT) rely on bitexts, this system relies on extensive target language monolingual corpora. The translation process distinguishes three phases: 1) pre-processing with ‘light' rule and statisticsbased NLP techniques 2) search & retrieval, 3) synthesising. At Phase 1, the source language sentence is mapped onto a lemma-to-lemma translated string. This string then forms the input to the search algorithm, which retrieves similar sentences from the corpus (Phase 2). This retrieval process is performed iteratively at increasing levels of detail, until the best match is detected. The best retrieved sentence is sent to the synthesising algorithm (Phase 3), which handles phenomena such as agreement.