Syntactic structure transfer in a tamil to hindi MT system – a hybrid approach

  • Authors:
  • Sobha Lalitha Devi;Vijay Sundar Ram R;Pravin Pralayankar;Bakiyavathi T

  • Affiliations:
  • AU-KBC Research Centre, MIT Campus of Anna University, Chennai;AU-KBC Research Centre, MIT Campus of Anna University, Chennai;AU-KBC Research Centre, MIT Campus of Anna University, Chennai;AU-KBC Research Centre, MIT Campus of Anna University, Chennai

  • Venue:
  • CICLing'10 Proceedings of the 11th international conference on Computational Linguistics and Intelligent Text Processing
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

We describe the syntactic structure transfer, a central design question in machine translation, between two languages Tamil (source) and Hindi (target), belonging to two different language families, Dravidian and Indo-Aryan respectively. Tamil and Hindi differ extensively at the clausal construction level and transferring the structure is difficult. The syntactic structure transfer described here is a hybrid approach where we use CRFs for identifying the clause boundaries in the source language, Transformation Based Learning (TBL) for extracting the rules and use semantic classification of Postpositions (PSP) for choosing semantically appropriate structure in constructions where there are one to many mapping in the target language. We have evaluated the system using web data and the results are encouraging.