Language independent transliteration mining system using finite state automata framework

  • Authors:
  • Sara Noeman;Amgad Madkour

  • Affiliations:
  • IBM Cairo Technology Development Center, Giza, Egypt;IBM Cairo Technology Development Center, Giza, Egypt

  • Venue:
  • NEWS '10 Proceedings of the 2010 Named Entities Workshop
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

We propose a Named Entities transliteration mining system using Finite State Automata (FSA). We compare the proposed approach with a baseline system that utilizes the Editex technique to measure the length-normalized phonetic based edit distance between the two words. We submitted three standard runs in NEWS2010 shared task and ranked first for English to Arabic (WM-EnAr) and obtained an F-measure of 0.915, 0.903, and 0.874 respectively.