Cost-benefit analysis of two-stage conditional random fields based English-to-Chinese machine transliteration

  • Authors:
  • Chan-Hung Kuo;Shih-Hung Liu;MikeTian-Jian Jiang;Cheng-Wei Lee;Wen-Lian Hsu

  • Affiliations:
  • Institute of Information Science, Academia Sinica;Institute of Information Science, Academia Sinica and National Taiwan University;Institute of Information Science, Academia Sinica and National Tsing Hua University;Institute of Information Science, Academia Sinica;Institute of Information Science, Academia Sinica

  • Venue:
  • NEWS '12 Proceedings of the 4th Named Entity Workshop
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

This work presents an English-to-Chinese (E2C) machine transliteration system based on two-stage conditional random fields (CRF) models with accessor variety (AV) as an additional feature to approximate local context of the source language. Experiment results show that two-stage CRF method outperforms the one-stage opponent since the former costs less to encode more features and finer grained labels than the latter.