Word alignment between chinese and japanese using maximum weight matching on bipartite graph

  • Authors:
  • Honglin Wu;Shaoming Liu

  • Affiliations:
  • Natural Language Processing Lab, Institute of Software and Theory, Northeastern, University, Shenyang, China;Corporate Research Group, Fuji Xerox, Co., Ltd., Kanagawa, Japan

  • Venue:
  • ICCPOL'06 Proceedings of the 21st international conference on Computer Processing of Oriental Languages: beyond the orient: the research challenges ahead
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

The word-aligned bilingual corpus is an important knowledge source for many tasks in NLP especially in machine translation. Among the existing word alignment methods, the unknown word problem, the synonym problem and the global optimization problem are very important factors impacting the recall and precision of alignment results. In this paper, we proposed a word alignment model between Chinese and Japanese which measures similarity in terms of morphological similarity, semantic distance, part of speech and co-occurrence, and matches words by maximum weight matching on bipartite graph. The model can partly solve the problems mentioned above. The model was proved to be effective by experiments. It achieved 80% as F-Score than 72% of GIZA++.