Identifying idiomatic expressions using phrase alignments in bilingual parallel corpus

  • Authors:
  • Hyoung-Gyu Lee;Min-Jeong Kim;Gumwon Hong;Sang-Bum Kim;Young-Sook Hwang;Hae-Chang Rim

  • Affiliations:
  • Department of Computer and Radio Communications Engineering, Korea University, Seoul, Korea;Department of Computer and Radio Communications Engineering, Korea University, Seoul, Korea;Department of Computer and Radio Communications Engineering, Korea University, Seoul, Korea;Convergence Technology Center, SK Telecom;Convergence Technology Center, SK Telecom;Department of Computer and Radio Communications Engineering, Korea University, Seoul, Korea

  • Venue:
  • PRICAI'10 Proceedings of the 11th Pacific Rim international conference on Trends in artificial intelligence
  • Year:
  • 2010

Abstract

Previous efforts to identify idiomatic expressions in a bilingual parallel corpus have relied on word alignments to capture the sense of individual words. In this paper, we propose a method that uses phrase alignments rather than word alignments in a parallel corpus, so that the sense of phrases as well as words can be recognized. Our proposed scoring functions are based on the difference in translation tendency between a phrase and its individual words. They help identify idiomatic expressions by means of an entropy variation and a translation difference between a phrase and its individual words. Experimental results show that the proposed method is more effective than previous approaches at identifying idiomatic expressions. In addition, we show that linguistic constraints can be integrated into our method to improve performance.
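The abstract names two signals, an entropy variation and a translation difference, but does not give their formulas. The Python sketch below is only one plausible reading, not the authors' scoring functions. It assumes translation counts for a phrase and for each of its words have already been extracted from phrase alignments. The function names, the choice of mean word entropy, the token-overlap test, and the toy counts are all hypothetical and chosen for illustration.

```python
import math
from collections import Counter


def translation_entropy(counts):
    """Shannon entropy (in bits) of a translation count distribution."""
    total = sum(counts.values())
    if total == 0:
        return 0.0
    return -sum((c / total) * math.log2(c / total)
                for c in counts.values() if c > 0)


def entropy_variation(phrase_counts, word_counts_list):
    """Assumed definition: mean word-level entropy minus phrase-level entropy.

    A large positive value means the phrase is translated far more
    consistently than its component words are on their own.
    """
    mean_word = sum(translation_entropy(c) for c in word_counts_list)
    mean_word /= len(word_counts_list)
    return mean_word - translation_entropy(phrase_counts)


def translation_difference(phrase_counts, word_counts_list):
    """Assumed definition: share of the phrase's translation mass whose
    target tokens never occur among the translations of its words."""
    word_tokens = set()
    for counts in word_counts_list:
        for target in counts:
            word_tokens.update(target.split())
    total = sum(phrase_counts.values())
    if total == 0:
        return 0.0
    uncovered = sum(n for target, n in phrase_counts.items()
                    if not (set(target.split()) & word_tokens))
    return uncovered / total


# Toy counts, invented for illustration only (not taken from any corpus).
idiom_phrase = Counter({"die": 9, "pass away": 1})        # "kick the bucket"
idiom_words = [Counter({"strike": 5, "hit": 5}),          # "kick"
               Counter({"pail": 8, "container": 2})]      # "bucket"

literal_phrase = Counter({"red car": 10})                 # "red car"
literal_words = [Counter({"red": 10}), Counter({"car": 10})]

idiom_variation = entropy_variation(idiom_phrase, idiom_words)
idiom_difference = translation_difference(idiom_phrase, idiom_words)
literal_difference = translation_difference(literal_phrase, literal_words)
```

Under these assumptions, a phrase whose translations are both concentrated and lexically unrelated to the translations of its words would rank as a stronger idiom candidate. How the two signals are actually combined or thresholded is not stated in the abstract.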