French-English terminology extraction from comparable corpora

  • Authors:
  • Béatrice Daille; Emmanuel Morin

  • Affiliations:
  • LINA – FRE CNRS 2729, University of Nantes, Nantes Cedex 3, France; LINA – FRE CNRS 2729, University of Nantes, Nantes Cedex 3, France

  • Venue:
  • IJCNLP'05: Proceedings of the Second International Joint Conference on Natural Language Processing
  • Year:
  • 2005

Abstract

This article presents a method for extracting bilingual lexica composed of single-word terms (SWTs) and multi-word terms (MWTs) from comparable corpora of a technical domain. The method first extracts MWTs in each language and then uses statistical methods to align single words and MWTs by exploiting the term contexts. After explaining the difficulties involved in aligning MWTs and specifying our approach, we show the adopted process for bilingual terminology extraction and the resources used in our experiments. Finally, we evaluate our approach and demonstrate its significance, particularly in relation to non-compositional MWT alignment.
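
The abstract does not say which statistical method is used for alignment. As a rough illustration of what "exploiting the term contexts" can look like, the sketch below assumes a common context-vector scheme: count the words around each term, translate the source-side counts through a small seed dictionary, and rank target terms by cosine similarity. All function names, the window size, the seed dictionary, and the toy corpora are hypothetical and are not taken from the paper. MWTs are assumed to have been extracted already and joined into single tokens.

```python
from collections import Counter
from math import sqrt

def context_vector(term, sentences, window=2):
    """Count the words occurring within `window` tokens of `term`."""
    counts = Counter()
    for tokens in sentences:
        for i, tok in enumerate(tokens):
            if tok == term:
                lo, hi = max(0, i - window), i + window + 1
                counts.update(
                    t for j, t in enumerate(tokens[lo:hi], lo) if j != i
                )
    return counts

def translate_vector(vec, seed_dict):
    """Map source context words to target words; unknown words are dropped."""
    out = Counter()
    for word, n in vec.items():
        for translation in seed_dict.get(word, ()):
            out[translation] += n
    return out

def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def rank_candidates(src_term, src_sents, tgt_terms, tgt_sents, seed_dict):
    """Rank target terms by context similarity to the source term."""
    src_vec = translate_vector(context_vector(src_term, src_sents), seed_dict)
    scored = [
        (cosine(src_vec, context_vector(t, tgt_sents)), t) for t in tgt_terms
    ]
    return sorted(scored, reverse=True)

# Toy comparable corpora (hypothetical); the French MWT is pre-joined.
fr_sents = [["cuire", "pomme_de_terre", "eau"],
            ["éplucher", "pomme_de_terre", "couteau"]]
en_sents = [["boil", "potato", "water"],
            ["peel", "potato", "knife"],
            ["sharpen", "knife", "stone"]]
seed = {"cuire": ["boil"], "eau": ["water"],
        "éplucher": ["peel"], "couteau": ["knife"]}

ranked = rank_candidates("pomme_de_terre", fr_sents,
                         ["potato", "knife"], en_sents, seed)
print(ranked[0][1])  # best-scoring English candidate
```

Because the French term is matched through its contexts and not word by word, a non-compositional MWT such as "pomme_de_terre" can still be paired with a single-word candidate, which is the kind of case the abstract highlights.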