Bilingual lexicon extraction from comparable corpora as metasearch

  • Authors:
  • Amir Hazem;Emmanuel Morin;Sebastian Peña Saldarriaga

  • Affiliations:
  • Université de Nantes, LINA - UMR CNRS, BP, Nantes Cedex;Université de Nantes, LINA - UMR CNRS, BP, Nantes Cedex;rue Notre-Dame Ouest, Montréal, Québec, Canada

  • Venue:
  • BUCC '11 Proceedings of the 4th Workshop on Building and Using Comparable Corpora: Comparable Corpora and the Web
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this article we present a novel way of looking at the problem of automatic acquisition of pairs of translationally equivalent words from comparable corpora. We first present the standard and extended approaches traditionally dedicated to this task. We then reinterpret the extended method, and motivate a novel model to reformulate this approach inspired by the metasearch engines in information retrieval. The empirical results show that performances of our model are always better than the baseline obtained with the extended approach and also competitive with the standard approach.