A linguistically grounded graph model for bilingual lexicon extraction

  • Authors:
  • Florian Laws;Lukas Michelbacher;Beate Dorow;Christian Scheible;Ulrich Heid;Hinrich Schütze

  • Affiliations:
  • Universität Stuttgart;Universität Stuttgart;Universität Stuttgart;Universität Stuttgart;Universität Stuttgart;Universität Stuttgart

  • Venue:
  • COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
  • Year:
  • 2010

Abstract

We present a new method, based on graph theory, for bilingual lexicon extraction without relying on resources with limited availability like parallel corpora. The graphs we use represent linguistic relations between words such as adjectival modification. We experiment with a number of ways of combining different linguistic relations and present a novel method, multi-edge extraction (MEE), that is both modular and scalable. We evaluate MEE on adjectives, verbs and nouns and show that it is superior to cooccurrence-based extraction (which does not use linguistic analysis). Finally, we publish a reproducible baseline to establish an evaluation benchmark for bilingual lexicon extraction.