Creating a multilingual collocation dictionary from large text corpora

  • Authors:
  • Luka Nerima;Violeta Seretan;Eric Wehrli

  • Affiliations:
  • University of Geneva, Switzerland;University of Geneva, Switzerland;University of Geneva, Switzerland

  • Venue:
  • EACL '03 Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics - Volume 2
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper describes a system of terminological extraction capable of handling multi-word expressions, using a powerful syntactic parser. The system includes a concordancing tool enabling the user to display the context of the collocation, i.e. the sentence or the whole document where the collocation occurs. Since the corpora are multilingual, the system also offers an alignment mechanism for the corresponding translated documents.