Improvements in automatic thesaurus extraction

  • Authors:
  • James R. Curran;Marc Moens

  • Affiliations:
  • University of Edinburgh, Edinburgh, United Kingdom;University of Edinburgh, Edinburgh, United Kingdom

  • Venue:
  • ULA '02 Proceedings of the ACL-02 workshop on Unsupervised lexical acquisition - Volume 9
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

The use of semantic resources is common in modern NLP systems, but methods to extract lexical semantics have only recently begun to perform well enough for practical use. We evaluate existing and new similarity metrics for thesaurus extraction, and experiment with the trade-off between extraction performance and efficiency. We propose an approximation algorithm, based on canonical attributes and coarse- and fine-grained matching, that reduces the time complexity and execution time of thesaurus extraction with only a marginal performance penalty.