Term extraction + term clustering: an integrated platform for computer-aided terminology

  • Authors:
  • Didier Bourigault; Christian Jacquemin

  • Affiliations:
  • ERSS, UMR 5610 CNRS, Maison de la Recherche, Toulouse, France; LIMSI-CNRS, Orsay, France

  • Venue:
  • EACL '99: Proceedings of the Ninth Conference of the European Chapter of the Association for Computational Linguistics
  • Year:
  • 1999

Abstract

A novel technique for automatic thesaurus construction is proposed. It is based on the complementary use of two tools: (1) a Term Extraction tool that acquires term candidates from tagged corpora through a shallow grammar of noun phrases, and (2) a Term Clustering tool that groups syntactic variants (insertions). Experiments performed on corpora in three technical domains yield clusters of term candidates with precision rates between 93% and 98%.
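
The two-step pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' tools: the one-letter tag set (D, A, N, V), the English-style noun-phrase pattern `[AN]*N`, the definition of an insertion variant as "same first and last words with extra words in between", and the function names `extract_candidates`, `is_insertion_variant`, and `cluster` are all assumptions made for illustration.

```python
import re
from collections import defaultdict


def extract_candidates(tagged):
    """Step 1 (assumed shallow grammar): return (A|N)* N sequences of two or
    more words from a list of (word, tag) pairs with one-letter tags."""
    tags = "".join(tag for _, tag in tagged)
    candidates = []
    for m in re.finditer(r"[AN]*N", tags):
        if m.end() - m.start() >= 2:
            words = tagged[m.start():m.end()]
            candidates.append(tuple(w.lower() for w, _ in words))
    return candidates


def is_insertion_variant(base, variant):
    """Step 2 (assumed definition): variant keeps a prefix and a suffix of
    base, in order, with at least one extra word inserted between them."""
    if len(base) < 2 or len(variant) <= len(base):
        return False
    for i in range(1, len(base)):
        tail = len(base) - i
        if variant[:i] == base[:i] and variant[len(variant) - tail:] == base[i:]:
            return True
    return False


def cluster(terms):
    """Group terms linked by insertion variation (connected components)."""
    terms = sorted(set(terms))
    parent = {t: t for t in terms}

    def find(t):
        while parent[t] != t:
            parent[t] = parent[parent[t]]  # path halving
            t = parent[t]
        return t

    for a in terms:
        for b in terms:
            if is_insertion_variant(a, b):
                parent[find(b)] = find(a)

    groups = defaultdict(set)
    for t in terms:
        groups[find(t)].add(t)
    # Singletons are not clusters of variants, so they are dropped.
    return [g for g in groups.values() if len(g) > 1]


# Toy tagged corpus (invented for this example).
sentences = [
    [("the", "D"), ("blood", "N"), ("cell", "N"), ("was", "V"), ("stained", "V")],
    [("a", "D"), ("blood", "N"), ("mononuclear", "A"), ("cell", "N"), ("appeared", "V")],
    [("surface", "N"), ("tension", "N"), ("is", "V"), ("low", "A")],
]
candidates = [c for s in sentences for c in extract_candidates(s)]
clusters = cluster(candidates)
print(clusters)
```

On this toy corpus, "blood cell" and "blood mononuclear cell" fall into one cluster, while "surface tension" has no variant and is left out. A real system would need a proper part-of-speech tagger and a richer grammar than this single pattern.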