Reviewing and Evaluating Automatic Term Recognition Techniques

  • Authors:
  • Ioannis Korkontzelos;Ioannis P. Klapaftis;Suresh Manandhar

  • Affiliations:
  • Department of Computer Science, The University of York, Heslington, York, UK YO10 5NG;Department of Computer Science, The University of York, Heslington, York, UK YO10 5NG;Department of Computer Science, The University of York, Heslington, York, UK YO10 5NG

  • Venue:
  • GoTAL '08 Proceedings of the 6th international conference on Advances in Natural Language Processing
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Automatic Term Recognition (ATR) is defined as the task of identifying domain specific terms from technical corpora. Termhood-basedapproaches measure the degree that a candidate term refers to a domain specific concept. Unithood-basedapproaches measure the attachment strength of a candidate term constituents. These methods have been evaluated using different, often incompatible evaluation schemes and datasets. This paper provides an overview and a thorough evaluation of state-of-the-art ATRmethods, under a common evaluation framework, i.e. corpora and evaluation method. Our contributions are two-fold: (1) We compare a number of different ATRmethods, showing that termhood-basedmethods achieve in general superior performance. (2) We show that the number of independent occurrences of a candidate term is the most effective source for estimating term nestedness, improving ATRperformance.