Using small random samples for the manual evaluation of statistical association measures

  • Authors:
  • Stefan Evert;Brigitte Krenn

  • Affiliations:
  • IMS, University of Stuttgart, Azenbergstr. 12, 70174 Stuttgart, Germany;ÖFAI, Freyung 6/6, A-1010 Vienna, Austria

  • Venue:
  • Computer Speech and Language
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we describe the empirical evaluation of statistical association measures for the extraction of lexical collocations from text corpora. We argue that the results of an evaluation experiment cannot easily be generalized to a different setting. Consequently, such experiments have to be carried out under conditions that are as similar as possible to the intended use of the measures. Finally, we show how an evaluation strategy based on random samples can reduce the amount of manual annotation work significantly, making it possible to perform many more evaluation experiments under specific conditions.