Using the distribution of performance for studying statistical NLP systems and corpora

  • Authors: Yuval Krymolowski

  • Affiliations: Bar-Ilan University, Ramat Gan, Israel

  • Venue: ELDS '01 Proceedings of the workshop on Evaluation for Language and Dialogue Systems - Volume 9
  • Year: 2001

Abstract

Statistical NLP systems are frequently evaluated and compared on the basis of their performance on a single split of training and test data. Results obtained from a single split are, however, subject to sampling noise. In this paper we argue in favour of reporting a distribution of performance figures, obtained by resampling the training data, rather than a single number. The additional information from distributions can be used to make statistically quantified statements about differences across parameter settings, systems, and corpora.
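
The idea of reporting a distribution rather than a single figure can be illustrated with a minimal sketch. The abstract does not specify the resampling scheme, so the example below assumes bootstrap resampling (sampling the training set with replacement) and a fixed test set; the toy threshold classifier, the synthetic data, and all function names (`bootstrap_scores`, `percentile_interval`, `fit_threshold`, `accuracy`, `make_data`) are hypothetical and are not taken from the paper.

```python
import random
import statistics


def bootstrap_scores(train, test, fit, score, n_resamples=200, seed=0):
    """Train on bootstrap resamples of `train`; return one test score per resample."""
    rng = random.Random(seed)
    scores = []
    for _ in range(n_resamples):
        # Draw len(train) items with replacement (assumed resampling scheme).
        sample = [rng.choice(train) for _ in train]
        model = fit(sample)
        scores.append(score(model, test))
    return scores


def percentile_interval(scores, lower=0.025, upper=0.975):
    """Empirical percentile interval of the score distribution."""
    ordered = sorted(scores)
    last = len(ordered) - 1
    return ordered[int(lower * last)], ordered[int(upper * last)]


# Hypothetical toy system: label x as 1 when it exceeds a learned threshold.
def fit_threshold(sample):
    pos = [x for x, y in sample if y == 1]
    neg = [x for x, y in sample if y == 0]
    if not pos or not neg:
        return 0.5  # fallback when a resample contains only one class
    return (statistics.mean(pos) + statistics.mean(neg)) / 2


def accuracy(threshold, data):
    return sum((x > threshold) == (y == 1) for x, y in data) / len(data)


def make_data(n, rng):
    """Synthetic data: two overlapping Gaussian classes (illustration only)."""
    data = []
    for _ in range(n):
        y = rng.randint(0, 1)
        x = rng.gauss(0.7 if y else 0.3, 0.2)
        data.append((x, y))
    return data


data_rng = random.Random(42)
train = make_data(100, data_rng)
test = make_data(200, data_rng)

scores = bootstrap_scores(train, test, fit_threshold, accuracy)
low, high = percentile_interval(scores)
print(
    f"mean={statistics.mean(scores):.3f} "
    f"sd={statistics.stdev(scores):.3f} "
    f"95% interval=({low:.3f}, {high:.3f})"
)
```

Under the same assumptions, two systems or parameter settings could be compared by running both on identical resamples and examining the distribution of their score differences, rather than comparing two single numbers.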