On the robustness of entropy-based similarity measures in evaluation of subcategorization acquisition systems

  • Authors:
  • Anna Korhonen; Yuval Krymolowski

  • Affiliations:
  • University of Cambridge, Cambridge, UK; Bar-Ilan University, Ramat Gan, Israel

  • Venue:
  • COLING-02 proceedings of the 6th conference on Natural language learning - Volume 20
  • Year:
  • 2002

Abstract

Some statistical learning systems are evaluated using measures of distributional similarity. To deal with the problem of zero events in the distributions under comparison, smoothing is frequently performed before similarity measures are applied. Smoothing alters the information in the original distribution, and may add noise to the results. Here, we investigate the sensitivity of entropy-based similarity measures to noise from uninformative smoothing. Our experiments with two subcategorization acquisition systems show that similarity measures vary in their robustness. While some are led astray by noise from smoothing, others are more resilient.
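
The zero-event problem described in the abstract can be illustrated with a minimal sketch. The abstract does not say which measures or smoothing method the authors used, so everything here is an assumption: Kullback-Leibler (KL) and Jensen-Shannon (JS) divergence stand in for entropy-based similarity measures, linear interpolation with the uniform distribution stands in for uninformative smoothing, and the function names and example distributions are hypothetical.

```python
import math


def smooth_uniform(p, lam):
    """Interpolate p with the uniform distribution (assumed smoothing scheme).

    lam is the weight given to the uniform distribution, 0 <= lam <= 1.
    """
    n = len(p)
    return [(1 - lam) * x + lam / n for x in p]


def kl_divergence(p, q):
    """KL(p || q) in bits. Terms with p_i == 0 contribute nothing.

    Raises ValueError when q_i == 0 but p_i > 0 (the zero-event problem).
    """
    total = 0.0
    for pi, qi in zip(p, q):
        if pi > 0:
            if qi == 0:
                raise ValueError("KL divergence undefined: zero event in q")
            total += pi * math.log2(pi / qi)
    return total


def js_divergence(p, q):
    """Jensen-Shannon divergence in bits; defined even when p or q has zeros."""
    m = [(pi + qi) / 2 for pi, qi in zip(p, q)]
    return 0.5 * kl_divergence(p, m) + 0.5 * kl_divergence(q, m)


# Hypothetical frame distributions for one verb (illustrative values only).
gold = [0.5, 0.3, 0.2, 0.0]
acquired = [0.6, 0.4, 0.0, 0.0]

# JS needs no smoothing; KL needs the acquired distribution to be smoothed.
print("JS (unsmoothed):", js_divergence(gold, acquired))
for lam in (0.01, 0.1, 0.5):
    smoothed = smooth_uniform(acquired, lam)
    print(f"lam={lam}: KL={kl_divergence(gold, smoothed):.4f}")
```

In this sketch the KL value depends on the chosen smoothing weight `lam`, while JS is computed directly on the unsmoothed distributions. This is one way a measure's result can reflect the smoothing rather than the distributions being compared.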