Knowledge evaluation: statistical evaluations

  • Authors:
  • David D. Jensen

  • Affiliations:
  • Research Assistant, Professor of Computer Science, University of Massachusetts, Amherst

  • Venue:
  • Handbook of data mining and knowledge discovery
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

This article introduces basic features of error estimators, including bias, variance, and loss functions. It outlines the logic behind classical hypothesis tests and explains the special challenges faced by knowledge discovery algorithms that search large model spaces. It discusses the statistical effects of multiple comparison procedures (MCPs), and several methods to adjust for those effects, including mathematical adjustments, cross-validation, and randomization tests. Finally, it outlines the basic concepts behind overfitting reduction and pruning.