Data mining tasks and methods: Classification: classification methodology

  • Authors:
  • Hans-Hermann Bock

  • Affiliations:
  • Professor of Applied Statistics, Technical University of Aachen, Germany

  • Venue:
  • Handbook of data mining and knowledge discovery
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

This article surveys a range of classification (discrimination) methods, which are based on probabilistic or statistical models such as Bayes methods, maximum likelihood, nearest neighbor classifiers, nonparametric kernel density methods, plug-in rules, and so on. Additionally, we point to various algorithmic approaches for classification such as neural networks, support-vector machines, and decision trees which are, however, fully discussed in subsequent sections of this handbook. A major part of this article is devoted to the specification and estimation of various types of recovery rates and misclassification probabilities of a (fixed or data dependent) classifier. Finally, we describe some preprocessing methods for the selection of most informative variables.