Class Decomposition via Clustering: A New Framework for Low-Variance Classifiers

  • Authors:
  • Ricardo Vilalta;Murali-Krishna Achari;Christoph F. Eick

  • Affiliations:
  • -;-;-

  • Venue:
  • ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

We propose a pre-processing step to classification thatapplies a clustering algorithm to the training set to discoverlocal patterns in the attribute or input space. Wedemonstrate how this knowledge can be exploited to enhancethe predictive accuracy of simple classifiers. Our focusis mainly on classifiers characterized by high bias butlow variance (e.g., linear classifiers); these classifiers experiencedifficulty in delineating class boundaries over theinput space when a class distributes in complex ways. Decomposingclasses into clusters makes the new class distributioneasier to approximate and provides a viable way toreduce bias while limiting the growth in variance. Experimentalresults on real-world domains show an advantagein predictive accuracy when clustering is used as a pre-processingstep to classification.