Practical approximation of optimal multivariate discretization

  • Authors:
  • Tapio Elomaa;Jussi Kujala;Juho Rousu

  • Affiliations:
  • Institute of Software Systems, Tampere University of Technology, Finland;Institute of Software Systems, Tampere University of Technology, Finland;Department of Computer Science, University of Helsinki, Finland

  • Venue:
  • ISMIS'06 Proceedings of the 16th international conference on Foundations of Intelligent Systems
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Discretization of the value range of a numerical feature is a common task in data mining and machine learning. Optimal multivariate discretization is in general computationally intractable. We have proposed approximation algorithms with performance guarantees for training error minimization by axis-parallel hyperplanes. This work studies their efficiency and practicability. We give efficient implementations to both greedy set covering and linear programming approximation of optimal multivariate discretization. We also contrast the algorithms empirically to an efficient heuristic discretization method.