Decision trees capacity and probability of misclassification

  • Authors:
  • Victor Nedel'ko

  • Affiliations:
  • Laboratory of Data Analysis, Institute of Mathematics SB RAS, Novosibirsk, Russia

  • Venue:
  • AIS-ADM 2005 Proceedings of the 2005 international conference on Autonomous Intelligent Systems: agents and Data Mining
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

Deciding functions statistical robustness problem is considered. The goal is to estimate misclassification probability for decision function by training sample. The paper contains results of investigation an empirical risk bias for decision tree classifier in comparison with a linear classifiers and with exact bias estimates for a discrete case. This allows to find out how far Vapnik–Chervonenkis risk estimates are off for considered decision function classes and to choose optimal complexity parameters for constructed decision functions. For heuristic algorithms those do not perform exhaustive search there was proposed a method for estimating an effective capacity.