Approximating Decision Trees with Multiway Branches

  • Authors:
  • Venkatesan T. Chakaravarthy;Vinayaka Pandit;Sambuddha Roy;Yogish Sabharwal

  • Affiliations:
  • IBM India Research Lab, New Delhi, India;IBM India Research Lab, New Delhi, India;IBM India Research Lab, New Delhi, India;IBM India Research Lab, New Delhi, India

  • Venue:
  • ICALP '09 Proceedings of the 36th International Colloquium on Automata, Languages and Programming: Part I
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

We consider the problem of constructing decision trees for entity identification from a given table. The input is a table containing information about a set of entities over a fixed set of attributes. The goal is to construct a decision tree that identifies each entity unambiguously by testing the attribute values such that the average number of tests is minimized. The previously best known approximation ratio for this problem was O (log2 N ). In this paper, we present a new greedy heuristic that yields an improved approximation ratio of O (logN ).