Communication Efficient Construction of Decision Trees Over Heterogeneously Distributed Data

  • Authors:
  • Chris Giannella;Kun Liu;Todd Olsen;Hillol Kargupta

  • Affiliations:
  • University of Maryland Baltimore County, Baltimore, MD;University of Maryland Baltimore County, Baltimore, MD;University of Maryland Baltimore County, Baltimore, MD;University of Maryland Baltimore County, Baltimore, MD

  • Venue:
  • ICDM '04 Proceedings of the Fourth IEEE International Conference on Data Mining
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present an algorithm designed to efficiently construct a decision tree over heterogeneously distributed data without centralizing. We compare our algorithm against a standard centralized decision tree implementation in terms of accuracy as well as the communication complexity. Our experimental results show that by using only 20% of the communication cost necessary to centralize the data we can achieve trees with accuracy at least 80% of the trees produced by the centralized version.