IEEE Transactions on Knowledge and Data Engineering
The VLDB Journal — The International Journal on Very Large Data Bases
Induction of multiclass multifeature split decision trees from distributed data
Pattern Recognition
PLANET: massively parallel learning of tree ensembles with MapReduce
Proceedings of the VLDB Endowment
CTS'05 Proceedings of the 2005 international conference on Collaborative technologies and systems
Distributed threshold querying of general functions by a difference of monotonic representation
Proceedings of the VLDB Endowment
Intelligent Decision Technologies
Distributed learning with data reduction
Transactions on computational collective intelligence IV
Implementation of a distributed data mining system
ICCS'05 Proceedings of the 5th international conference on Computational Science - Volume Part III
Performance evaluation of an agent based distributed data mining system
AI'05 Proceedings of the 18th Canadian Society conference on Advances in Artificial Intelligence
Distributed and Parallel Databases
Hi-index | 0.00 |
We present an algorithm designed to efficiently construct a decision tree over heterogeneously distributed data without centralizing. We compare our algorithm against a standard centralized decision tree implementation in terms of accuracy as well as the communication complexity. Our experimental results show that by using only 20% of the communication cost necessary to centralize the data we can achieve trees with accuracy at least 80% of the trees produced by the centralized version.