Machine Learning
Mining high-speed data streams
Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining
Database Mining: A Performance Perspective
IEEE Transactions on Knowledge and Data Engineering
Option Decision Trees with Majority Votes
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Learning probabilistic relational concept descriptions
Learning probabilistic relational concept descriptions
Accurate decision trees for mining high-speed data streams
Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
Stress-testing hoeffding trees
PKDD'05 Proceedings of the 9th European conference on Principles and Practice of Knowledge Discovery in Databases
New ensemble methods for evolving data streams
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Adaptive Stream Mining: Pattern Learning and Mining from Evolving Data Streams
Proceedings of the 2010 conference on Adaptive Stream Mining: Pattern Learning and Mining from Evolving Data Streams
Controlled permutations for testing adaptive classifiers
DS'11 Proceedings of the 14th international conference on Discovery science
Mining uncertain data streams using clustering feature decision trees
ADMA'11 Proceedings of the 7th international conference on Advanced Data Mining and Applications - Volume Part II
MCBR-CDS'11 Proceedings of the Second MICCAI international conference on Medical Content-Based Retrieval for Clinical Decision Support
Learning in non-stationary environments with class imbalance
Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
Incrementally optimized decision tree for noisy big data
Proceedings of the 1st International Workshop on Big Data, Streams and Heterogeneous Source Mining: Algorithms, Systems, Programming Models and Applications
Incrementally optimized decision tree for noisy big data
Proceedings of the 1st International Workshop on Big Data, Streams and Heterogeneous Source Mining: Algorithms, Systems, Programming Models and Applications
Quality of Experience Models for Multimedia Streaming
International Journal of Mobile Computing and Multimedia Communications
The CART decision tree for mining data streams
Information Sciences: an International Journal
Hi-index | 0.00 |
Hoeffding trees are state-of-the-art for processing high-speed data streams. Their ingenuity stems from updating sufficient statistics, only addressing growth when decisions can be made that are guaranteed to be almost identical to those that would be made by conventional batch learning methods. Despite this guarantee, decisions are still subject to limited lookahead and stability issues. In this paper we explore Hoeffding Option Trees, a regular Hoeffding tree containing additional option nodes that allow several tests to be applied, leading to multiple Hoeffding trees as separate paths. We show how to control tree growth in order to generate a mixture of paths, and empirically determine a reasonable number of paths. We then empirically evaluate a spectrum of Hoeffding tree variations: single trees, option trees and bagged trees. Finally, we investigate pruning.We show that on some datasets a pruned option tree can be smaller and more accurate than a single tree.