C4.5: programs for machine learning
C4.5: programs for machine learning
Randomized algorithms
Efficient incremental induction of decision trees
Machine Learning
Decision Tree Induction Based on Efficient Tree Restructuring
Machine Learning
Mining high-speed data streams
Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining
An Analysis of Functional Trees
ICML '02 Proceedings of the Nineteenth International Conference on Machine Learning
The Journal of Machine Learning Research
Accurate decision trees for mining high-speed data streams
Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
AAAI'96 Proceedings of the thirteenth national conference on Artificial intelligence - Volume 1
Incremental rule learning based on example nearness from numerical data streams
Proceedings of the 2005 ACM symposium on Applied computing
Learning decision trees from dynamic data streams
Proceedings of the 2005 ACM symposium on Applied computing
Data streams classification by incremental rule learning with parameterized generalization
Proceedings of the 2006 ACM symposium on Applied computing
Evaluating algorithms that learn from data streams
Proceedings of the 2009 ACM symposium on Applied Computing
Issues in evaluation of stream learning algorithms
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Ambiguous decision trees for mining concept-drifting data streams
Pattern Recognition Letters
Improving Adaptive Bagging Methods for Evolving Data Streams
ACML '09 Proceedings of the 1st Asian Conference on Machine Learning: Advances in Machine Learning
Adaptive Stream Mining: Pattern Learning and Mining from Evolving Data Streams
Proceedings of the 2010 conference on Adaptive Stream Mining: Pattern Learning and Mining from Evolving Data Streams
Handling numeric attributes in hoeffding trees
PAKDD'08 Proceedings of the 12th Pacific-Asia conference on Advances in knowledge discovery and data mining
Learning model trees from evolving data streams
Data Mining and Knowledge Discovery
Classification model for data streams based on similarity
IEA/AIE'11 Proceedings of the 24th international conference on Industrial engineering and other applications of applied intelligent systems conference on Modern approaches in applied intelligence - Volume Part I
Stress-testing hoeffding trees
PKDD'05 Proceedings of the 9th European conference on Principles and Practice of Knowledge Discovery in Databases
A new method of mining data streams using harmony search
Journal of Intelligent Information Systems
Hi-index | 0.00 |
This paper presents an hybrid adaptive system for induction of forest of trees from data streams. The Ultra Fast Forest Tree system (UFFT) is an incremental algorithm, with constant time for processing each example, works online, and uses the Hoeffding bound to decide when to install a splitting test in a leaf leading to a decision node. Our system has been designed for continuous data. It uses analytical techniques to choose the splitting criteria, and the information gain to estimate the merit of each possible splitting-test. The number of examples required to evaluate the splitting criteria is sound, based on the Hoeffding bound. For multiclass problems, the algorithm builds a binary tree for each possible pair of classes, leading to a forest of trees. During the training phase the algorithm maintains a short term memory. Given a data stream, a fixed number of the most recent examples are maintained in a data-structure that supports constant time insertion and deletion. When a test is installed, a leaf is transformed into a decision node with two descendant leaves. The sufficient statistics of these leaves are initialized with the examples in the short term memory that will fall at these leaves. We study the behavior of UFFT in different problems. The experimental results shows that UFFT is competitive against a batch decision tree learner in large and medium datasets.