Multiple comparison procedures
Multiple comparison procedures
Toward efficient agnostic learning
COLT '92 Proceedings of the fifth annual workshop on Computational learning theory
Decision theoretic generalizations of the PAC model for neural net and other learning applications
Information and Computation
Improving learning performance through rational resource allocation
AAAI '94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 1)
Machine Learning
Mining high-speed data streams
Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining
Accurate decision trees for mining high-speed data streams
Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
Proceedings of the 2004 ACM symposium on Applied computing
Decision trees for mining data streams
Intelligent Data Analysis
Learning Model Trees from Data Streams
DS '08 Proceedings of the 11th International Conference on Discovery Science
Tree induction over perennial objects
SSDBM'10 Proceedings of the 22nd international conference on Scientific and statistical database management
Learning model trees from evolving data streams
Data Mining and Knowledge Discovery
Hi-index | 0.01 |
This article advocates a new model for inductive learning. Called sequential induction, it helps bridge classical fixed-sample learning techniques (which are efficient but difficult to formally characterize), and worst-case approaches (which provide strong statistical guarantees but are too inefficient for practical use). Learning proceeds as a sequence of decisions which are informed by training data. By analyzing induction at the level of these decisions, and by utilizing the only enough data to make each decision, sequential induction provides statistical guarantees but with substantially less data than worst-case methods require. The sequential inductive model is also useful as a method for determining a sufficient sample size for inductive learning and as such, is relevant to learning problems where the preponderance of data or the cost of gathering data precludes the use of traditional methods.