On the Algorithmic Implementation of Stochastic Discrimination

Authors:
Eugene M. Kleinberg
Affiliations:
State Univ. of New York, Buffalo
Venue:
IEEE Transactions on Pattern Analysis and Machine Intelligence
Year:
2000

Citing 8
Cited 16

C4.5: programs for machine learning

C4.5: programs for machine learning
Machine learning, neural and statistical classification

Machine learning, neural and statistical classification
An alternative method of stochastic discrimination with applications to pattern recognition

An alternative method of stochastic discrimination with applications to pattern recognition
Bagging predictors

Machine Learning
A decision-theoretic generalization of on-line learning and an application to boosting

Journal of Computer and System Sciences - Special issue: 26th annual ACM symposium on the theory of computing & STOC'94, May 23–25, 1994, and second annual Europe an conference on computational learning theory (EuroCOLT'95), March 13–15, 1995
The Random Subspace Method for Constructing Decision Forests

IEEE Transactions on Pattern Analysis and Machine Intelligence
Random decision forests

ICDAR '95 Proceedings of the Third International Conference on Document Analysis and Recognition (Volume 1) - Volume 1
Estimation of Dependences Based on Empirical Data: Springer Series in Statistics (Springer Series in Statistics)

Estimation of Dependences Based on Empirical Data: Springer Series in Statistics (Springer Series in Statistics)

Random Forests

Machine Learning
Fields as limit functions of stochastic discrimination and their adaptability

Neural Computation
Ensembles of Learning Machines

WIRN VIETRI 2002 Proceedings of the 13th Italian Workshop on Neural Nets-Revised Papers
A Mathematically Rigorous Foundation for Supervised Learning

MCS '00 Proceedings of the First International Workshop on Multiple Classifier Systems
Averaging Weak Classifiers

MCS '01 Proceedings of the Second International Workshop on Multiple Classifier Systems
Immune network based ensembles

Neurocomputing
Nonlinear Boosting Projections for Ensemble Construction

The Journal of Machine Learning Research
A Multi-Level Probabilistic Neural Network

ISNN '07 Proceedings of the 4th international symposium on Neural Networks: Part II--Advances in Neural Networks
Boosting random subspace method

Neural Networks
Classification of DNA microarray data with Random Projection Ensembles of Polynomial SVMs

Proceedings of the 2009 conference on New Directions in Neural Networks: 18th Italian Workshop on Neural Networks: WIRN 2008
Multiple Classifier Systems for Adversarial Classification Tasks

MCS '09 Proceedings of the 8th International Workshop on Multiple Classifier Systems
Constructing ensembles of classifiers by means of weighted instance selection

IEEE Transactions on Neural Networks
Exploiting propositionalization based on random relational rules for semi-supervised learning

PAKDD'08 Proceedings of the 12th Pacific-Asia conference on Advances in knowledge discovery and data mining
Learning with ensembles of randomized trees: new insights

ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part III
Symmetries from uniform space covering in stochastic discrimination

SSPR'06/SPR'06 Proceedings of the 2006 joint IAPR international conference on Structural, Syntactic, and Statistical Pattern Recognition
The use of artificial-intelligence-based ensembles for intrusion detection: a review

Applied Computational Intelligence and Soft Computing

Quantified Score

Hi-index	0.14

Visualization

Abstract

Stochastic discrimination is a general methodology for constructing classifiers appropriate for pattern recognition. It is based on combining arbitrary numbers of very weak components, which are usually generated by some pseudorandom process, and it has the property that the very complex and accurate classifiers produced in this way retain the ability, characteristic of their weak component pieces, to generalize to new data. In fact, it is often observed, in practice, that classifier performance on test sets continues to rise as more weak components are added, even after performance on training sets seems to have reached a maximum. This is predicted by the underlying theory, for even though the formal error rate on the training set may have reached a minimum, more sophisticated measures intrinsic to this method indicate that classifier performance on both training and test sets continues to improve as complexity increases. In this paper, we begin with a review of the method of stochastic discrimination as applied to pattern recognition. Through a progression of examples keyed to various theoretical issues, we discuss considerations involved with its algorithmic implementation. We then take such an algorithmic implementation and compare its performance, on a large set of standardized pattern recognition problems from the University of California Irvine, and Statlog collections, to many other techniques reported on in the literature, including boosting and bagging. In doing these studies, we compare our results to those reported in the literature by the various authors for the other methods, using the same data and study paradigms used by them. Included in this paper is an outline of the underlying mathematical theory of stochastic discrimination and a remark concerning boosting, which provides a theoretical justification for properties of that method observed in practice, including its ability to generalize.