Effects of Sample Size in Classifier Design. IEEE Transactions on Pattern Analysis and Machine Intelligence.
Estimation of Classifier Performance. IEEE Transactions on Pattern Analysis and Machine Intelligence.
Introduction to statistical pattern recognition (2nd ed.).
Small Sample Size Effects in Statistical Pattern Recognition: Recommendations for Practitioners. IEEE Transactions on Pattern Analysis and Machine Intelligence.
Statistical Pattern Recognition: A Review. IEEE Transactions on Pattern Analysis and Machine Intelligence.
Statistical and neural classifiers: an integrated approach to design.
Pattern Recognition and Neural Networks.
Estimating the uncertainty in the estimated mean area under the ROC curve of a classifier. Pattern Recognition Letters.
Assessing Classifiers from Two Independent Data Sets Using ROC Analysis: A Nonparametric Approach. IEEE Transactions on Pattern Analysis and Machine Intelligence.
Computers in Biology and Medicine.
Journal of Systems and Software.
IJCNN'09 Proceedings of the 2009 International Joint Conference on Neural Networks.
Classifier variability: Accounting for training and testing. Pattern Recognition.
Abstract--The conventional wisdom in the field of statistical pattern recognition (SPR) is that the size of the finite test sample dominates the variance in the assessment of the performance of a classical or neural classifier. The present work shows that this result has only narrow applicability. In particular, when competing algorithms are compared, the finite training sample more commonly dominates this uncertainty. This general problem in SPR is analyzed using a formal structure recently developed for multivariate random-effects receiver operating characteristic (ROC) analysis. Monte Carlo trials within the general model are used to explore the detailed statistical structure of several representative problems in the subfield of computer-aided diagnosis in medicine. The scaling laws relating the variance of accuracy measures to the number of training samples and the number of test samples are investigated and found to be comparable to those discussed in the classic text of Fukunaga, although previous authors have neglected important interaction terms. Finally, the importance of the contribution of the finite training sample to these uncertainties argues for some form of bootstrap analysis to sample that uncertainty. The leading contemporary candidate is an extension of the 0.632 bootstrap and its associated error analysis, as opposed to the more commonly used cross-validation.
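To make the closing recommendation concrete, the following is a minimal sketch of the 0.632 bootstrap error estimate the abstract refers to. It blends the optimistic resubstitution error with the pessimistic out-of-bag bootstrap error via the fixed weights 0.368 and 0.632. The function names `fit` and `predict` and the nearest-mean classifier in the usage example are illustrative assumptions, not part of the paper; the paper's own extension of the 0.632 bootstrap and its error analysis is more elaborate than this sketch.

```python
import numpy as np

def bootstrap_632_error(X, y, fit, predict, n_boot=200, rng=None):
    """Estimate classification error with the basic 0.632 bootstrap.

    err_632 = 0.368 * err_resub + 0.632 * err_oob

    `fit(X, y)` returns a trained model; `predict(model, X)` returns labels.
    (These callables are assumptions for this sketch.)
    """
    rng = np.random.default_rng(rng)
    n = len(y)

    # Resubstitution error: train and test on the same full sample (optimistic).
    model = fit(X, y)
    err_resub = np.mean(predict(model, X) != y)

    # Out-of-bag error: train on bootstrap resamples drawn with replacement,
    # test on the cases left out of each resample (pessimistic).
    oob_errors = []
    for _ in range(n_boot):
        idx = rng.integers(0, n, size=n)           # resample with replacement
        oob = np.setdiff1d(np.arange(n), idx)      # cases not drawn this round
        if oob.size == 0:
            continue
        model = fit(X[idx], y[idx])
        oob_errors.append(np.mean(predict(model, X[oob]) != y[oob]))
    err_oob = np.mean(oob_errors)

    return 0.368 * err_resub + 0.632 * err_oob


# Usage with a toy nearest-mean classifier on two well-separated classes.
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(-3, 1, (40, 2)), rng.normal(3, 1, (40, 2))])
y = np.array([0] * 40 + [1] * 40)

def fit(X, y):
    return {c: X[y == c].mean(axis=0) for c in np.unique(y)}

def predict(model, X):
    classes = sorted(model)
    d = np.stack([np.linalg.norm(X - model[c], axis=1) for c in classes], axis=1)
    return np.array(classes)[d.argmin(axis=1)]

err = bootstrap_632_error(X, y, fit, predict, n_boot=50, rng=1)
```

Because training occurs inside each bootstrap resample, repeating the whole procedure over many draws of the training set also samples the training-set component of variance, which is the paper's motivation for preferring this family of estimators over plain cross-validation.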