General solution and learning method for binary classification with performance constraints
Pattern Recognition Letters
Radial kernels and their reproducing kernel Hilbert spaces
Journal of Complexity
Neyman-Pearson Classification, Convexity and Stochastic Constraints
The Journal of Machine Learning Research
Class proximity measures - Dissimilarity-based classification and display of high-dimensional data
Journal of Biomedical Informatics
Statistical detection of LSB matching using hypothesis testing theory
IH'12 Proceedings of the 14th international conference on Information Hiding
A plug-in approach to Neyman-Pearson classification
The Journal of Machine Learning Research
In the Neyman-Pearson (NP) classification paradigm, the goal is to learn a classifier from labeled training data such that the probability of a false negative is minimized while the probability of a false positive is below a user-specified level α ∈ (0, 1). This work addresses the question of how to evaluate and compare classifiers in the NP setting. Simply reporting false positive and false negative rates leaves some ambiguity about which classifier is best. Unlike conventional classification, however, NP classification has no natural performance measure. We cannot simply reject classifiers whose false positive rate exceeds α since, among other reasons, the false positive rate must be estimated from data and hence is not known with certainty. We propose two families of performance measures for evaluating and comparing classifiers, and suggest one criterion in particular for practical use. We then present general learning rules that satisfy performance guarantees with respect to these criteria. As in conventional classification, the notion of uniform convergence plays a central role, leading to finite-sample bounds, oracle inequalities, consistency, and rates of convergence. The proposed performance measures are also applicable to the problem of anomaly prediction.
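The abstract notes that the false positive rate is not known exactly but must be estimated from labeled data. A minimal sketch of that estimation step is below; the function name and the toy labels are illustrative, not from the paper, and the paper's actual performance measures are not reproduced here.

```python
import numpy as np

def empirical_error_rates(y_true, y_pred):
    """Estimate the two class-conditional error rates of NP classification.

    Class 0 is the negative class, class 1 the positive class.
    Returns (fpr, fnr): the empirical false positive rate (which the NP
    constraint bounds by alpha) and the empirical false negative rate
    (which the learner tries to minimize). Both are sample estimates and
    hence carry uncertainty, as the abstract emphasizes.
    """
    y_true = np.asarray(y_true)
    y_pred = np.asarray(y_pred)
    fpr = np.mean(y_pred[y_true == 0] == 1)  # P_hat(predict 1 | class 0)
    fnr = np.mean(y_pred[y_true == 1] == 0)  # P_hat(predict 0 | class 1)
    return fpr, fnr

# Hypothetical evaluation sample: 6 negatives followed by 4 positives.
y_true = [0, 0, 0, 0, 0, 0, 1, 1, 1, 1]
y_pred = [0, 0, 0, 0, 0, 1, 1, 1, 1, 0]
fpr, fnr = empirical_error_rates(y_true, y_pred)
# fpr = 1/6, fnr = 1/4
```

Because these rates are estimates, a classifier whose empirical false positive rate slightly exceeds α cannot simply be discarded, which is what motivates the paper's proposed performance measures.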