The influence of prior knowledge on the expected performance of a classifier

Authors:
Vladimir Berikov; Alexander Litvinenko
Affiliations:
Sobolev Institute of Mathematics, Siberian Branch of Russian Academy of Sciences, pr. Koptyuga 4, Novosibirsk 630090, Russia; Sobolev Institute of Mathematics, Siberian Branch of Russian Academy of Sciences, pr. Koptyuga 4, Novosibirsk 630090, Russia and Max Planck Institute for Mathematics in the Sciences, Inselstrasse ...
Venue:
Pattern Recognition Letters
Year:
2003

Abstract

In this paper, we study the probabilistic properties of pattern classifiers in a discrete feature space. The analysis is based on the principle of Bayesian averaging of recognition performance. We consider two cases: (a) the prior probabilities of the classes are unknown, and (b) the prior probabilities of the classes are known. The misclassification probability is treated as a random variable, for which the characteristic function (expressed via the Kummer hypergeometric function) and the absolute moments are derived analytically. For the case of unknown priors, we obtain an approximate formula for the sufficient learning sample size. We compare the performance in the two cases. As an example, we consider the problem of classifying mutational hotspots in genetic sequences.
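
The idea of treating the misclassification probability as a random variable can be illustrated by simulation. The following is a minimal Monte Carlo sketch, not the analytical derivation described in the abstract. It assumes a uniform Dirichlet prior over the joint cell-and-class probabilities (loosely in the spirit of case (a), unknown class priors) and a simple majority-count decision rule in each cell of the discrete feature space. Both choices, and all function names, are illustrative assumptions rather than details taken from the paper.

```python
import random


def sample_dirichlet(alpha, rng):
    """Draw one Dirichlet sample by normalizing independent Gamma draws."""
    draws = [rng.gammavariate(a, 1.0) for a in alpha]
    total = sum(draws)
    return [d / total for d in draws]


def empirical_rule(counts):
    """Assign each cell to the class with the most training objects.

    counts[j][k] is the number of training objects in cell j with class k.
    Ties (including empty cells) go to the lowest class index.
    """
    return [max(range(len(row)), key=lambda k: row[k]) for row in counts]


def error_of_rule(theta, rule):
    """Misclassification probability of a rule under the true distribution.

    theta[j][k] is the joint probability of cell j and class k.
    """
    return sum(p for j, row in enumerate(theta)
               for k, p in enumerate(row) if k != rule[j])


def simulate_errors(n_cells, n_classes, sample_size, n_trials, seed=0):
    """Sample the misclassification probability under Bayesian averaging.

    Assumption (not from the source): the joint distribution is drawn
    from a uniform Dirichlet prior over all cell-class combinations.
    """
    rng = random.Random(seed)
    size = n_cells * n_classes
    errors = []
    for _ in range(n_trials):
        flat = sample_dirichlet([1.0] * size, rng)
        # Draw a learning sample and tabulate it by cell and class.
        counts = [[0] * n_classes for _ in range(n_cells)]
        for idx in rng.choices(range(size), weights=flat, k=sample_size):
            counts[idx // n_classes][idx % n_classes] += 1
        theta = [flat[j * n_classes:(j + 1) * n_classes]
                 for j in range(n_cells)]
        errors.append(error_of_rule(theta, empirical_rule(counts)))
    return errors


if __name__ == "__main__":
    for n in (0, 10, 50, 200):
        errs = simulate_errors(n_cells=4, n_classes=2,
                               sample_size=n, n_trials=2000)
        print(n, sum(errs) / len(errs))
```

Averaging the returned values estimates the expected misclassification probability for a given learning sample size. Running the sketch for several sample sizes gives a rough empirical feel for how large a sample is needed before the expected error stops improving noticeably, which is the question the approximate sample-size formula addresses analytically.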