The influence of prior knowledge on the expected performance of a classifier

  • Authors:
  • Vladimir Berikov; Alexander Litvinenko

  • Affiliations:
  • Sobolev Institute of Mathematics, Siberian Branch of Russian Academy of Sciences, pr. Koptyuga 4, Novosibirsk 630090, Russia;Sobolev Institute of Mathematics, Siberian Branch of Russian Academy of Sciences, pr. Koptyuga 4, Novosibirsk 630090, Russia and Max Planck Institute for Mathematics in the Sciences, Inselstrasse ...

  • Venue:
  • Pattern Recognition Letters
  • Year:
  • 2003

Abstract

In this paper, we study the probabilistic properties of pattern classifiers in a discrete feature space, using the principle of Bayesian averaging of recognition performance. We consider two cases: (a) the prior probabilities of the classes are unknown, and (b) the prior probabilities of the classes are known. The misclassification probability is treated as a random variable, for which the characteristic function (expressed via the Kummer hypergeometric function) and the absolute moments are derived analytically. For the case of unknown priors, we obtain an approximate formula for calculating a sufficient learning sample size. We then compare the performance in the two cases. As an example, we consider the classification of mutational hotspots in genetic sequences.
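
To make the idea of treating the misclassification probability as a random variable concrete, here is a minimal Python sketch. It is not the paper's derivation, and every modelling choice in it is an assumption made for illustration: two classes, a uniform Dirichlet prior over the joint (cell, class) probabilities, a majority-vote decision rule per cell, and hypothetical function names and counts. Under those assumptions the posterior misclassification probability is a sum of Dirichlet components and therefore Beta(a, b) distributed. Its raw moments have a closed product form, and its characteristic function is the Kummer function 1F1(a; a + b; it), which is a standard property of the Beta distribution.

```python
def posterior_beta_params(counts):
    """Assumed model: uniform Dirichlet prior over all (cell, class) pairs,
    each cell assigned to its majority class. `counts` is a list of
    (n_class1, n_class2) pairs, one per cell of the discrete feature space.
    Returns (a, b) such that the posterior error probability is Beta(a, b)."""
    a = sum(1 + min(n1, n2) for n1, n2 in counts)  # mass on misclassified pairs
    b = sum(1 + max(n1, n2) for n1, n2 in counts)  # mass on correctly classified pairs
    return a, b


def beta_moment(a, b, k):
    """k-th raw moment of Beta(a, b): product over r < k of (a + r) / (a + b + r)."""
    moment = 1.0
    for r in range(k):
        moment *= (a + r) / (a + b + r)
    return moment


def kummer_1f1(a, b, z, terms=200):
    """Truncated power series of Kummer's function 1F1(a; b; z).
    Adequate for moderate |z|; not a general-purpose implementation."""
    total = 0j
    term = 1 + 0j
    for k in range(terms):
        total += term
        term *= (a + k) / (b + k) * z / (k + 1)
    return total


def beta_char_fn(a, b, t):
    """Characteristic function E[exp(i t P)] of P ~ Beta(a, b)."""
    return kummer_1f1(a, a + b, 1j * t)


if __name__ == "__main__":
    # Hypothetical learning sample: 4 cells, counts per class in each cell.
    counts = [(8, 2), (1, 9), (5, 5), (0, 3)]
    a, b = posterior_beta_params(counts)
    mean = beta_moment(a, b, 1)
    variance = beta_moment(a, b, 2) - mean ** 2
    print("Beta parameters:", a, b)
    print("expected misclassification probability:", mean)
    print("variance:", variance)
    print("characteristic function at t = 1:", beta_char_fn(a, b, 1.0))
```

The truncated series is used only to keep the sketch free of dependencies. A numerical library routine for the confluent hypergeometric function would be the more robust choice for large arguments.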