Risk estimation for nonparametric discrimination and estimation rules: A simulation study (Corresp.)

  • Authors:
  • C. Penrod;T. Wagner

  • Affiliations:
  • -;-

  • Venue:
  • IEEE Transactions on Information Theory
  • Year:
  • 1979

Quantified Score

Hi-index 754.84

Visualization

Abstract

The designer of a nonparametric discrimination or estimation procedure is almost always interested in the conditional risk of his procedure, or nde, conditioned on the available data. Unfortunately,L_{n}, the risk conditioned on a data set containingnobservations, cannot be computed without exact knowledge of the underlying probability distribution functions. Since such knowledge is unavailable, the designer must be content with estimates ofL_{n}. Two such estimates are the deleted estimate,L_{n}^{D}, and the holdout estimate,L_{n}^{H}. This paper presents the results of an experimental study of these two estimates and compares these results with some recently obtained distribution.free theoretical results. Among other things, the experimental data indicates that fork-nearest neighbor rules in{bf R^{1}}with several examples of underlying distributions,P{|L_{n}-L^{D}_{n}| geq 2e^{-2 epsilon^{2}n}mbox{P{|}L_{n}-L^{H}_{n}| geq epsilon} 2e^{-2 epsilon^{2} sqrt{n}.}