Estimation by the nearest neighbor rule

  • Authors: T. Cover
  • Affiliations: -
  • Venue: IEEE Transactions on Information Theory
  • Year: 1968

Quantified Score

Hi-index 754.84

Abstract

Let $R^{\ast}$ denote the Bayes risk (minimum expected loss) for the problem of estimating $\theta \in \Theta$, given an observed random variable $x$, joint probability distribution $F(x,\theta)$, and loss function $L$. Consider the problem in which the only knowledge of $F$ is that which can be inferred from samples $(x_{1},\theta_{1}), (x_{2},\theta_{2}), \ldots, (x_{n},\theta_{n})$, where the $(x_{i},\theta_{i})$'s are independently identically distributed according to $F$. Let the nearest neighbor estimate of the parameter $\theta$ associated with an observation $x$ be defined to be the parameter $\theta_{n}'$ associated with the nearest neighbor $x_{n}'$ to $x$. Let $R$ be the large-sample risk of the nearest neighbor rule. It will be shown, for a wide range of probability distributions, that $R \leq 2R^{\ast}$ for metric loss functions and $R = 2R^{\ast}$ for squared-error loss functions. A simple estimator using the nearest $k$ neighbors yields $R = R^{\ast}(1 + 1/k)$ in the squared-error loss case. In this sense, it can be said that at least half the information in the infinite training set is contained in the nearest neighbor. This paper is an extension of earlier work from the problem of classification by the nearest neighbor rule to that of estimation. However, the unbounded loss functions in the estimation problem introduce additional problems concerning the convergence of the unconditional risk. Thus some work is devoted to the investigation of natural conditions on the underlying distribution assuring the desired convergence.
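
As a rough illustration of the $k$-nearest-neighbor estimator and the squared-error relation $R = R^{\ast}(1 + 1/k)$, the sketch below runs a small Monte Carlo check. The Gaussian model ($\theta \sim N(0,1)$, $x \mid \theta \sim N(\theta, \sigma^{2})$), the sample sizes, and all variable names are illustrative assumptions, not taken from the paper.

```python
# Minimal Monte Carlo sketch (assumed Gaussian model, not from the paper)
# illustrating the k-NN estimator and the large-sample relation
# R = R*(1 + 1/k) for squared-error loss.
import numpy as np

rng = np.random.default_rng(0)
sigma2 = 1.0                       # observation noise variance (assumed)
n_train, n_test, k = 200_000, 2_000, 5

# Training pairs (x_i, theta_i), i.i.d. from the joint distribution F.
theta_train = rng.standard_normal(n_train)
x_train = theta_train + np.sqrt(sigma2) * rng.standard_normal(n_train)

# Test pairs, drawn from the same F.
theta_test = rng.standard_normal(n_test)
x_test = theta_test + np.sqrt(sigma2) * rng.standard_normal(n_test)

# k-NN estimate: average theta over the k training points whose x is closest.
order = np.argsort(x_train)
x_sorted, theta_sorted = x_train[order], theta_train[order]
est = np.empty(n_test)
for j, x in enumerate(x_test):
    i = np.searchsorted(x_sorted, x)
    lo, hi = max(0, i - k), min(n_train, i + k)   # window containing the k nearest
    d = np.abs(x_sorted[lo:hi] - x)
    nearest = np.argsort(d)[:k]
    est[j] = theta_sorted[lo:hi][nearest].mean()

risk_knn = np.mean((est - theta_test) ** 2)
bayes_risk = sigma2 / (1.0 + sigma2)              # posterior variance in this model
print(f"empirical k-NN risk : {risk_knn:.3f}")
print(f"R*(1 + 1/k)         : {bayes_risk * (1 + 1 / k):.3f}")
```

With these assumed settings the Bayes risk is $\sigma^{2}/(1+\sigma^{2}) = 0.5$, so the empirical $k$-NN risk should land near $0.5\,(1 + 1/5) = 0.6$, in line with the stated result.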