Completely Lazy Learning

Authors:
Eric K. Garcia;Sergey Feldman;Maya R. Gupta;Santosh Srivastava
Affiliations:
University of Washington, Seattle;University of Washington, Seattle;University of Washington, Seattle;Fred Hutchinson Cancer Research Center, Seattle
Venue:
IEEE Transactions on Knowledge and Data Engineering
Year:
2010

Citing 0
Cited 7

Differential evolution for optimizing the positioning of prototypes in nearest neighbor classification

Pattern Recognition
IPADE: iterative prototype adjustment for nearest neighbor classification

IEEE Transactions on Neural Networks
Enhancing IPADE algorithm with a different individual codification

HAIS'11 Proceedings of the 6th international conference on Hybrid artificial intelligent systems - Volume Part II
Multi-task regularization of generative similarity models

SIMBAD'11 Proceedings of the First international conference on Similarity-based pattern recognition
Fast exact local-mean based classification algorithm using class rejection and initial distance assigning processes

Pattern Recognition Letters
Random subspace evidence classifier

Neurocomputing
Lazy overfitting control

MLDM'13 Proceedings of the 9th international conference on Machine Learning and Data Mining in Pattern Recognition

Quantified Score

Hi-index	0.00

Visualization

Abstract

Local classifiers are sometimes called lazy learners because they do not train a classifier until presented with a test sample. However, such methods are generally not completely lazy because the neighborhood size k (or other locality parameter) is usually chosen by cross validation on the training set, which can require significant preprocessing and risks overfitting. We propose a simple alternative to cross validation of the neighborhood size that requires no preprocessing: instead of committing to one neighborhood size, average the discriminants for multiple neighborhoods. We show that this forms an expected estimated posterior that minimizes the expected Bregman loss with respect to the uncertainty about the neighborhood choice. We analyze this approach for six standard and state-of-the-art local classifiers, including discriminative adaptive metric kNN (DANN), a local support vector machine (SVM-KNN), hyperplane distance nearest neighbor (HKNN), and a new local Bayesian quadratic discriminant analysis (local BDA). The empirical effectiveness of this technique versus cross validation is confirmed with experiments on seven benchmark data sets, showing that similar classification performance can be attained without any training.