Data Compression and Local Metrics for Nearest Neighbor Classification

  • Authors:
  • Francesco Ricci;Paolo Avesani

  • Affiliations:
  • Istituto per la Ricerca Scientifica e Tecnologica, Povo (TN), Italy;Istituto per la Ricerca Scientifica e Tecnologica, Pavo (TN), Italy

  • Venue:
  • IEEE Transactions on Pattern Analysis and Machine Intelligence
  • Year:
  • 1999

Quantified Score

Hi-index 0.15

Visualization

Abstract

A local distance measure for the nearest neighbor classification rule is shown to achieve high compression rates and high accuracy on real data sets. In the approach proposed here, first, a set of prototypes is extracted during training and, then, a feedback learning algorithm is used to optimize the metric. Even if the prototypes are randomly selected, the proposed metric outperforms, both in compression rate and accuracy, common editing procedures like ICA, RNN, and PNN. Finally, when accuracy is the major concern, we show how compression can be traded for accuracy by exploiting voting techniques. That indicates how voting can be successfully integrated with instance-bases approaches, overcoming previous negative results.