From outliers to prototypes: Ordering data

  • Authors:
  • Stefan Harmeling;Guido Dornhege;David Tax;Frank Meinecke;Klaus-Robert Müller

  • Affiliations:
  • Fraunhofer FIRST.IDA, Kekuléstrasse 7, 12489 Berlin, Germany and Department of Computer Science, University of Potsdam, August-Bebel-Strasse 89, 14482 Potsdam, Germany;Fraunhofer FIRST.IDA, Kekuléstrasse 7, 12489 Berlin, Germany;Delft University of Technology, Information and Communication Theory Group, P.O. Box 5031, 2600 GA, Delft, The Netherlands;Fraunhofer FIRST.IDA, Kekuléstrasse 7, 12489 Berlin, Germany;Fraunhofer FIRST.IDA, Kekuléstrasse 7, 12489 Berlin, Germany and Department of Computer Science, University of Potsdam, August-Bebel-Strasse 89, 14482 Potsdam, Germany

  • Venue:
  • Neurocomputing
  • Year:
  • 2006

Quantified Score

Hi-index 0.01

Visualization

Abstract

We propose simple and fast methods based on nearest neighbors that order objects from high-dimensional data sets from typical points to untypical points. On the one hand, we show that these easy-to-compute orderings allow us to detect outliers (i.e. very untypical points) with a performance comparable to or better than other often much more sophisticated methods. On the other hand, we show how to use these orderings to detect prototypes (very typical points) which facilitate exploratory data analysis algorithms such as noisy nonlinear dimensionality reduction and clustering. Comprehensive experiments demonstrate the validity of our approach.