Augmented embedding of dissimilarity data into (pseudo-)euclidean spaces

  • Authors:
  • Artsiom Harol;Elżbieta Pękalska;Sergey Verzakov;Robert P. W. Duin

  • Affiliations:
  • Information and Communication Theory group, Faculty of Electrical Engineering, Mathematics and Computer Science, Delft University of Technology, The Netherlands;School of Computer Science, University of Manchester, United Kingdom;Information and Communication Theory group, Faculty of Electrical Engineering, Mathematics and Computer Science, Delft University of Technology, The Netherlands;Information and Communication Theory group, Faculty of Electrical Engineering, Mathematics and Computer Science, Delft University of Technology, The Netherlands

  • Venue:
  • SSPR'06/SPR'06 Proceedings of the 2006 joint IAPR international conference on Structural, Syntactic, and Statistical Pattern Recognition
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Pairwise proximities describe the properties of objects in terms of their similarities. By using different distance-based functions one may encode different characteristics of a given problem. However, to use the framework of statistical pattern recognition some vector representation should be constructed. One of the simplest ways to do that is to define an isometric embedding to some vector space. In this work, we will focus on a linear embedding into a (pseudo-)Euclidean space. This is usually well defined for training data. Some inadequacy, however, appears when projecting new or test objects due to the resulting projection errors. In this paper we propose an augmented embedding algorithm that enlarges the dimensionality of the space such that the resulting projection error vanishes. Our preliminary results show that it may lead to a better classification accuracy, especially for data with high intrinsic dimensionality.