Determining the cause of negative dissimilarity eigenvalues

Authors:
Weiping Xu;Richard C. Wilson;Edwin R. Hancock
Affiliations:
Dept. of Computer Science, University of York, UK;Dept. of Computer Science, University of York, UK;Dept. of Computer Science, University of York, UK
Venue:
CAIP'11 Proceedings of the 14th international conference on Computer analysis of images and patterns - Volume Part I
Year:
2011

Citing 4
Cited 0

On the information and representation of non-Euclidean pairwise data

Pattern Recognition
Non-Euclidean dissimilarities: causes and informativeness

SSPR&SPR'10 Proceedings of the 2010 joint IAPR international conference on Structural, syntactic, and statistical pattern recognition
Spherical embedding and classification

SSPR&SPR'10 Proceedings of the 2010 joint IAPR international conference on Structural, syntactic, and statistical pattern recognition
Non-Euclidean or non-metric measures can be informative

SSPR'06/SPR'06 Proceedings of the 2006 joint IAPR international conference on Structural, Syntactic, and Statistical Pattern Recognition

Quantified Score

Hi-index	0.00

Visualization

Abstract

Pairwise dissimilarity representations are frequently used as an alternative to feature vectors in pattern recognition. One of the problems encountered in the analysis of such data, is that the dissimilarities are rarely Euclidean, and are sometimes non-metric too. As a result the objects associated with the dissimilarities can not be embedded into a Euclidean space without distortion. One way of gauging the extent of this problem is to compute the total mass associated with the negative eigenvalues of the dissimilarity matrix. However,this test does not reveal the origins of non-Euclidean or non-metric artefacts in the data. The aim in this paper is to provide simple empirical tests that can be used to determine the origins of the negative dissimilarity eigenvalues. We consider three sources of the negative dissimilarity eigenvalues, namely a) that the data resides on a manifold (here for simplicity we consider a sphere), b) that the objects may be extended and c) that there is Gaussian error. We develop three measures based on the non-metricity and the negative spectrum to characterize the possible causes of non-Euclidean data. We then experimentally test our measures on various real-world dissimilarity datasets.