An Unsupervised Bayesian Distance Measure

  • Authors:
  • Petri Kontkanen;Jussi Lahtinen;Petri Myllymäki;Henry Tirri

  • Affiliations:
  • -;-;-;-

  • Venue:
  • EWCBR '00 Proceedings of the 5th European Workshop on Advances in Case-Based Reasoning
  • Year:
  • 2000

Quantified Score

Hi-index 0.00

Visualization

Abstract

We introduce a distance measure based on the idea that two vectors are considered similar if they lead to similar predictive probability distributions. The suggested approach avoids the scaling problem inherent to many alternative techniques as the method automatically transforms the original attribute space to a probability space where all the numbers lie between 0 and 1. The method is also flexible in the sense that it allows different attribute types (discrete or continuous) in the same consistent framework. To study the validity of the suggested measure, we ran a series of experiments with publicly available data sets. The empirical results demonstrate that the unsupervised distance measure is sensible in the sense that it can be used for discovering the hidden clustering structure of the data.