Specific-class distance measures for nominal attributes

  • Authors:
  • Khalil El Hindi

  • Affiliations:
  • Computer Science Department, College of Computer and Information Sciences, King Saud University, Riaydh, Saudi Arabia E-mails: khindi@ksu.edu.sa, kelhindi@gmail.com

  • Venue:
  • AI Communications
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

The classification accuracy of many machine learning methods depends upon their ability to accurately measure the similarity between different instances. Similarity is measured using a distance metric or measure. In this work, several novel distance measures for nominal values are proposed. These distance measures exploit the class of a training example against which a new instance is compared. The experiments, conducted using 50 benchmark data sets, indicate that the proposed functions are superior in many cases to the Value Difference Metric VDM that is widely used in instance based learning. Some of the proposed measures have proven to be less sensitive to missing values and noise in the training data sets and have maintained good classification accuracy in the presence of unknown and noisy attribute values. Like VDM, the proposed measures work only with labelled training data sets which makes them unsuitable for unsupervised learning methods.