Specific-class distance measures for nominal attributes

Authors:
Khalil El Hindi
Affiliations:
Computer Science Department, College of Computer and Information Sciences, King Saud University, Riaydh, Saudi Arabia E-mails: khindi@ksu.edu.sa, kelhindi@gmail.com
Venue:
AI Communications
Year:
2013

Citing 15
Cited 0

Toward memory-based reasoning

Communications of the ACM - Special issue on parallelism
Instance-Based Learning Algorithms

Machine Learning
C4.5: programs for machine learning

C4.5: programs for machine learning
Machine Learning

Machine Learning
Self-Organizing Maps

Self-Organizing Maps
Probability Based Metrics for Nearest Neighbor Classification and Case-Based Reasoning

ICCBR '99 Proceedings of the Third International Conference on Case-Based Reasoning and Development
Radial Basis Functions

Radial Basis Functions
Nearest Neighbors by Neighborhood Counting

IEEE Transactions on Pattern Analysis and Machine Intelligence
MODL: A Bayes optimal discretization method for continuous attributes

Machine Learning
Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)

Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)
Distance Metric Learning for Large Margin Nearest Neighbor Classification

The Journal of Machine Learning Research
Improved heterogeneous distance functions

Journal of Artificial Intelligence Research
A flexible and robust similarity measure based on contextual probability

IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Neighborhood Counting Measure and Minimum Risk Metric

IEEE Transactions on Pattern Analysis and Machine Intelligence
Least squares quantization in PCM

IEEE Transactions on Information Theory

Quantified Score

Hi-index	0.00

Visualization

Abstract

The classification accuracy of many machine learning methods depends upon their ability to accurately measure the similarity between different instances. Similarity is measured using a distance metric or measure. In this work, several novel distance measures for nominal values are proposed. These distance measures exploit the class of a training example against which a new instance is compared. The experiments, conducted using 50 benchmark data sets, indicate that the proposed functions are superior in many cases to the Value Difference Metric VDM that is widely used in instance based learning. Some of the proposed measures have proven to be less sensitive to missing values and noise in the training data sets and have maintained good classification accuracy in the presence of unknown and noisy attribute values. Like VDM, the proposed measures work only with labelled training data sets which makes them unsuitable for unsupervised learning methods.