Reduction Techniques for Instance-Based Learning Algorithms
Machine Learning
Advances in Instance Selection for Instance-Based Learning Algorithms
Data Mining and Knowledge Discovery
Reference Set Thinning for the k-Nearest Neighbor Decision Rule
ICPR '98 Proceedings of the 14th International Conference on Pattern Recognition - Volume 1
Application of computational geometry to pattern recognition problems
Learning pattern classification-a survey
IEEE Transactions on Information Theory
Hit Miss Networks with Applications to Instance Selection
The Journal of Machine Learning Research
Improving accuracy of LVQ algorithm by instance weighting
ICANN'10 Proceedings of the 20th International Conference on Artificial Neural Networks: Part III
Pruning classification rules with reference vector selection methods
ICAISC'10 Proceedings of the 10th International Conference on Artificial Intelligence and Soft Computing: Part I
Random optimized geometric ensembles
Neurocomputing
MLDM'12 Proceedings of the 8th international conference on Machine Learning and Data Mining in Pattern Recognition
In the typical nonparametric approach to classification in instance-based learning and data mining, random data (a training set of patterns) are collected and used to design a decision rule (classifier). One of the best-known such rules is the k-nearest neighbor decision rule (also known as lazy learning), in which an unknown pattern is assigned to the majority class among its k nearest neighbors in the training set. This rule gives low error rates when the training set is large. In practice, however, it is desirable to store as little of the training data as possible without sacrificing performance. It is well known that thinning (condensing) the training set with the Gabriel proximity graph is a viable partial solution to this problem, but it raises the further problem of efficiently computing the Gabriel graph of large training sets in high-dimensional spaces. In this paper we report a new approach to the instance-based learning problem that combines five tools: first, editing the data with Wilson-Gabriel editing to smooth the decision boundary; second, applying Gabriel thinning to the edited set; third, filtering the output with the ICF algorithm of Brighton and Mellish; fourth, classifying new incoming queries with the Gabriel-neighbor decision rule; and fifth, using a new data structure that supports efficient computation of approximate Gabriel graphs in high-dimensional spaces. Extensive experiments suggest that our approach outperforms competing methods.
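To make the decision rule concrete, here is a minimal sketch of k-nearest-neighbor classification with Euclidean distance and majority vote (the function and variable names are illustrative, not from the paper):

```python
from collections import Counter
import math

def knn_classify(query, training_set, k=3):
    """Assign `query` to the majority class among its k nearest
    training patterns, measured by Euclidean distance.

    `training_set` is a list of (point, label) pairs, where each
    point is a tuple of coordinates.
    """
    # Sort the training patterns by distance to the query point
    # and keep the k closest ones.
    neighbors = sorted(training_set,
                       key=lambda item: math.dist(query, item[0]))[:k]
    # Majority vote among the k nearest neighbors.
    votes = Counter(label for _, label in neighbors)
    return votes.most_common(1)[0][0]
```

This brute-force version scans the whole training set per query, which is exactly the storage and query-time cost that editing and thinning aim to reduce.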
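The Gabriel graph underlying the thinning step joins two points p and q exactly when no third point lies inside the open ball whose diameter is the segment pq, i.e. when d(p,x)² + d(q,x)² ≥ d(p,q)² for every other point x. A brute-force sketch of that edge test follows (names are illustrative; the test costs O(n) per pair, which is why the paper's approximate data structure matters in high dimensions):

```python
import math

def gabriel_neighbors(p, q, points):
    """Return True if p and q are Gabriel neighbors in `points`,
    i.e. no other point falls strictly inside the open ball whose
    diameter is the segment pq."""
    d_pq_sq = math.dist(p, q) ** 2
    for x in points:
        if x == p or x == q:
            continue
        # x lies strictly inside the diametral ball iff
        # d(p,x)^2 + d(q,x)^2 < d(p,q)^2.
        if math.dist(p, x) ** 2 + math.dist(q, x) ** 2 < d_pq_sq:
            return False
    return True
```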