A simple noise-tolerant abstraction algorithm for fast k-NN classification

Authors:
Stefanos Ougiaroglou;Georgios Evangelidis
Affiliations:
Dept. of Applied Informatics, University of Macedonia, Thessaloniki, Greece;Dept. of Applied Informatics, University of Macedonia, Thessaloniki, Greece
Venue:
HAIS'12 Proceedings of the 7th international conference on Hybrid Artificial Intelligent Systems - Volume Part II
Year:
2012

Citing 15
Cited 0

A sample set condensation algorithm for the class sensitive artificial neural network

Pattern Recognition Letters
Reduction Techniques for Instance-BasedLearning Algorithms

Machine Learning
Learning Symbolic Prototypes

ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Fast k-Nearest Neighbor Classification Using Cluster-Based Trees

IEEE Transactions on Pattern Analysis and Machine Intelligence
Using Representative-Based Clustering for Nearest Neighbor Dataset Editing

ICDM '04 Proceedings of the Fourth IEEE International Conference on Data Mining
Foundations of Multidimensional and Metric Data Structures (The Morgan Kaufmann Series in Computer Graphics and Geometric Modeling)

Foundations of Multidimensional and Metric Data Structures (The Morgan Kaufmann Series in Computer Graphics and Geometric Modeling)
Fast condensed nearest neighbor rule

ICML '05 Proceedings of the 22nd international conference on Machine learning
Similarity Search: The Metric Space Approach (Advances in Database Systems)

Similarity Search: The Metric Space Approach (Advances in Database Systems)
Clustering-Based Reference Set Reduction for k-Nearest Neighbor

ISNN '07 Proceedings of the 4th international symposium on Neural Networks: Part II--Advances in Neural Networks
KEEL: a software tool to assess evolutionary algorithms for data mining problems

Soft Computing - A Fusion of Foundations, Methodologies and Applications - Special Issue on Evolutionary and Metaheuristics based Data Mining (EMBDM); Guest Editors: José A. Gámez, María J. del Jesús, José M. Puerta
Cluster-Based Similarity Search in Time Series

BCI '09 Proceedings of the 2009 Fourth Balkan Conference in Informatics
Data Mining: Concepts and Techniques

Data Mining: Concepts and Techniques
Prototype Selection for Nearest Neighbor Classification: Taxonomy and Empirical Study

IEEE Transactions on Pattern Analysis and Machine Intelligence
Towards efficient imputation by nearest-neighbors: a clustering-based approach

AI'04 Proceedings of the 17th Australian joint conference on Advances in Artificial Intelligence
A Taxonomy and Experimental Study on Prototype Generation for Nearest Neighbor Classification

IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews

Quantified Score

Hi-index	0.00

Visualization

Abstract

The k-Nearest Neighbor (k-NN) classifier is a widely-used and effective classification method. The main k-NN drawback is that it involves high computational cost when applied on large datasets. Many Data Reduction Techniques have been proposed in order to speed-up the classification process. However, their effectiveness depends on the level of noise in the data. This paper shows that the k-means clustering algorithm can be used as a noise-tolerant Data Reduction Technique. The conducted experimental study illustrates that if the reduced dataset includes the k-means centroids as representatives of the initial data, performance is not negatively affected as much by the addition of noise.