Boosting k-nearest neighbor classifier by means of input space projection

Authors:
Nicolás García-Pedrajas;Domingo Ortiz-Boyer
Affiliations:
Department of Computing and Numerical Analysis, University of Córdoba, Campus de Rabanales, 14071 Córdoba, Spain;Department of Computing and Numerical Analysis, University of Córdoba, Campus de Rabanales, 14071 Córdoba, Spain
Venue:
Expert Systems with Applications: An International Journal
Year:
2009

Citing 25
Cited 11

Stacked regressions

Machine Learning
Bagging predictors

Machine Learning
Discriminant Adaptive Nearest Neighbor Classification

IEEE Transactions on Pattern Analysis and Machine Intelligence
Combination of Multiple Classifiers Using Local Accuracy Estimates

IEEE Transactions on Pattern Analysis and Machine Intelligence
The Random Subspace Method for Constructing Decision Forests

IEEE Transactions on Pattern Analysis and Machine Intelligence
Approximate statistical tests for comparing supervised classification learning algorithms

Neural Computation
A comparative study of neural network based feature extraction paradigms

Pattern Recognition Letters
An Experimental Comparison of Three Methods for Constructing Ensembles of Decision Trees: Bagging, Boosting, and Randomization

Machine Learning
MultiBoosting: A Technique for Combining Boosting and Wagging

Machine Learning
Random Forests

Machine Learning
An Empirical Comparison of Voting Classification Algorithms: Bagging, Boosting, and Variants

Machine Learning
Combining Classifiers with Meta Decision Trees

Machine Learning
Locally Adaptive Metric Nearest-Neighbor Classification

IEEE Transactions on Pattern Analysis and Machine Intelligence
FeatureBoost: A Meta-Learning Algorithm that Improves Model Robustness

ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
Nearest Neighbors in Random Subspaces

SSPR '98/SPR '98 Proceedings of the Joint IAPR International Workshops on Advances in Pattern Recognition
Optimizing Nearest Neighbour in Random Subspaces using a Multi-Objective Genetic Algorithm

ICPR '04 Proceedings of the Pattern Recognition, 17th International Conference on (ICPR'04) Volume 1 - Volume 01
Nearest Neighbor Ensemble

ICPR '04 Proceedings of the Pattern Recognition, 17th International Conference on (ICPR'04) Volume 1 - Volume 01
Boosting Nearest Neighbor Classi.ers for Multiclass Recognition

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Workshops - Volume 03
Boosting the distance estimation

Pattern Recognition Letters
Ensembling evidential k-nearest neighbor classifiers through multi-modal perturbation

Applied Soft Computing
Statistical Comparisons of Classifiers over Multiple Data Sets

The Journal of Machine Learning Research
Nonlinear Boosting Projections for Ensemble Construction

The Journal of Machine Learning Research
Boosting random subspace method

Neural Networks
Bagging, boosting, and C4.S

AAAI'96 Proceedings of the thirteenth national conference on Artificial intelligence - Volume 1
Ensembling local learners ThroughMultimodal perturbation

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics

A complete fuzzy discriminant analysis approach for face recognition

Applied Soft Computing
Prediction of Sequential Values for Debt Recovery

CIARP '09 Proceedings of the 14th Iberoamerican Conference on Pattern Recognition: Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications
Fast exact k nearest neighbors search using an orthogonal search tree

Pattern Recognition
Discriminant analysis approach using fuzzy fourfold subspaces model

Neurocomputing
Learning active fusion of multiple experts' decisions: An attention-based approach

Neural Computation
Class imbalance methods for translation initiation site recognition in DNA sequences

Knowledge-Based Systems
Ensemble based sensing anomaly detection in wireless sensor networks

Expert Systems with Applications: An International Journal
Accurate Prediction of Coronary Artery Disease Using Reliable Diagnosis System

Journal of Medical Systems
Boosting k-NN for Categorization of Natural Scenes

International Journal of Computer Vision
A fuzzy supervised learning method with dynamical parameter estimation for nonlinear discriminant analysis

Computers & Mathematics with Applications
A fuzzy nearest neighbor neural network statistical model for predicting demand for natural gas and energy cost savings in public buildings

Expert Systems with Applications: An International Journal

Quantified Score

Hi-index	12.05

Visualization

Abstract

The k-nearest neighbors classifier is one of the most widely used methods of classification due to several interesting features, such as good generalization and easy implementation. Although simple, it is usually able to match, and even beat, more sophisticated and complex methods. However, no successful method has been reported so far to apply boosting to k-NN. As boosting methods have proved very effective in improving the generalization capabilities of many classification algorithms, proposing an appropriate application of boosting to k-nearest neighbors is of great interest. Ensemble methods rely on the instability of the classifiers to improve their performance, as k-NN is fairly stable with respect to resampling, these methods fail in their attempt to improve the performance of k-NN classifier. On the other hand, k-NN is very sensitive to input selection. In this way, ensembles based on subspace methods are able to improve the performance of single k-NN classifiers. In this paper we make use of the sensitivity of k-NN to input space for developing two methods for boosting k-NN. The two approaches modify the view of the data that each classifier receives so that the accurate classification of difficult instances is favored. The two approaches are compared with the classifier alone and bagging and random subspace methods with a marked and significant improvement of the generalization error. The comparison is performed using a large test set of 45 problems from the UCI Machine Learning Repository. A further study on noise tolerance shows that the proposed methods are less affected by class label noise than the standard methods.