A new definition of neighborhood of a point in multi-dimensional space
Pattern Recognition Letters
Reduction Techniques for Instance-BasedLearning Algorithms
Machine Learning
Advances in Instance Selection for Instance-Based Learning Algorithms
Data Mining and Knowledge Discovery
Analysis of new techniques to obtain quality training sets
Pattern Recognition Letters - Special issue: Sibgrapi 2001
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Cover trees for nearest neighbor
ICML '06 Proceedings of the 23rd international conference on Machine learning
Statistical Comparisons of Classifiers over Multiple Data Sets
The Journal of Machine Learning Research
Hit Miss Networks with Applications to Instance Selection
The Journal of Machine Learning Research
When Similar Problems Don't Have Similar Solutions
ICCBR '07 Proceedings of the 7th international conference on Case-Based Reasoning: Case-Based Reasoning Research and Development
Avoiding Boosting Overfitting by Removing Confusing Samples
ECML '07 Proceedings of the 18th European conference on Machine Learning
The Good, the Bad and the Incorrectly Classified: Profiling Cases for Case-Base Editing
ICCBR '09 Proceedings of the 8th International Conference on Case-Based Reasoning: Case-Based Reasoning Research and Development
A Scalable Noise Reduction Technique for Large Case-Based Systems
ICCBR '09 Proceedings of the 8th International Conference on Case-Based Reasoning: Case-Based Reasoning Research and Development
Noise-tolerant instance-based learning algorithms
IJCAI'89 Proceedings of the 11th international joint conference on Artificial intelligence - Volume 1
Remembering to forget: a competence-preserving case deletion policy for case-based reasoning systems
IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 1
Noise reduction for instance-based learning with a local maximal margin approach
Journal of Intelligent Information Systems
Fast and Scalable Local Kernel Machines
The Journal of Machine Learning Research
Adaptive case-based reasoning using retention and forgetting strategies
Knowledge-Based Systems
CBTV: visualising case bases for similarity measure design and selection
ICCBR'10 Proceedings of the 18th international conference on Case-Based Reasoning Research and Development
The condensed nearest neighbor rule (Corresp.)
IEEE Transactions on Information Theory
The reduced nearest neighbor rule (Corresp.)
IEEE Transactions on Information Theory
An algorithm for a selective nearest neighbor decision rule (Corresp.)
IEEE Transactions on Information Theory
Hi-index | 0.00 |
The dependency on the quality of the training data has led to significant work in noise reduction for instance-based learning algorithms. This paper presents an empirical evaluation of current noise reduction techniques, not just from the perspective of their comparative performance, but from the perspective of investigating the types of instances that they focus on for removal. A novel instance profiling technique known as RDCL profiling allows the structure of a training set to be analysed at the instance level categorising each instance based on modelling their local competence properties. This profiling approach offers the opportunity of investigating the types of instances removed by the noise reduction techniques that are currently in use in instance-based learning. The paper also considers the effect of removing instances with specific profiles from a dataset and shows that a very simple approach of removing instances that are misclassified by the training set and cause other instances in the dataset to be misclassified is an effective noise reduction technique.