Machine Learning
Selection of relevant features and examples in machine learning
Artificial Intelligence - Special issue on relevance
Wrappers for feature subset selection
Artificial Intelligence - Special issue on relevance
Data mining: concepts and techniques
Data mining: concepts and techniques
Pattern Recognition and Neural Networks
Pattern Recognition and Neural Networks
Boosting the margin: A new explanation for the effectiveness of voting methods
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
A study of cross-validation and bootstrap for accuracy estimation and model selection
IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2
Hi-index | 0.00 |
Subset selection with a wrapper approach to identify atypical examples can be preferable to a filter approach (which may not be consistent with the classifier in use) but its running time is prohibitive. The fastest available wrappers are quadratic in the number of examples, which is far too expensive for sample subset selection. The presented approach is a linear wrapper method that is roughly 80 times faster than the quadratic wrappers. Atypical points are defined in this paper as the misclassified points that the proposed algorithm (Atypical Sequential Ranking: ASR) finds not useful to the classification task. They may include both outliers and overlapping samples. ASR can identify and rank atypical points in the whole dataset without damaging the prediction accuracy. It is general enough that classifiers without reject option can use it. Experiments on 20 benchmark datasets and 5 classifiers show promising results and confirm that this wrapper method has some advantages and can be used in sample subset selection for atypical detection.