Lazy attribute selection: Choosing attributes at classification time

Authors:
Rafael B. Pereira;Alexandre Plastino;Bianca Zadrozny;Luiz Henrique de C. Merschmann;Alex A. Freitas
Affiliations:
Fluminense Federal University, Rio de Janeiro, Brazil;Fluminense Federal University, Rio de Janeiro, Brazil;IBM Research, Brazil;Ouro Preto Federal University, Ouro Preto/MG, Brazil;University of Kent, Canterbury, UK
Venue:
Intelligent Data Analysis
Year:
2011

Abstract

Attribute selection is a data preprocessing step that aims to identify the attributes relevant to the target machine learning task, which in this paper is classification. We propose a new attribute selection strategy, based on a lazy learning approach, that postpones the identification of relevant attributes until an instance is submitted for classification. The strategy relies on the hypothesis that taking into account the attribute values of the instance to be classified may help identify the best attributes for correctly classifying that particular instance. Experimental results using the k-NN and Naive Bayes classifiers, over 40 data sets from the UCI Machine Learning Repository and five large data sets from the NIPS 2003 feature selection challenge, show the effectiveness of delaying attribute selection to classification time. In most cases, the proposed lazy technique improves classification accuracy compared with the analogous attribute selection approach performed as a data preprocessing step. We also propose a metric to estimate when a specific data set can benefit from the lazy attribute selection approach.
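
The idea of choosing attributes per instance can be illustrated with a small sketch. The abstract does not state which relevance measure the strategy uses, so the code below assumes one plausible choice: for each attribute, take the training instances that share the query instance's value for that attribute, and score the attribute by the entropy of their class labels (lower is better). The function names `class_entropy`, `lazy_select`, and `knn_predict`, the parameter `r` (number of attributes to keep), the Hamming distance, and the toy data are all hypothetical and only serve the illustration.

```python
import math
from collections import Counter

def class_entropy(labels):
    """Shannon entropy (in bits) of a non-empty list of class labels."""
    total = len(labels)
    counts = Counter(labels)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())

def lazy_select(X_train, y_train, query, r):
    """Return the indices of the r attributes that look best for this query.

    Assumed score (not taken from the abstract): entropy of the class labels
    among training instances that share the query's value for the attribute.
    """
    scores = []
    for j, value in enumerate(query):
        matching = [y for x, y in zip(X_train, y_train) if x[j] == value]
        # Value never seen in training: fall back to the overall class entropy.
        score = class_entropy(matching) if matching else class_entropy(y_train)
        scores.append((score, j))
    scores.sort()  # lowest entropy first, ties broken by attribute index
    return sorted(j for _, j in scores[:r])

def knn_predict(X_train, y_train, query, attrs, k=1):
    """k-NN using Hamming distance restricted to the selected attributes."""
    def dist(x):
        return sum(x[j] != query[j] for j in attrs)
    nearest = sorted(range(len(X_train)), key=lambda i: dist(X_train[i]))[:k]
    return Counter(y_train[i] for i in nearest).most_common(1)[0][0]

# Toy categorical data (hypothetical): outlook, temperature, humidity.
X_train = [
    ("sunny", "hot", "high"),
    ("sunny", "mild", "high"),
    ("rain", "hot", "normal"),
    ("rain", "mild", "high"),
    ("overcast", "hot", "normal"),
    ("overcast", "mild", "high"),
]
y_train = ["no", "no", "yes", "yes", "yes", "yes"]

query = ("overcast", "hot", "high")
selected = lazy_select(X_train, y_train, query, r=1)
prediction = knn_predict(X_train, y_train, query, selected, k=1)
```

For this query, the value "overcast" only co-occurs with a single class in the toy training set, so the first attribute gets the lowest score and is the one kept. A different query could lead to a different attribute subset, which is the contrast with selection performed once as a preprocessing step.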