Projection-based measure for efficient feature selection

Authors:
Roberto Ruiz;José/ C. Riquelme;Jesú/s S. Aguilar-Ruiz
Affiliations:
(Correspd. Tel.: +34 95 455 38 67/ Fax: +34 95 455 71 39) Department of Computer Science, University of Seville, Avda. Reina Mercedes S/n, 41012 Sevilla, Spain. E-mail: {rruiz, riquelme, aguilar}@ ...;Department of Computer Science, University of Seville, Avda. Reina Mercedes S/n, 41012 Sevilla, Spain. E-mail: {rruiz, riquelme, aguilar}@lsi.us.es;Department of Computer Science, University of Seville, Avda. Reina Mercedes S/n, 41012 Sevilla, Spain. E-mail: {rruiz, riquelme, aguilar}@lsi.us.es
Venue:
Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology - IBERAMIA '02
Year:
2002

Citing 0
Cited 5

New heuristics in feature selection for high dimensional data

AI Communications
Improving the accuracy of a two-stage algorithm in evolutionary product unit neural networks for classification by means of feature selection

IWINAC'11 Proceedings of the 4th international conference on Interplay between natural and artificial computation: new challenges on bioinspired applications - Volume Part II
Gene ranking from microarray data for cancer classification: a machine learning approach

KES'06 Proceedings of the 10th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part II
Analysis of feature rankings for classification

IDA'05 Proceedings of the 6th international conference on Advances in Intelligent Data Analysis
Selection of discriminative sub-regions for palmprint recognition

Multimedia Tools and Applications

Quantified Score

Hi-index	0.00

Visualization

Abstract

The attribute selection techniques for supervised learning, used in the preprocessing phase to emphasize the most relevant attributes, allow making models of classification simpler and easy to understand. Depending on the method to apply: starting point, search organization, evaluation strategy, and the stopping criterion, there is an added cost to the classification algorithm that we are going to use, that normally will be compensated, in greater or smaller extent, by the attribute reduction in the classification model. The method proposed in this work utilizes a measure based on projections to guide the selection of the attributes. The algorithm (SOAP: Selection of Attributes by Projection) has some interesting characteristics: lower computational cost (O(mn log n) m attributes and n examples in the data set) with respect to other typical algorithms due to the absence of distance and statistical calculations; its applicability to any labelled data set, that is to say, it can contain continuous and discrete variables, with no need for transformation. The performance of SOAP is analysed in two ways: percentage of reduction and classification. SOAP has been compared to CFS [4] and ReliefF [8]. The results are generated by C4.5 before and after the application of the algorithms.