Rough set based approaches to feature selection for Case-Based Reasoning classifiers

  • Authors:
  • Maria Salamó;Maite López-Sánchez

  • Affiliations:
  • Dept. de Matemàtica Aplicada i Anàlisi, Universitat de Barcelona, Gran Via de les Corts Catalanes, 585-08007 Barcelona, Spain;Dept. de Matemàtica Aplicada i Anàlisi, Universitat de Barcelona, Gran Via de les Corts Catalanes, 585-08007 Barcelona, Spain

  • Venue:
  • Pattern Recognition Letters
  • Year:
  • 2011

Abstract

This paper investigates feature selection based on rough sets for dimensionality reduction in Case-Based Reasoning classifiers. To be useful, Case-Based Reasoning systems should be able to manage imprecise, uncertain and redundant data so that they can retrieve the most relevant information from a potentially overwhelming quantity of data. Rough Set Theory has been shown to be an effective tool for data mining and for uncertainty management. This paper makes two central contributions: (1) it develops three strategies for feature selection, and (2) it proposes several measures for estimating attribute relevance based on Rough Set Theory. Although we concentrate on Case-Based Reasoning classifiers, the proposals are general enough to be applicable to a wide range of learning algorithms. We applied these proposals to twenty data sets from the UCI repository and examined the impact of feature selection on classification performance. Our evaluation shows that all three proposals improve on the basic Case-Based Reasoning system. They are also robust in comparison with well-known feature selection strategies.
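The abstract does not spell out the relevance measures it proposes, so the following is only an illustrative sketch of one standard rough-set quantity, the dependency degree (the fraction of cases whose class is fully determined by a feature subset), and of a simple relevance score derived from it. The function names `dependency_degree` and `relevance` and the toy data are hypothetical and are not taken from the paper.

```python
from collections import defaultdict


def dependency_degree(cases, labels, features):
    """Fraction of cases lying in the rough-set positive region.

    Cases with identical values on `features` are indiscernible and form
    one equivalence class. A class counts towards the positive region
    only when all of its cases share the same label.
    """
    classes = defaultdict(list)
    for case, label in zip(cases, labels):
        key = tuple(case[f] for f in features)
        classes[key].append(label)
    consistent = sum(len(ls) for ls in classes.values() if len(set(ls)) == 1)
    return consistent / len(cases)


def relevance(cases, labels, all_features):
    """Drop in dependency degree when each feature is removed in turn.

    This is an assumed, illustrative relevance score, not the paper's.
    """
    full = dependency_degree(cases, labels, all_features)
    scores = {}
    for f in all_features:
        rest = [g for g in all_features if g != f]
        scores[f] = full - dependency_degree(cases, labels, rest)
    return scores


# Toy case base: feature 0 determines the label, feature 1 is noise,
# feature 2 is constant.
cases = [(0, 0, 5), (0, 1, 5), (1, 0, 5), (1, 1, 5)]
labels = ["a", "a", "b", "b"]
scores = relevance(cases, labels, [0, 1, 2])
print(scores)
```

Under this score, a feature whose removal leaves the dependency degree unchanged is a candidate for elimination before case retrieval; a selection strategy could then, for instance, keep only features with a score above some threshold.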