Finding Reducts Without Building the Discernibility Matrix

  • Authors:
  • Marcin Korze;Szymon Jaroszewicz

  • Affiliations:
  • Technical University of Szczecin, Poland;Technical University of Szczecin, Poland

  • Venue:
  • ISDA '05 Proceedings of the 5th International Conference on Intelligent Systems Design and Applications
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present algorithms for fast generation of short reducts which avoid building the discernibility matrix explicitly. We show how information obtained from this matrix can be obtained based only on the distributions of attribute values. Since the size of discernibility matrix is quadratic in the number of data records, not building the matrix explicitly gives a very significant speedup and makes it possible to find reducts even in very large databases. Algorithms are given for both absolute and relative reducts. Experiments show that our approach outperforms other reduct finding algorithms. Furthermore it is shown that many heuristic reduct finding algorithms using the discernibility matrix in fact select attributes based on their Gini index. A new definition of conditional Gini index is presented, motivated by reduct finding heuristics.