A soft computing approach for data mining based query processing using rough sets and genetic algorithms

  • Authors:
  • K. G. Srinivasa;K. R. Venugopal;L. M. Patnaik

  • Affiliations:
  • (Correspd. E-mail: kgsrinivas@msrit.edu) Department of Computer Science and Engineering, Data Mining Laboratory, M S Ramaiah Institute of Technology, Bangalore - 560054, India;University Visvesvaraya College of Engineering, Bangalore - 560001, India;Microprocessor Applications Laboratory, Indian Institute of Science, Bangalore, India

  • Venue:
  • International Journal of Hybrid Intelligent Systems
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

The optimization of queries is critical in database management systems and the complexity involved in finding optimal solutions has led to the development of heuristic approaches. Answering data mining query involves a random search over large databases. Due to the enormity of the data set involved, model simplification is necessary for quick answering of data mining queries. In this paper, we propose a hybrid model using rough sets and genetic algorithms for fast and efficient query answering. Rough sets are used to classify and summarize the datasets, whereas genetic algorithms are used for answering association related queries and feedback for adaptive classification. Here, we consider three types of queries, i.e., select, aggregate and classification based data mining queries. Summary tables that are built using rough sets and analytical model of attributes are used to speed up select queries. Mining associations, building concept hierarchies and reinforcement of reducts are achieved through genetic algorithms. The experiments are conducted on three real-life data sets, which include KDD 99 Cup data, Forest Cover-type data and Iris data. The performance of the proposed algorithm is analyzed for both execution time and classification accuracy and the results obtained are good.