AQUAGP: approximate QUery answers using genetic programming

  • Authors:
  • Jason B. Peltzer;Ankur M. Teredesai;Garrett Reinard

  • Affiliations:
  • Department of Computer Science, Rochester Institute of Technology (RIT), Rochester, NY;Department of Computer Science, Rochester Institute of Technology (RIT), Rochester, NY;Department of Computer Science, Rochester Institute of Technology (RIT), Rochester, NY

  • Venue:
  • EuroGP'06 Proceedings of the 9th European conference on Genetic Programming
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Speed, cost, and accuracy are crucial performance parameters while evaluating the quality of information and query retrieval within any Database Management System. For some queries it may be possible to derive a similar result set using an approximate query answering algorithm or tool when the perfect/exact results are not required. Query approximation becomes useful when the following conditions are true: (a) a high percentage of the relevant data is retrieved correctly, (b) irrelevant or extra data is minimized, and (c) an approximate answer (if available) results in significant (notable) savings in terms of the overall query cost and retrieval time. In this paper we discuss a novel approach for approximate query answering using Genetic Programming (GP) paradigms. We have developed an evolutionary computing based query space exploration framework which, given an input query and the database schema, uses tree-based GP to generate and evaluate approximate query candidates, automatically. We highlight and discuss various avenues of exploration and evaluate the success of our experiments based on the speed, cost, and accuracy of the results retrieved by the re-formulated (GP generated) queries and present the results on a variety of query types for TPC-benchmark and PKDD-benchmark datasets.