Optimizing complex queries based on similarities of subqueries

  • Authors:
  • Qiang Zhu;Yingying Tao;Calisto Zuzarte

  • Affiliations:
  • Department of Computer and Information Science, The University of Michigan–Dearborn, 48187, Dearborn, MI, USA;Department of Computer and Information Science, The University of Michigan–Dearborn, 48187, Dearborn, MI, USA;IBM Toronto Laboratory, 48187, Markham, Ontario, Canada

  • Venue:
  • Knowledge and Information Systems
  • Year:
  • 2005

Quantified Score

Hi-index 0.01

Visualization

Abstract

As database technology is applied to more and more application domains, user queries are becoming increasingly complex (e.g. involving a large number of joins and a complex query structure). Query optimizers in existing database management systems (DBMS) were not developed for efficiently processing such queries and often suffer from problems such as intolerably long optimization time and poor optimization results. To tackle this challenge, we present a new similarity-based approach to optimizing complex queries in this paper. The key idea is to identify similar subqueries that often appear in a complex query and share the optimization result among similar subqueries in the query. Different levels of similarity for subqueries are introduced. Efficient algorithms to identify similar queries in a given query and optimize the query based on similarity are presented. Related issues, such as choosing good starting nodes in a query graph, evaluating identified similar subqueries and analyzing algorithm complexities, are discussed. Our experimental results demonstrate that the proposed similarity-based approach is quite promising in optimizing complex queries with similar subqueries in a DBMS.