Efficient execution of multiple queries on deep memory hierarchy

  • Authors:
  • Yan Zhang;Zhi-Feng Chen;Yuan-Yuan Zhou

  • Affiliations:
  • National Laboratory on Machine Perception, Peking University, Beijing, China;Google Inc., Mountain View, CA;Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL

  • Venue:
  • Journal of Computer Science and Technology
  • Year:
  • 2007

Quantified Score

Hi-index 0.01

Visualization

Abstract

This paper proposes a complementary novel idea, called MiniTasking to further reduce the number of cache misses by improving the data temporal locality for multiple concurrent queries. Our idea is based on the observation that, in many workloads such as decision support systems (DSS), there is usually significant amount of data sharing among different concurrent queries. MiniTasking exploits such data sharing to improve data temporal locality by scheduling query execution at three levels: query level batching, operator level grouping and mini-task level scheduling. The experimental results with various types of concurrent TPC-H query workloads show that, with the traditional N-ary Storage Model (NSM) layout MiniTasking significantly reduces the L2 cache misses by up to 83%, and thereby achieves 24% reduction in execution time. With the Partition Attributes Across (PAX) layout, MiniTasking further reduces the cache misses by 65% and the execution time by 9%. For the TPC-H throughput test workload, MiniTasking improves the end performance up to 20%.