Managing long-running queries

Authors:
Stefan Krompass;Harumi Kuno;Janet L. Wiener;Kevin Wilkinson;Umeshwar Dayal;Alfons Kemper
Affiliations:
Technische Universität München, Munich, Germany;Hewlett-Packard Laboratories, Palo Alto, CA;Hewlett-Packard Laboratories, Palo Alto, CA;Hewlett-Packard Laboratories, Palo Alto, CA;Hewlett-Packard Laboratories, Palo Alto, CA;Technische Universität München, Munich, Germany
Venue:
Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
Year:
2009

Citing 11
Cited 6

The COMFORT automatic tuning project

Information Systems
Network and cpu co-allocation in high throughput computing environments

Network and cpu co-allocation in high throughput computing environments
Toward a progress indicator for database queries

SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
Estimating progress of execution for SQL queries

SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
Increasing the Accuracy and Coverage of SQL Progress Indicators

ICDE '05 Proceedings of the 21st International Conference on Data Engineering
When can we trust progress estimators for SQL queries?

Proceedings of the 2005 ACM SIGMOD international conference on Management of data
The making of TPC-DS

VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Query suspend and resume

Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Stop-and-restart style execution for long running decision support queries

VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Dynamic workload management for very large data warehouses: juggling feathers and bowling balls

VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Multi-query SQL progress indicators

EDBT'06 Proceedings of the 10th international conference on Advances in Database Technology

A testbed for managing dynamic mixed workloads

Proceedings of the VLDB Endowment
Adaptive query scheduling for mixed database workloads with multiple objectives

Proceedings of the Third International Workshop on Testing Database Systems
Slicing long-running queries

Proceedings of the VLDB Endowment
Predicting completion times of batch query workloads using interaction-aware models and simulation

Proceedings of the 14th International Conference on Extending Database Technology
Managing dynamic mixed workloads for operational business intelligence

DNIS'10 Proceedings of the 6th international conference on Databases in Networked Information Systems
Workload management for big data analytics

Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data

Quantified Score

Hi-index	0.00

Visualization

Abstract

Business Intelligence query workloads that run against very large data warehouses contain queries whose execution times range, sometimes unpredictably, from seconds to hours. The presence of even a handful of long-running queries can significantly slow down a workload consisting of thousands of queries, creating havoc for queries that require a quick response. Long-running queries are a known problem in all commercial database products. However, we have not seen a thorough classification of long-running queries nor a systematic study of the most effective corrective actions. We present here a systematic study of workload management policies, including many implemented by commercial database vendors. Our goal is to enable a system to: (1) recognize long-running queries and categorize them in terms of their impact on performance and (2) determine and take (automatically!) the most effective control actions to remedy the situation. To this end, we identify common workload management scenarios involving long-running queries, and create a taxonomy of long-running queries. We carry out an extensive set of experiments to evaluate different management policies and the relative and absolute thresholds that they may use. We find that in some scenarios, the right combination of policies can reduce the runtime of a workload by a factor of two, but that in other scenarios, any action taken increases runtime. One surprising result was that relative thresholds for execution control can compensate for inaccurate cost estimates, so that Kill&Requeue actions perform as well as Suspend&Resume.