Efficient and extensible algorithms for multi query optimization
SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Multiple query optimization in middleware using query teamwork
Software—Practice & Experience
Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)
Storage workload estimation for database management systems
Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Detecting performance anomalies in global applications
WORLDS'05 Proceedings of the 2nd conference on Real, Large Distributed Systems - Volume 2
Adaptive Learning of Metric Correlations for Temperature-Aware Database Provisioning
ICAC '07 Proceedings of the Fourth International Conference on Autonomic Computing
A Regression-Based Analytic Model for Dynamic Resource Provisioning of Multi-Tier Applications
ICAC '07 Proceedings of the Fourth International Conference on Autonomic Computing
Exploiting nonstationarity for performance prediction
Proceedings of the 2nd ACM SIGOPS/EuroSys European Conference on Computer Systems 2007
BI batch manager: a system for managing batch workloads on enterprise data-warehouses
EDBT '08 Proceedings of the 11th international conference on Extending database technology: Advances in database technology
Automatic virtual machine configuration for database workloads
Proceedings of the 2008 ACM SIGMOD international conference on Management of data
PQR: Predicting Query Execution Times for Autonomous Workload Management
ICAC '08 Proceedings of the 2008 International Conference on Autonomic Computing
Modeling and exploiting query interactions in database systems
Proceedings of the 17th ACM conference on Information and knowledge management
Autonomic Provisioning of Backend Databases in Dynamic Content Web Servers
ICAC '06 Proceedings of the 2006 IEEE International Conference on Autonomic Computing
Managing operational business intelligence workloads
ACM SIGOPS Operating Systems Review
Communications of the ACM - A Direct Path to Dependable Software
Proceedings of the ACM/IFIP/USENIX 2007 International Conference on Middleware
Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
Self-Tuning for SQL Performance in Oracle Database 11g
ICDE '09 Proceedings of the 2009 IEEE International Conference on Data Engineering
Predicting Multiple Metrics for Queries: Better Decisions Enabled by Machine Learning
ICDE '09 Proceedings of the 2009 IEEE International Conference on Data Engineering
Query interactions in database workloads
Proceedings of the Second International Workshop on Testing Database Systems
A scalable, predictable join operator for highly concurrent data warehouses
Proceedings of the VLDB Endowment
Tuning database configuration parameters with iTuned
Proceedings of the VLDB Endowment
Automated experiment-driven management of (database) systems
HotOS'09 Proceedings of the 12th conference on Hot topics in operating systems
JustRunIt: experiment-based management of virtualized data centers
USENIX'09 Proceedings of the 2009 conference on USENIX Annual technical conference
Interaction-aware scheduling of report-generation workloads
The VLDB Journal — The International Journal on Very Large Data Bases
Predicting system performance for multi-tenant database workloads
Proceedings of the Fourth International Workshop on Testing Database Systems
A bayesian approach to online performance modeling for database appliances using gaussian models
Proceedings of the 8th ACM international conference on Autonomic computing
Robust estimation of resource consumption for SQL queries using statistical techniques
Proceedings of the VLDB Endowment
Parallel analytics as a service
Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
Performance and resource modeling in highly-concurrent OLTP workloads
Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
Workload management for big data analytics
Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
Towards building performance models for data-intensive workloads in public clouds
Proceedings of the 4th ACM/SPEC International Conference on Performance Engineering
Provisioning data analytic workloads in a cloud
Future Generation Computer Systems
Towards predicting query execution time for concurrent and dynamic database workloads
Proceedings of the VLDB Endowment
Hi-index | 0.00 |
A question that database administrators (DBAs) routinely need to answer is how long a batch query workload will take to complete. This question arises, for example, while planning the execution of different report-generation workloads to fit within available time windows. To answer this question accurately, we need to take into account that the typical workload in a database system consists of mixes of concurrent queries. Interactions among different queries in these mixes need to be modeled, rather than the conventional approach of considering each query separately. This paper presents a new approach for estimating workload completion times that takes the significant impact of query interactions into account. This approach builds performance models using an experiment-driven technique, by sampling the space of possible query mixes and fitting statistical models to the observed performance at these samples. No prior assumptions are made about the internal workings of the database system or the cause of query interactions, making the models robust and portable. We show that a careful choice of sampling and statistical modeling strategies can result in accurate models, and we present a novel interaction-aware workload simulator that uses these models to estimate workload completion times. An experimental evaluation with complex TPC-H queries on IBM DB2 shows that this approach consistently predicts workload completion times with less than 20% error.