Characterizing tenant behavior for placement and crisis mitigation in multitenant DBMSs

Authors:
Aaron J. Elmore;Sudipto Das;Alexander Pucher;Divyakant Agrawal;Amr El Abbadi;Xifeng Yan
Affiliations:
UC Santa Barbara, Santa Barbara, CA, USA;Microsoft Research, Redmond, WA, USA;UC Santa Barbara, Santa Barbara, CA, USA;UC Santa Barbara, Santa Barbara, CA, USA;UC Santa Barbara, Santa Barbara, CA, USA;UC Santa Barbara, Santa Barbara, CA, USA
Venue:
Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
Year:
2013

Abstract

A multitenant database management system (DBMS) in the cloud must continuously monitor the trade-off between efficient resource sharing among multiple application databases (tenants) and their performance. Considering the scale of hundreds to thousands of tenants in such multitenant DBMSs, manual approaches for continuous monitoring are not tenable. A self-managing controller of a multitenant DBMS faces several challenges: how to characterize a tenant given its variety of workloads, how to reduce the impact of tenant colocation, and how to detect and mitigate a performance crisis in which one or more tenants' desired service level objective (SLO) is not achieved. We present Delphi, a self-managing system controller for a multitenant DBMS, and Pythia, a technique to learn behavior through observation and supervision using DBMS-agnostic database-level performance measures. Pythia accurately learns tenant behavior even when multiple tenants share a database process, learns good and bad tenant consolidation plans (or packings), and maintains a per-tenant history to detect behavior changes. Delphi detects performance crises and leverages Pythia to suggest remedial actions, using a hill-climbing search algorithm to identify a new tenant placement strategy that mitigates SLO violations. Our evaluation using a variety of tenant types and workloads shows that Pythia can learn a tenant's behavior with more than 92% accuracy and learn the quality of packings with more than 86% accuracy. During a performance crisis, Delphi is able to reduce 99th percentile latencies by 80%, and can consolidate 45% more tenants than a greedy baseline, which balances tenant load without modeling tenant behavior.
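
The hill-climbing placement search mentioned in the abstract can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the paper: each tenant is reduced to a single scalar load, each server has a fixed capacity, and the hypothetical `overload` function stands in for the learned packing quality that Pythia would supply. The names `overload` and `hill_climb` are likewise hypothetical.

```python
from typing import Dict, List, Tuple


def overload(placement: Dict[str, int],
             load: Dict[str, float],
             capacity: List[float]) -> float:
    """Total load above capacity, summed over servers (0 means no overload).

    Assumption: a stand-in cost function, not the paper's learned model.
    """
    totals = [0.0] * len(capacity)
    for tenant, server in placement.items():
        totals[server] += load[tenant]
    return sum(max(0.0, t - c) for t, c in zip(totals, capacity))


def hill_climb(placement: Dict[str, int],
               load: Dict[str, float],
               capacity: List[float],
               max_steps: int = 100) -> Tuple[Dict[str, int], float]:
    """Repeatedly apply the single-tenant move that lowers the cost the most.

    Stops at a local optimum (no single move improves the cost) or after
    max_steps moves. The input placement is not modified.
    """
    current = dict(placement)
    cost = overload(current, load, capacity)
    for _ in range(max_steps):
        best_move, best_cost = None, cost
        for tenant, src in list(current.items()):
            for dst in range(len(capacity)):
                if dst == src:
                    continue
                current[tenant] = dst          # try the move
                candidate = overload(current, load, capacity)
                current[tenant] = src          # undo it
                if candidate < best_cost:
                    best_move, best_cost = (tenant, dst), candidate
        if best_move is None:                  # local optimum reached
            break
        current[best_move[0]] = best_move[1]
        cost = best_cost
    return current, cost


# Hypothetical example: three tenants packed on server 0, server 1 idle.
tenant_load = {"a": 5.0, "b": 4.0, "c": 3.0}
start = {"a": 0, "b": 0, "c": 0}
new_placement, remaining = hill_climb(start, tenant_load, [8.0, 8.0])
```

In this toy setting a single move (tenant `a` to server 1) removes the overload. Hill climbing only guarantees a local optimum, so a real controller would depend heavily on the quality of the cost model that scores each candidate packing.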