Dynamic partitioning of the cache hierarchy in shared data centers

Authors:
Gokul Soundararajan;Jin Chen;Mohamed A. Sharaf;Cristiana Amza
Affiliations:
University of Toronto;University of Toronto;University of Toronto;University of Toronto
Venue:
Proceedings of the VLDB Endowment
Year:
2008



Abstract

To reduce the management costs of large data centers, operators multiplex several concurrent database applications on a server farm connected to shared network-attached storage. Determining and enforcing per-application resource quotas in the resulting cache hierarchy, on the fly, poses a complex resource allocation problem spanning the database server and storage server tiers. The problem is further complicated by the need to provide strict Quality of Service (QoS) guarantees to hosted applications. In this paper, we design and implement a novel technique for coordinated partitioning of the database buffer pool and the storage cache among applications, for any given cache replacement policy and per-application access pattern. We use statistical regression to dynamically determine the mapping between cache quota settings and the resulting per-application QoS. A resource controller embedded within the database engine actuates the partitioning of the two-level cache, converging, based on a set of latency sample points, towards the configuration with maximum application utility, expressed as the service provider's revenue in that configuration. Our experimental evaluation, using the MySQL database engine, a server farm with consolidated storage, and two e-commerce benchmarks, shows that our technique is effective both in enforcing application QoS and in maximizing the revenue of the service provider in shared server farms.
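
The regression-then-partition idea in the abstract can be illustrated with a small sketch. This is not the paper's implementation. It assumes a single cache level shared by two applications, a hypothetical latency model of the form latency = a + b / quota fitted by ordinary least squares, a step-function revenue (a reward when the latency goal is met, a penalty otherwise), and an exhaustive search in place of the controller's convergence procedure. All function names, sample points, and latency goals below are made up for illustration.

```python
def fit_inverse_model(samples):
    """Least-squares fit of latency = a + b / quota from (quota, latency) pairs.

    The inverse form is an assumption of this sketch, not the paper's model.
    """
    xs = [1.0 / quota for quota, _ in samples]
    ys = [latency for _, latency in samples]
    n = len(samples)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    b = sxy / sxx
    a = mean_y - b * mean_x
    return lambda quota: a + b / quota


def revenue(latency, goal, reward=1.0, penalty=1.0):
    """Hypothetical utility: earn `reward` if the latency goal is met, else pay `penalty`."""
    return reward if latency <= goal else -penalty


def best_partition(models, goals, total, step):
    """Try every split of `total` cache units between two applications.

    Returns (utility, (quota_a, quota_b)) for the first split with the
    highest predicted total revenue.
    """
    best = None
    for quota_a in range(step, total, step):
        quotas = (quota_a, total - quota_a)
        utility = sum(
            revenue(model(quota), goal)
            for model, quota, goal in zip(models, quotas, goals)
        )
        if best is None or utility > best[0]:
            best = (utility, quotas)
    return best


# Made-up latency sample points: (cache quota in pages, latency in ms).
SAMPLES_A = [(100, 20.0), (200, 15.0), (500, 12.0)]
SAMPLES_B = [(100, 45.0), (200, 25.0), (400, 15.0), (800, 10.0)]

MODELS = [fit_inverse_model(SAMPLES_A), fit_inverse_model(SAMPLES_B)]
GOALS_MS = [16.0, 12.0]  # hypothetical per-application latency goals

BEST = best_partition(MODELS, GOALS_MS, total=1000, step=100)
```

Here `BEST` holds the predicted utility and the chosen quota pair. Extending the sketch to the two-level case would mean fitting latency as a function of both the buffer pool quota and the storage cache quota, and searching over pairs of splits.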