Ursa: Scalable Load and Power Management in Cloud Storage Systems

Authors:
Gae-Won You;Seung-Won Hwang;Navendu Jain
Affiliations:
Pohang University of Science and Technology;Pohang University of Science and Technology;Microsoft Research Redmond
Venue:
ACM Transactions on Storage (TOS)
Year:
2013

Citing 19
Cited 0

Introduction to operations research, 4th ed.

Introduction to operations research, 4th ed.
Managing energy and server resources in hosting centers

SOSP '01 Proceedings of the eighteenth ACM symposium on Operating systems principles
The Google file system

SOSP '03 Proceedings of the nineteenth ACM symposium on Operating systems principles
TCP Nice: a mechanism for background transfers

OSDI '02 Proceedings of the 5th symposium on Operating systems design and implementationCopyright restrictions prevent ACM from being able to make the PDFs for this conference available for downloading
A Strategy of Load Balancing in Object Storage System

CIT '05 Proceedings of the The Fifth International Conference on Computer and Information Technology
Quickly finding near-optimal storage designs

ACM Transactions on Computer Systems (TOCS)
MapReduce: simplified data processing on large clusters

OSDI'04 Proceedings of the 6th conference on Symposium on Opearting Systems Design & Implementation - Volume 6
The Case for Energy-Proportional Computing

Computer
Write off-loading: Practical power management for enterprise storage

ACM Transactions on Storage (TOS)
VL2: a scalable and flexible data center network

Proceedings of the ACM SIGCOMM 2009 conference on Data communication
A load balancing framework for clustered storage systems

HiPC'08 Proceedings of the 15th international conference on High performance computing
Analyzing the energy efficiency of a database server

Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
On energy management, load balancing and replication

ACM SIGMOD Record
BASIL: automated IO load balancing across storage devices

FAST'10 Proceedings of the 8th USENIX conference on File and storage technologies
SRCMap: energy proportional storage using dynamic consolidation

FAST'10 Proceedings of the 8th USENIX conference on File and storage technologies
Everest: scaling down peak loads through I/O off-loading

OSDI'08 Proceedings of the 8th USENIX conference on Operating systems design and implementation
Schism: a workload-driven approach to database replication and partitioning

Proceedings of the VLDB Endowment
Zephyr: live migration in shared nothing databases for elastic cloud platforms

Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
Albatross: lightweight elasticity in shared storage databases for the cloud using live data migration

Proceedings of the VLDB Endowment

Quantified Score

Hi-index	0.00

Visualization

Abstract

Enterprise and cloud data centers are comprised of tens of thousands of servers providing petabytes of storage to a large number of users and applications. At such a scale, these storage systems face two key challenges: (1) hot-spots due to the dynamic popularity of stored objects; and (2) high operational costs due to power and cooling. Existing storage solutions, however, are unsuitable to address these challenges because of the large number of servers and data objects. This article describes the design, implementation, and evaluation of Ursa, a system that scales to a large number of storage nodes and objects, and aims to minimize latency and bandwidth costs during system reconfiguration. Toward this goal, Ursa formulates an optimization problem that selects a subset of objects from hot-spot servers and performs topology-aware migration to minimize reconfiguration costs. As exact optimization is computationally expensive, we devise scalable approximation techniques for node selection and efficient divide-and-conquer computation. We also show that the same dynamic reconfiguration techniques can be leveraged to reduce power costs by dynamically migrating data off under-utilized nodes, and powering up servers neighboring existing hot-spots to reduce reconfiguration costs. Our evaluation shows that Ursa achieves cost-effective load management, is time-responsive in computing placement decisions (e.g., about two minutes for 10K nodes and 10M objects), and provides power savings of 15%--37%.