Fast crash recovery in RAMCloud

Authors:
Diego Ongaro;Stephen M. Rumble;Ryan Stutsman;John Ousterhout;Mendel Rosenblum
Affiliations:
Stanford University;Stanford University;Stanford University;Stanford University;Stanford University
Venue:
SOSP '11 Proceedings of the Twenty-Third ACM Symposium on Operating Systems Principles
Year:
2011

Abstract

RAMCloud is a DRAM-based storage system that provides inexpensive durability and availability by recovering quickly after crashes, rather than storing replicas in DRAM. RAMCloud scatters backup data across hundreds or thousands of disks, and it harnesses hundreds of servers in parallel to reconstruct lost data. The system uses a log-structured approach for all its data, in DRAM as well as on disk: this provides high performance both during normal operation and during recovery. RAMCloud employs randomized techniques to manage the system in a scalable and decentralized fashion. In a 60-node cluster, RAMCloud recovers 35 GB of data from a failed server in 1.6 seconds. Our measurements suggest that the approach will scale to recover larger memory sizes (64 GB or more) in less time with larger clusters.
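
To put the reported figures in perspective, the following back-of-the-envelope sketch computes the aggregate recovery throughput implied by 35 GB in 1.6 seconds, and a rough per-node average for a 60-node cluster. The function names are hypothetical, 1 GB is taken as 1000 MB, and the even split across all 60 nodes is an assumption made for illustration only; the abstract does not say how the work is divided, so the per-node number is an average, not a measurement.

```python
def aggregate_throughput_gb_per_s(data_gb: float, seconds: float) -> float:
    """Aggregate recovery throughput in GB/s for the whole cluster."""
    return data_gb / seconds


def per_node_throughput_mb_per_s(data_gb: float, seconds: float, nodes: int) -> float:
    """Average per-node throughput in MB/s.

    Assumes (for illustration only) that recovery work is spread evenly
    over all nodes and that 1 GB = 1000 MB.
    """
    return aggregate_throughput_gb_per_s(data_gb, seconds) * 1000.0 / nodes


if __name__ == "__main__":
    # Figures taken from the abstract: 35 GB recovered in 1.6 s on 60 nodes.
    total = aggregate_throughput_gb_per_s(35.0, 1.6)
    per_node = per_node_throughput_mb_per_s(35.0, 1.6, 60)
    print(f"aggregate: {total:.3f} GB/s")
    print(f"per node (even split assumed): {per_node:.1f} MB/s")
```

Under these assumptions the cluster as a whole moves roughly 22 GB/s during recovery, which is far beyond what a single disk or network link could supply and illustrates why the system relies on many disks and servers working in parallel.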