Analysis of HDFS under HBase: a facebook messages case study

Authors:
Tyler Harter;Dhruba Borthakur;Siying Dong;Amitanand Aiyer;Liyin Tang;Andrea C. Arpaci-Dusseau;Remzi H. Arpaci-Dusseau
Affiliations:
University of Wisconsin, Madison;Facebook Inc.;Facebook Inc.;Facebook Inc.;Facebook Inc.;University of Wisconsin, Madison;University of Wisconsin, Madison
Venue:
FAST'14 Proceedings of the 12th USENIX conference on File and Storage Technologies
Year:
2014

Citing 20
Cited 0

Measurements of a distributed file system

SOSP '91 Proceedings of the thirteenth ACM symposium on Operating systems principles
Petal: distributed virtual disks

Proceedings of the seventh international conference on Architectural support for programming languages and operating systems
Frangipani: a scalable distributed file system

Proceedings of the sixteenth ACM symposium on Operating systems principles
File system usage in Windows NT 4.0

Proceedings of the seventeenth ACM symposium on Operating systems principles
A trace-driven analysis of the UNIX 4.2 BSD file system

Proceedings of the tenth ACM symposium on Operating systems principles
End-to-end arguments in system design

ACM Transactions on Computer Systems (TOCS)
The structure of the “THE”-multiprogramming system

Communications of the ACM
Hints for computer system design

SOSP '83 Proceedings of the ninth ACM symposium on Operating systems principles
The Google file system

SOSP '03 Proceedings of the nineteenth ACM symposium on Operating systems principles
Journal-guided resynchronization for software RAID

FAST'05 Proceedings of the 4th conference on USENIX Conference on File and Storage Technologies - Volume 4
Bigtable: a distributed storage system for structured data

OSDI '06 Proceedings of the 7th symposium on Operating systems design and implementation
The Hadoop Distributed File System

MSST '10 Proceedings of the 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST)
GreenHDFS: towards an energy-conserving, storage-efficient, hybrid Hadoop compute cluster

HotPower'10 Proceedings of the 2010 international conference on Power aware computing and systems
Availability in globally distributed storage systems

OSDI'10 Proceedings of the 9th USENIX conference on Operating systems design and implementation
Apache hadoop goes realtime at Facebook

Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
A file is not a file: understanding the I/O behavior of Apple desktop applications

SOSP '11 Proceedings of the Twenty-Third ACM Symposium on Operating Systems Principles
Resilient distributed datasets: a fault-tolerant abstraction for in-memory cluster computing

NSDI'12 Proceedings of the 9th USENIX conference on Networked Systems Design and Implementation
Interactive analytical processing in big data systems: a cross-industry study of MapReduce workloads

Proceedings of the VLDB Endowment
Robustness in the Salus scalable block store

nsdi'13 Proceedings of the 10th USENIX conference on Networked Systems Design and Implementation
An analysis of Facebook photo caching

Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles

Quantified Score

Hi-index	0.00

Visualization

Abstract

We present a multilayer study of the Facebook Messages stack, which is based on HBase and HDFS. We collect and analyze HDFS traces to identify potential improvements, which we then evaluate via simulation. Messages represents a new HDFS workload: whereas HDFS was built to store very large files and receive mostly-sequential I/O, 90% of files are smaller than 15MB and I/O is highly random. We find hot data is too large to easily fit in RAM and cold data is too large to easily fit in flash; however, cost simulations show that adding a small flash tier improves performance more than equivalent spending on RAM or disks. HBase's layered design offers simplicity, but at the cost of performance; our simulations show that network I/O can be halved if compaction bypasses the replication layer. Finally, although Messages is read-dominated, several features of the stack (i.e., logging, compaction, replication, and caching) amplify write I/O, causing writes to dominate disk I/O.