BTRFS: The Linux B-Tree Filesystem

Authors:
Ohad Rodeh;Josef Bacik;Chris Mason
Affiliations:
IBM;FusionIO;FusionIO
Venue:
ACM Transactions on Storage (TOS)
Year:
2013

Citing 11
Cited 2

A case for redundant arrays of inexpensive disks (RAID)

SIGMOD '88 Proceedings of the 1988 ACM SIGMOD international conference on Management of data
The design and implementation of a log-structured file system

ACM Transactions on Computer Systems (TOCS)
The log-structured merge-tree (LSM-tree)

Acta Informatica
Ubiquitous B-Tree

ACM Computing Surveys (CSUR)
File system design for an NFS file server appliance

WTEC'94 Proceedings of the USENIX Winter 1994 Technical Conference on USENIX Winter 1994 Technical Conference
Bigtable: a distributed storage system for structured data

OSDI '06 Proceedings of the 7th USENIX Symposium on Operating Systems Design and Implementation - Volume 7
Scalability in the XFS file system

ATEC '96 Proceedings of the 1996 annual conference on USENIX Annual Technical Conference
B-trees, shadowing, and clones

ACM Transactions on Storage (TOS)
FlexVol: flexible, efficient file volume virtualization in WAFL

ATC'08 USENIX 2008 Annual Technical Conference on Annual Technical Conference
Tracking back references in a write-anywhere file system

FAST'10 Proceedings of the 8th USENIX conference on File and storage technologies
bLSM: a general purpose log structured merge tree

SIGMOD '12 Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data

ViewBox: integrating local file systems with cloud storage services

FAST'14 Proceedings of the 12th USENIX conference on File and Storage Technologies
Checking the integrity of transactional mechanisms

FAST'14 Proceedings of the 12th USENIX conference on File and Storage Technologies

Quantified Score

Hi-index	0.00

Visualization

Abstract

BTRFS is a Linux filesystem that has been adopted as the default filesystem in some popular versions of Linux. It is based on copy-on-write, allowing for efficient snapshots and clones. It uses B-trees as its main on-disk data structure. The design goal is to work well for many use cases and workloads. To this end, much effort has been directed to maintaining even performance as the filesystem ages, rather than trying to support a particular narrow benchmark use-case. Linux filesystems are installed on smartphones as well as enterprise servers. This entails challenges on many different fronts. ---Scalability. The filesystem must scale in many dimensions: disk space, memory, and CPUs. ---Data integrity. Losing data is not an option, and much effort is expended to safeguard the content. This includes checksums, metadata duplication, and RAID support built into the filesystem. ---Disk diversity. The system should work well with SSDs and hard disks. It is also expected to be able to use an array of different sized disks, which poses challenges to the RAID and striping mechanisms. This article describes the core ideas, data structures, and algorithms of this filesystem. It sheds light on the challenges posed by defragmentation in the presence of snapshots, and the tradeoffs required to maintain even performance in the face of a wide spectrum of workloads.