On distributed file tree walk of parallel file systems

Authors:
Jharrod LaFon;Satyajayant Misra;Jon Bringhurst
Affiliations:
New Mexico State University and Los Alamos National Laboratory;New Mexico State University;Los Alamos National Laboratory
Venue:
SC '12 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
Year:
2012

Citing 16
Cited 0

Upper and lower time bounds for parallel random access machines without simultaneous writes

SIAM Journal on Computing
A new distributed algorithm to find breadth first search trees

IEEE Transactions on Information Theory
Parallel depth first search. Part I. implementation

International Journal of Parallel Programming
An O(n2 log n) parallel max-flow algorithm

Journal of Algorithms
The design of a standard message passing interface for distributed memory concurrent computers

Parallel Computing - Special issue: message passing interfaces
The Parallel Evaluation of General Arithmetic Expressions

Journal of the ACM (JACM)
Introduction to algorithms

Introduction to algorithms
The Panasas ActiveScale Storage Cluster: Delivering Scalable High Bandwidth Storage

Proceedings of the 2004 ACM/IEEE conference on Supercomputing
A Scalable Distributed Parallel Breadth-First Search Algorithm on BlueGene/L

SC '05 Proceedings of the 2005 ACM/IEEE conference on Supercomputing
Collective communication: theory, practice, and experience: Research Articles

Concurrency and Computation: Practice & Experience
Ceph: a scalable, high-performance distributed file system

OSDI '06 Proceedings of the 7th symposium on Operating systems design and implementation
MapReduce: simplified data processing on large clusters

Communications of the ACM - 50th anniversary issue: 1958 - 2008
Scheduling multithreaded computations by work stealing

SFCS '94 Proceedings of the 35th Annual Symposium on Foundations of Computer Science
Accelerating CUDA graph algorithms at maximum warp

Proceedings of the 16th ACM symposium on Principles and practice of parallel programming
Modeling and tolerating heterogeneous failures in large parallel systems

Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
Parallel breadth-first search on distributed memory systems

Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis

Quantified Score

Hi-index	0.00

Visualization

Abstract

Supercomputers generate vast amounts of data, typically organized into large directory hierarchies on parallel file systems. While the supercomputing applications are parallel, the tools used to process them requiring complete directory traversal, are typically serial. We present an algorithm framework and three fully distributed algorithms for traversing large parallel file systems, and performing file operations in parallel. The first algorithm introduces a randomized work-stealing scheduler; the second improves the first with proximity-awareness; and the third improves upon the second by using a hybrid approach. We have tested our implementation on Cielo, a 1.37 petaflop supercomputer at the Los Alamos National Laboratory and its 7 petabyte file system. Test results show that our algorithms execute orders of magnitude faster than state-of-the-art algorithms while achieving ideal load balancing and low communication cost. We present performance insights from the use of our algorithms in production systems at LANL, performing daily file system operations.