DataStager: scalable data staging services for petascale applications

Authors:
Hasan Abbasi;Matthew Wolf;Greg Eisenhauer;Scott Klasky;Karsten Schwan;Fang Zheng
Affiliations:
Georgia Institute of Technology, Atlanta, GA, USA;Georgia Institute of Technology, Atlanta, GA, USA;Georgia Institute of Technology, Atlanta, GA, USA;Oak Ridge National Laboratory, Oak Ridge, TN, USA;Georgia Institute of Technology, Atlanta, GA, USA;Georgia Institute of Technology, Atlanta, GA, USA
Venue:
Proceedings of the 18th ACM international symposium on High performance distributed computing
Year:
2009

Abstract

Petascale machines face two known challenges: (1) the cost of I/O for high-performance applications can be substantial, especially for output tasks like checkpointing, and (2) noise from I/O actions can inject undesirable delays into the runtimes of such codes on individual compute nodes. This paper introduces the flexible 'DataStager' framework for data staging, along with alternative services within it that jointly address (1) and (2). Data staging services move output data from compute nodes to staging or I/O nodes prior to storage, reducing the I/O overhead on applications' total processing times, while explicit management of data staging reduces perturbation when extracting output data from a petascale machine's compute partition. Experimental evaluations of DataStager on the Cray XT machine at Oak Ridge National Laboratory, using the GTC fusion modeling code and benchmarks running on 1000+ processors, establish both the necessity of intelligent data staging and the high performance of our approach.
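
The general idea of staging output away from the compute path can be illustrated with a small single-process analogy. This is only a sketch and not DataStager's implementation: a background thread stands in for a staging node, a dictionary stands in for the storage system, and the names `staging_worker`, `run_simulation`, and `write_delay` are hypothetical, chosen for illustration.

```python
import queue
import threading
import time


def staging_worker(staged, storage, write_delay):
    """Drain buffers handed off by the compute side and write them to slow storage."""
    while True:
        item = staged.get()
        if item is None:  # sentinel: the compute side has no more output
            break
        step, payload = item
        time.sleep(write_delay)  # stand-in for a slow write to the file system
        storage[step] = payload


def run_simulation(steps, write_delay=0.01):
    """Run a toy compute loop that hands output to a staging thread
    instead of writing it inline. Returns the stored data and the total
    time the compute loop spent handing data off."""
    staged = queue.Queue()
    storage = {}  # assumption: a dict plays the role of the storage system
    worker = threading.Thread(
        target=staging_worker, args=(staged, storage, write_delay)
    )
    worker.start()

    blocked = 0.0
    for step in range(steps):
        payload = bytes([step % 256]) * 1024  # toy "checkpoint" data
        t0 = time.perf_counter()
        staged.put((step, payload))  # hand-off only; the write happens elsewhere
        blocked += time.perf_counter() - t0

    staged.put(None)  # tell the worker to finish
    worker.join()     # wait until all staged output has been written
    return storage, blocked


if __name__ == "__main__":
    storage, blocked = run_simulation(steps=5)
    print(f"steps stored: {sorted(storage)}")
    print(f"time compute loop spent on hand-off: {blocked:.6f} s")
```

In this sketch the compute loop pays only for the hand-off to the queue, while the slow writes proceed in the background. On a real machine the hand-off would cross the network to separate staging nodes, and deciding when to pull data from the compute nodes is where the perturbation described in the abstract has to be managed.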