Managing Variability in the IO Performance of Petascale Storage Systems

Authors:
Jay Lofstead;Fang Zheng;Qing Liu;Scott Klasky;Ron Oldfield;Todd Kordenbrock;Karsten Schwan;Matthew Wolf
Venue:
Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis
Year:
2010


Abstract

Achieving peak, or even consistent, performance from IO systems at scale remains a significant challenge. The difficulty stems from sharing IO system resources across the processes of a single large-scale application and/or across multiple simultaneously running programs. This sharing causes internal and external interference, which in turn causes substantial reductions in IO performance. This paper presents measurements of interference effects for two different file systems at multiple supercomputing sites. These measurements motivate a 'managed' IO approach that uses adaptive algorithms to vary the IO system workload based on current levels and areas of use. An implementation of these methods, deployed for the shared, general scratch storage system on Oak Ridge National Laboratory machines, achieves higher overall performance and less variability both in a typical usage environment and with artificially introduced levels of 'noise'. The latter serves to clearly delineate and illustrate the potential problems arising from shared system usage and the advantages derived from actively managing it.