As HPC applications run at ever higher process counts on ever larger machines, both the frequency of checkpoints needed for fault tolerance [14] and the resolution and size of data analysis dumps are expected to grow accordingly. To maintain an acceptable ratio of time spent on useful computation to time spent on I/O, write bandwidth to the underlying storage system must scale with this growth in checkpoint and output size. Unfortunately, popular scientific self-describing file formats such as netCDF [8] and HDF5 [3] are designed with a focus on portability and flexibility; optimizing write performance with these APIs requires careful crafting of both the output structure and the API calls themselves. To provide sufficient write bandwidth for the demands of scientific applications, the HPC community has developed a number of I/O middleware layers that structure output into write-optimized file formats. The obvious concern with any write-optimized file format, however, is a corresponding penalty on reads. In the log-structured filesystem [13], for example, a file generated by random writes can be written efficiently, but reading it back sequentially later results in very poor performance. Simulation results require efficient read-back for visualization and analytics, and although most checkpoint files are never used, the efficiency of a restart is very important in the face of inevitable failures. The utility of middleware that improves write speed would be greatly diminished if it sacrificed acceptable read performance. In this paper we examine the read performance of two write-optimized middleware layers on large parallel machines and compare it to that of reading data natively in popular file formats.
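To make the tuning burden concrete, the following is a minimal sketch, not taken from the paper's experiments, of what writing one distributed array through parallel HDF5 typically involves. The file name, dataset name, sizes, and alignment value are illustrative assumptions; the property-list settings (MPI-IO driver, alignment, chunking, collective transfers) are exactly the kind of careful crafting referred to above.

/* Hypothetical sketch: each MPI rank writes one contiguous slab of a 1-D
 * dataset through parallel HDF5.  File name, dataset name, and sizes are
 * assumptions for illustration; the property-list tuning is the hand
 * crafting the text describes. */
#include <hdf5.h>
#include <mpi.h>
#include <stdlib.h>

int main(int argc, char **argv)
{
    MPI_Init(&argc, &argv);
    int rank, nprocs;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &nprocs);

    const hsize_t local_n = 1 << 20;            /* elements per process (assumed) */
    hsize_t global_n = local_n * (hsize_t)nprocs;
    double *buf = malloc(local_n * sizeof(double));
    for (hsize_t i = 0; i < local_n; i++)
        buf[i] = (double)rank;                  /* dummy payload */

    /* File access: route I/O through MPI-IO and align objects to an assumed 1 MiB stripe. */
    hid_t fapl = H5Pcreate(H5P_FILE_ACCESS);
    H5Pset_fapl_mpio(fapl, MPI_COMM_WORLD, MPI_INFO_NULL);
    H5Pset_alignment(fapl, 0, 1 << 20);
    hid_t file = H5Fcreate("checkpoint.h5", H5F_ACC_TRUNC, H5P_DEFAULT, fapl);

    /* Dataset creation: chunk the dataset so each process writes whole chunks. */
    hid_t filespace = H5Screate_simple(1, &global_n, NULL);
    hid_t dcpl = H5Pcreate(H5P_DATASET_CREATE);
    H5Pset_chunk(dcpl, 1, &local_n);
    hid_t dset = H5Dcreate2(file, "field", H5T_NATIVE_DOUBLE, filespace,
                            H5P_DEFAULT, dcpl, H5P_DEFAULT);

    /* Each rank selects its contiguous slab of the file space. */
    hsize_t start = (hsize_t)rank * local_n;
    H5Sselect_hyperslab(filespace, H5S_SELECT_SET, &start, NULL, &local_n, NULL);
    hid_t memspace = H5Screate_simple(1, &local_n, NULL);

    /* Transfer property: collective MPI-IO, usually needed for good write bandwidth. */
    hid_t dxpl = H5Pcreate(H5P_DATASET_XFER);
    H5Pset_dxpl_mpio(dxpl, H5FD_MPIO_COLLECTIVE);
    H5Dwrite(dset, H5T_NATIVE_DOUBLE, memspace, filespace, dxpl, buf);

    H5Pclose(dxpl); H5Sclose(memspace); H5Dclose(dset);
    H5Pclose(dcpl); H5Sclose(filespace); H5Pclose(fapl); H5Fclose(file);
    free(buf);
    MPI_Finalize();
    return 0;
}

The write-optimized middleware layers examined in this paper take on much of this tuning themselves by reorganizing output into layouts that favor the writer, which is precisely why their read-back behavior deserves the scrutiny given here.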