I/O acceleration with pattern detection

Authors:
Jun He;John Bent;Aaron Torres;Gary Grider;Garth Gibson;Carlos Maltzahn;Xian-He Sun
Affiliations:
University of Wisconsin, Madison, Madison, WI, USA;EMC, Hopkinton, MA, USA;Los Alamos National Laboratory, Los Alamos, NM, USA;Los Alamos National Laboratory, Los Alamos, NM, USA;Carnegie Mellon University and Panasas, Pittsburgh, USA;University of California, Santa Cruz, Santa Cruz, CA, USA;Illinois Institute of Technology, Chicago, IL, USA
Venue:
Proceedings of the 22nd international symposium on High-performance parallel and distributed computing
Year:
2013

Citing 25
Cited 1

A static analysis of I/O characteristics of scientific applications in a production workload

Proceedings of the 1993 ACM/IEEE conference on Supercomputing
Informed prefetching and caching

SOSP '95 Proceedings of the fifteenth ACM symposium on Operating systems principles
Lessons from characterizating the input/output behavior of parallel scientific applications

Performance Evaluation - Special issue on tools for performance evaluation
On implementing MPI-IO portably and with high performance

Proceedings of the sixth workshop on I/O in parallel and distributed systems
Information and control in gray-box systems

SOSP '01 Proceedings of the eighteenth ACM symposium on Operating systems principles
Markov model prediction of I/O requests for scientific applications

ICS '02 Proceedings of the 16th international conference on Supercomputing
Dynamic I/O characterization of I/O intensive scientific applications

Proceedings of the 1994 ACM/IEEE conference on Supercomputing
Learning to Classify Parallel Input/Output Access Patterns

IEEE Transactions on Parallel and Distributed Systems
Iteration aware prefetching for large multidimensional datasets

SSDBM'2005 Proceedings of the 17th international conference on Scientific and statistical database management
Why does file system prefetching work?

ATEC '99 Proceedings of the annual conference on USENIX Annual Technical Conference
Flexible IO and integration for scientific codes through the adaptable IO system (ADIOS)

CLADE '08 Proceedings of the 6th international workshop on Challenges of large applications in distributed environments
Parallel I/O prefetching using MPI file caching and I/O signatures

Proceedings of the 2008 ACM/IEEE conference on Supercomputing
BORG: block-reORGanization for self-optimizing storage systems

FAST '09 Proccedings of the 7th conference on File and storage technologies
ScalaTrace: Scalable compression and replay of communication traces for high-performance computing

Journal of Parallel and Distributed Computing
PLFS: a checkpoint filesystem for parallel applications

Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis
...and eat it too: high read performance in write-optimized HPC I/O middleware file formats

Proceedings of the 4th Annual Workshop on Petascale Data Storage
Scalable I/O tracing and analysis

Proceedings of the 4th Annual Workshop on Petascale Data Storage
Reducing seek overhead with application-directed prefetching

USENIX'09 Proceedings of the 2009 conference on USENIX Annual technical conference
Understanding and Improving Computational Science Storage Access through Continuous Characterization

ACM Transactions on Storage (TOS)
SciHadoop: array-based query processing in Hadoop

Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
Layout-aware scientific computing: a case study using MILC

Proceedings of the second workshop on Scalable algorithms for large-scale systems
Pattern-aware file reorganization in MPI-IO

Proceedings of the sixth workshop on Parallel Data Storage
A universal algorithm for sequential data compression

IEEE Transactions on Information Theory
The Power and Challenges of Transformative I/O

CLUSTER '12 Proceedings of the 2012 IEEE International Conference on Cluster Computing
KNOWAC: I/O Prefetch via Accumulated Knowledge

CLUSTER '12 Proceedings of the 2012 IEEE International Conference on Cluster Computing

Structuring PLFS for extensibility

PDSW '13 Proceedings of the 8th Parallel Data Storage Workshop

Quantified Score

Hi-index	0.00

Visualization

Abstract

The I/O bottleneck in high-performance computing is becoming worse as application data continues to grow. In this work, we explore how patterns of I/O within these applications can significantly affect the effectiveness of the underlying storage systems and how these same patterns can be utilized to improve many aspects of the I/O stack and mitigate the I/O bottleneck. We offer three main contributions in this paper. First, we develop and evaluate algorithms by which I/O patterns can be efficiently discovered and described. Second, we implement one such algorithm to reduce the metadata quantity in a virtual parallel file system by up to several orders of magnitude, thereby increasing the performance of writes and reads by up to 40 and 480 percent respectively. Third, we build a prototype file system with pattern-aware prefetching and evaluate it to show a 46 percent reduction in I/O latency. Finally, we believe that efficient pattern discovery and description, coupled with the observed predictability of complex patterns within many high-performance applications, offers significant potential to enable many additional I/O optimizations.