Input/output behavior of supercomputing applications
Proceedings of the 1991 ACM/IEEE conference on Supercomputing
Disk I/O in high-performance computing systems
Disk I/O in high-performance computing systems
File archive activity in a supercomputing environment
ICS '93 Proceedings of the 7th international conference on Supercomputing
A static analysis of I/O characteristics of scientific applications in a production workload
Proceedings of the 1993 ACM/IEEE conference on Supercomputing
Design and Evaluation of primitives for Parallel I/O
Proceedings of the 1993 ACM/IEEE conference on Supercomputing
Parallel access to files in the Vesta file system
Proceedings of the 1993 ACM/IEEE conference on Supercomputing
RAID: high-performance, reliable secondary storage
ACM Computing Surveys (CSUR)
Performance of the CM-5 scalable file system
ICS '94 Proceedings of the 8th international conference on Supercomputing
Experimental analysis of parallel systems: techniques and open problems
Proceedings of the 7th international conference on Computer performance evaluation : modelling techniques and tools: modelling techniques and tools
Remote interactive visualization and analysis (RIVA) using parallel supercomputers
PRS '95 Proceedings of the IEEE symposium on Parallel rendering
PPFS: a high performance portable parallel file system
ICS '95 Proceedings of the 9th international conference on Supercomputing
A trace-driven analysis of the UNIX 4.2 BSD file system
Proceedings of the tenth ACM symposium on Operating systems principles
Designing and Building Parallel Programs: Concepts and Tools for Parallel Software Engineering
Designing and Building Parallel Programs: Concepts and Tools for Parallel Software Engineering
Dynamic file-access characteristics of a production parallel scientific workload
Proceedings of the 1994 ACM/IEEE conference on Supercomputing
Dynamic I/O characterization of I/O intensive scientific applications
Proceedings of the 1994 ACM/IEEE conference on Supercomputing
Characterizing parallel file-access patterns on a large-scale multiprocessor
IPPS '95 Proceedings of the 9th International Symposium on Parallel Processing
File migration.
Tuning the performance of I/O-intensive parallel applications
Proceedings of the fourth workshop on I/O in parallel and distributed systems: part of the federated computing research conference
HFS: a performance-oriented flexible file system based on building-block compositions
Proceedings of the fourth workshop on I/O in parallel and distributed systems: part of the federated computing research conference
File-Access Characteristics of Parallel Scientific Workloads
IEEE Transactions on Parallel and Distributed Systems
Disk-directed I/O for MIMD multiprocessors
ACM Transactions on Computer Systems (TOCS)
HFS: a performance-oriented flexible file system based on building-block compositions
ACM Transactions on Computer Systems (TOCS)
On implementing MPI-IO portably and with high performance
Proceedings of the sixth workshop on I/O in parallel and distributed systems
Thread scheduling for out-of-core applications with memory server on multicomputers
Proceedings of the sixth workshop on I/O in parallel and distributed systems
A General Interprocedural Framework for Placement of Split-Phase Large Latency Operations
IEEE Transactions on Parallel and Distributed Systems
Compiling object-oriented data intensive applications
Proceedings of the 14th international conference on Supercomputing
An evaluation of Java's I/O capabilities for high-performance computing
Proceedings of the ACM 2000 conference on Java Grande
IEEE Transactions on Parallel and Distributed Systems
High performance visualization of time-varying volume data over a wide-area network status
Proceedings of the 2000 ACM/IEEE conference on Supercomputing
Compiler-based I/O prefetching for out-of-core applications
ACM Transactions on Computer Systems (TOCS)
A case for using MPI's derived datatypes to improve I/O performance
SC '98 Proceedings of the 1998 ACM/IEEE conference on Supercomputing
Optimization and evaluation of Hartree-Fock application's I/O with PASSION
SC '97 Proceedings of the 1997 ACM/IEEE conference on Supercomputing
CLIP: a checkpointing tool for message-passing parallel programs
SC '97 Proceedings of the 1997 ACM/IEEE conference on Supercomputing
Markov model prediction of I/O requests for scientific applications
ICS '02 Proceedings of the 16th international conference on Supercomputing
Models of Parallel Applications with Large Computation and I/O Requirements
IEEE Transactions on Software Engineering
An Experimental Evaluation of I/O Optimizations on Different Applications
IEEE Transactions on Parallel and Distributed Systems
Data parallel language and compiler support for data intensive applications
Parallel Computing - Parallel data-intensive algorithms and applications
An Experimental Evaluation of I/O Optimizations on Different Applications
IEEE Transactions on Parallel and Distributed Systems
Mapping Functions and Data Redistribution for Parallel Files
IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
An Experimental Study of Input/Output Characteristics of NASA Earth and Space Sciences Applications
IPPS '96 Proceedings of the 10th International Parallel Processing Symposium
Benchmark Design for Characterization of Balanced High-Performance Architectures
IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
Effective File-I/O Bandwidth Benchmark
Euro-Par '00 Proceedings from the 6th International Euro-Par Conference on Parallel Processing
Disk Resident Arrays: An Array-Oriented I/O Library for Out-Of-Core Computations
FRONTIERS '96 Proceedings of the 6th Symposium on the Frontiers of Massively Parallel Computation
I/O Requirements of Scientific Applications: An Evolutionary View
HPDC '96 Proceedings of the 5th IEEE International Symposium on High Performance Distributed Computing
Sourcebook of parallel computing
An Adaptive Cache Coherence Protocol Specification for Parallel Input/Output Systems
IEEE Transactions on Parallel and Distributed Systems
Exploiting Inter-File Access Patterns Using Multi-Collective I/O
FAST '02 Proceedings of the 1st USENIX Conference on File and Storage Technologies
Identity Boxing: A New Technique for Consistent Global Identity
SC '05 Proceedings of the 2005 ACM/IEEE conference on Supercomputing
Multicollective I/O: A technique for exploiting inter-file access patterns
ACM Transactions on Storage (TOS)
Scalable Design and Implementations for MPI Parallel Overlapping I/O
IEEE Transactions on Parallel and Distributed Systems
Large files, small writes, and pNFS
Proceedings of the 20th annual international conference on Supercomputing
Taming the memory hogs: using compiler-inserted releases to manage physical memory intelligently
OSDI'00 Proceedings of the 4th conference on Symposium on Operating System Design & Implementation - Volume 4
Direct-pNFS: scalable, transparent, and versatile access to parallel file systems
Proceedings of the 16th international symposium on High performance distributed computing
Noncontiguous locking techniques for parallel file systems
Proceedings of the 2007 ACM/IEEE conference on Supercomputing
Massively parallel genomic sequence search on the Blue Gene/P architecture
Proceedings of the 2008 ACM/IEEE conference on Supercomputing
Parallel I/O prefetching using MPI file caching and I/O signatures
Proceedings of the 2008 ACM/IEEE conference on Supercomputing
A cost-effective distributed file service with QoS guarantees
Proceedings of the ACM/IFIP/USENIX 2007 International Conference on Middleware
Conflict Detection Algorithm to Minimize Locking for MPI-IO Atomicity
Proceedings of the 16th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
Data layout optimization for petascale file systems
Proceedings of the 4th Annual Workshop on Petascale Data Storage
Algorithms for memory hierarchies: advanced lectures
Algorithms for memory hierarchies: advanced lectures
A Scalable Message Passing Interface Implementation of an Ad-Hoc Parallel I/o system
International Journal of High Performance Computing Applications
A cost-effective distributed file service with QoS guarantees
MIDDLEWARE2007 Proceedings of the 8th ACM/IFIP/USENIX international conference on Middleware
Evaluating I/O characteristics and methods for storing structured scientific data
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
Exploiting inter-file access patterns using multi-collective I/O
FAST'02 Proceedings of the 1st USENIX conference on File and storage technologies
Pattern-aware file reorganization in MPI-IO
Proceedings of the sixth workshop on Parallel Data Storage
Integrated in-system storage architecture for high performance computing
Proceedings of the 2nd International Workshop on Runtime and Operating Systems for Supercomputers
A dynamic and adaptive load balancing strategy for parallel file system with large-scale I/O servers
Journal of Parallel and Distributed Computing
Characterizing output bottlenecks in a supercomputer
SC '12 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
Throttling I/O streams to accelerate file-IO performance
HPCC'07 Proceedings of the Third international conference on High Performance Computing and Communications
Hi-index | 0.00 |
Rapid increases in computing and communication performance are exacerbating the long-standing problem of performance-limited input/output. Indeed, for many otherwise scalable parallel applications. input/output is emerging as a major performance bottleneck. The design of scalable input/output systems depends critically on the input/output requirements and access patterns for this emerging class of large-scale parallel applications. However, hard data on the behavior of such applications is only now becoming available. In this paper, we describe the input-output requirements of three scalable parallel applications (electron scattering, terrain rendering, and quantum chemistry, on the Intel Paragon XP/S. As part of an ongoing parallel input/output characterization effort, we used instrumented versions of the application codes to capture and analyze input/output volume, request size distributions, and temporal request structure. Because complete traces of individual application input/output requests were captured, in-depth, off-line analyses were possible. In addition, we conducted informal interviews of the application developers to understand the relation between the codes' current and desired input/output structure. The results of our studies show a wide variety of temporal and spatial access patterns, including highly read-intensive and write-intensive phases, extremely large and extremely small request sizes, and both sequential and highly irregular access patterns. We conclude with a discussion of the broad spectrum of access patterns and their profound implications for parallel file caching and prefetching schemes.