We present a collection of techniques for exploiting latent I/O asynchrony, which can substantially improve performance in data-intensive parallel applications. Latent asynchrony refers to an application's tolerance for decoupling ancillary operations from its core computation, a property of HPC codes that current HPC I/O systems do not fully explore. Decoupling operations such as buffering and staging, reorganization, and format conversion from core codes, in both space and time, can shorten I/O phases and preserve valuable MPP compute cycles. In this paper we describe DataTaps, IOgraphs, and Metabots, three tools that allow HPC developers to implement decoupled I/O operations. Using these tools, asynchrony can be exploited by data generators that overlap computation with communication, and by data consumers that perform data conversion and reorganization out-of-band and on demand. In the context of a data-intensive fusion simulation, we show that exploiting latent asynchrony through decoupling of operations can provide significant performance benefits.
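The general idea of a data generator that overlaps computation with output can be illustrated with a minimal sketch. This is not the DataTaps, IOgraphs, or Metabots interface; the names `run_decoupled`, `compute`, `consume`, and `max_pending` are hypothetical, and a single helper thread with a bounded in-memory queue stands in for a staging area.

```python
import queue
import threading

_DONE = object()  # sentinel marking the end of the data stream


def run_decoupled(steps, compute, consume, max_pending=2):
    """Overlap compute(step) with consume(data) using a bounded staging queue.

    Hypothetical illustration only: compute() stands in for the core
    simulation step, consume() for an ancillary operation such as
    format conversion or a write to storage.
    """
    pending = queue.Queue(maxsize=max_pending)  # bounded staging buffer
    results, errors = [], []

    def drain():
        # Runs on the helper thread, out of band from the core computation.
        while True:
            item = pending.get()
            if item is _DONE:
                break
            try:
                results.append(consume(item))
            except Exception as exc:  # keep draining so the producer never deadlocks
                errors.append(exc)

    worker = threading.Thread(target=drain)
    worker.start()
    for step in range(steps):
        # The producer blocks here only when the staging buffer is full.
        pending.put(compute(step))
    pending.put(_DONE)
    worker.join()
    if errors:
        raise errors[0]
    return results
```

A call such as `run_decoupled(3, lambda s: [s] * 2, sum)` hands each step's output to the helper thread and moves on to the next step. The bound on the queue is a design choice of this sketch: it caps the memory used for staging at the cost of stalling the producer when the consumer falls behind.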