Compiler support for out-of-core arrays on parallel machines

Authors:
M. Paleczny;K. Kennedy;C. Koelbel
Affiliations:
-;-;-
Venue:
FRONTIERS '95 Proceedings of the Fifth Symposium on the Frontiers of Massively Parallel Computation (Frontiers'95)
Year:
1995

Citing 0
Cited 24

Tuning the performance of I/O-intensive parallel applications

Proceedings of the fourth workshop on I/O in parallel and distributed systems: part of the federated computing research conference
An interprocedural framework for placement of asynchronous I/O operations

ICS '96 Proceedings of the 10th international conference on Supercomputing
Automatic I/O hint generation through speculative execution

OSDI '99 Proceedings of the third symposium on Operating systems design and implementation
Thread scheduling for out-of-core applications with memory server on multicomputers

Proceedings of the sixth workshop on I/O in parallel and distributed systems
A General Interprocedural Framework for Placement of Split-Phase Large Latency Operations

IEEE Transactions on Parallel and Distributed Systems
Compiling object-oriented data intensive applications

Proceedings of the 14th international conference on Supercomputing
Compiler-based I/O prefetching for out-of-core applications

ACM Transactions on Computer Systems (TOCS)
Compiler-Directed Collective-I/O

IEEE Transactions on Parallel and Distributed Systems
Compiler supported high-level abstractions for sparse disk-resident datasets

ICS '02 Proceedings of the 16th international conference on Supercomputing
An I/O-Conscious Tiling Strategy for Disk-Resident Data Sets

The Journal of Supercomputing
Data parallel language and compiler support for data intensive applications

Parallel Computing - Parallel data-intensive algorithms and applications
Advanced Library Support for Irregular and Out-of-Core Parallel Computing

HPCN Europe 2001 Proceedings of the 9th International Conference on High-Performance Computing and Networking
Compiler-Directed I/O Optimization

IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
I/O Granularity Transformations

LCPC '98 Proceedings of the 11th International Workshop on Languages and Compilers for Parallel Computing
Compiling Data Intensive Applications with Spatial Coordinates

LCPC '00 Proceedings of the 13th International Workshop on Languages and Compilers for Parallel Computing-Revised Papers
Persistent Array Access Using Server-Directed I/O

SSDBM '96 Proceedings of the Eighth International Conference on Scientific and Statistical Database Management
Language and Compiler Support for Out-of-Core Irregular Applications on Distributed-Memory Multiprocessors

LCR '98 Selected Papers from the 4th International Workshop on Languages, Compilers, and Run-Time Systems for Scalable Computers
A Collective I/O Scheme Based on Compiler Analysis

LCR '00 Selected Papers from the 5th International Workshop on Languages, Compilers, and Run-Time Systems for Scalable Computers
References

Sourcebook of parallel computing
The MHETA Execution Model for Heterogeneous Clusters

SC '05 Proceedings of the 2005 ACM/IEEE conference on Supercomputing
Discretionary Caching for I/O on Clusters

Cluster Computing
Improving I/O performance of applications through compiler-directed code restructuring

FAST'08 Proceedings of the 6th USENIX Conference on File and Storage Technologies
Algorithms for memory hierarchies: advanced lectures

Algorithms for memory hierarchies: advanced lectures
Compiler and middleware support for scalable data mining

LCPC'01 Proceedings of the 14th international conference on Languages and compilers for parallel computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Many computational methods are currently limited by the size of physical memory, the latency of disk storage, and the difficulty of writing an efficient out-of-core version of the application. We are investigating a compiler-based approach to the above problem. In general, our compiler techniques attempt to choreograph I/O for an application based on high-level programmer annotations similar to Fortran D's DECOMPOSITION, ALIGN, and DISTRIBUTE statements. The central problem is to generate "deferred routines" which delay computations until all the data they require have been read into main memory. We present the results for two applications, LU factorization and red-black relaxation, on 1 to 32 nodes of an Intel Paragon after hand application of these compiler techniques.