Effective performance prediction for large parallel applications on very large-scale systems requires a comprehensive modeling approach that combines analytical models, simulation models, and measurement for different application and system components. This paper presents a common parallel program representation, designed to support such a comprehensive approach, with four design goals: (1) the representation must support a wide range of modeling techniques; (2) it must be automatically computable using parallelizing compiler technology, in order to minimize the need for user intervention; (3) it must be efficient and scalable enough to model teraflop-scale applications; and (4) it should be flexible enough to capture the performance impact of changes to the application, including changes to the parallelization strategy, communication, and scheduling. The representation we present is based on a combination of static and dynamic task graphs. It exploits recent compiler advances that make it possible to use concise, symbolic static graphs and to instantiate dynamic graphs. This representation has led to the development of a compiler-supported simulation approach that can simulate regular, message-passing programs on systems or problems 10 to 100 times larger than was possible with previous state-of-the-art simulation techniques.
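The relationship between a concise, symbolic static task graph and the dynamic task graph instantiated from it can be illustrated with a small sketch. This is not the paper's actual data structure or algorithm: the names `StaticNode`, `StaticEdge`, and `instantiate`, and the two-phase nearest-neighbor example, are hypothetical and chosen only to show the idea that the static graph stays the same size while the dynamic graph grows with the number of processes.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

# Hypothetical illustration only; not the representation defined in the paper.

@dataclass(frozen=True)
class StaticNode:
    name: str
    # Symbolic set of process ids that execute this task, as a function of P.
    instances: Callable[[int], range]

@dataclass(frozen=True)
class StaticEdge:
    src: str
    dst: str
    # Symbolic mapping: for source process p (of P), the destination processes.
    mapping: Callable[[int, int], List[int]]

DynNode = Tuple[str, int]
DynEdge = Tuple[DynNode, DynNode]

def instantiate(nodes: List[StaticNode], edges: List[StaticEdge],
                num_procs: int) -> Tuple[List[DynNode], List[DynEdge]]:
    """Expand the symbolic static graph into a concrete dynamic graph."""
    dyn_nodes = [(n.name, p) for n in nodes for p in n.instances(num_procs)]
    node_set = set(dyn_nodes)
    dyn_edges = []
    for e in edges:
        for name, p in dyn_nodes:
            if name != e.src:
                continue
            for q in e.mapping(p, num_procs):
                # Keep only edges whose destination task instance exists.
                if (e.dst, q) in node_set:
                    dyn_edges.append(((e.src, p), (e.dst, q)))
    return dyn_nodes, dyn_edges

# Assumed example program: every process computes, then sends a boundary
# value to its right neighbor, which uses it in an update step.
NODES = [
    StaticNode("compute", lambda P: range(P)),
    StaticNode("update", lambda P: range(P)),
]
EDGES = [
    # Control-flow edge: compute on p precedes update on p.
    StaticEdge("compute", "update", lambda p, P: [p]),
    # Communication edge: compute on p feeds update on p + 1, if it exists.
    StaticEdge("compute", "update", lambda p, P: [p + 1] if p + 1 < P else []),
]

if __name__ == "__main__":
    for P in (4, 16):
        dyn_nodes, dyn_edges = instantiate(NODES, EDGES, P)
        print(P, len(NODES), len(EDGES), len(dyn_nodes), len(dyn_edges))
```

In this sketch the static graph always has two nodes and two edges, whereas the instantiated graph has 2P nodes and 2P - 1 edges. Under these assumptions, this is the sense in which a symbolic static graph stays concise while a dynamic graph is produced only for a specific process count.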