In this paper we present a new methodology for predicting the performance of parallel programs on parallel platforms ranging from shared-memory to distributed-memory (vector) machines. The methodology comprises a procedural paradigm for specifying programs and machines, based on PAMELA (PerformAnce ModEling LAnguage), together with a performance calculus called “serialization analysis”. This calculus extends conventional parallel program analysis by explicitly accounting for resource contention, while retaining the low evaluation cost typical of static techniques. We show that, where conventional techniques introduce fundamental errors, predictions from serialization analysis remain realistic. Apart from the merits of the methodology itself, this high ratio of reliability to cost makes PAMELA an attractive candidate for compile-time use within the performance prediction hierarchy often found in parallel programming environments.
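To illustrate why ignoring resource contention can introduce fundamental errors, the following minimal Python sketch compares a contention-free critical-path estimate with a simple contention-aware lower bound. It is not PAMELA syntax and does not reproduce the serialization analysis calculus itself. The task representation, the function names (`critical_path`, `resource_demand`, `contention_aware_bound`), and the `"bus"` resource are all hypothetical, and the bound shown (the larger of the critical path and the total demand on the busiest shared resource) is a generic assumption chosen for illustration.

```python
from collections import defaultdict

# Each task is a list of (resource, duration) steps executed in sequence.
# resource=None marks private work that never contends with other tasks.
# All names and the workload below are hypothetical, for illustration only.

def critical_path(tasks):
    """Contention-free estimate: the longest task, assuming every
    shared resource can serve all tasks simultaneously."""
    return max(sum(duration for _, duration in steps) for steps in tasks)

def resource_demand(tasks):
    """Total service time requested from each shared resource."""
    demand = defaultdict(float)
    for steps in tasks:
        for resource, duration in steps:
            if resource is not None:
                demand[resource] += duration
    return dict(demand)

def contention_aware_bound(tasks):
    """Lower bound on execution time when each shared resource serves
    one task at a time: no schedule can beat the critical path or the
    total demand placed on the busiest resource."""
    demand = resource_demand(tasks)
    return max(critical_path(tasks), max(demand.values(), default=0.0))

# Four parallel tasks: 1.0 time unit of private computation each,
# followed by 2.0 time units on a single shared bus.
tasks = [[(None, 1.0), ("bus", 2.0)] for _ in range(4)]

print(critical_path(tasks))           # contention-free estimate
print(contention_aware_bound(tasks))  # bound that accounts for the bus
```

For this workload the contention-free estimate is 3.0 time units, whereas the shared bus alone must deliver 8.0 time units of service, so no execution can finish sooner than that. Both quantities are computed in a single pass over the task description, which reflects the low evaluation cost associated with static techniques.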