Performance analysis of the FFT algorithm on a shared-memory parallel architecture
IBM Journal of Research and Development
Application Load Imbalance on Parallel Processors
IPPS '96 Proceedings of the 10th International Parallel Processing Symposium
Performance modeling and analysis of correlated parallel computations
Parallel Computing
Roofline: an insightful visual performance model for multicore architectures
Communications of the ACM - A Direct Path to Dependable Software
Hi-index | 0.00 |
A general methodology for studying the degree of matching between an architecture and an algorithm is introduced and applied to the case of synchronized iterative algorithms in MIMD machines.