Predicting the performance of parallel programs

Authors:
V. Blanco;J. A. González;C. León;C. Rodríguez;G. Rodríguez;M. Printista
Affiliations:
Dpto. Estadística, I.O. y Computación, Universidad de La Laguna, 38271 Tenerife, La Laguna, Spain;Dpto. Estadística, I.O. y Computación, Universidad de La Laguna, 38271 Tenerife, La Laguna, Spain;Dpto. Estadística, I.O. y Computación, Universidad de La Laguna, 38271 Tenerife, La Laguna, Spain;Dpto. Estadística, I.O. y Computación, Universidad de La Laguna, 38271 Tenerife, La Laguna, Spain;Dpto. Estadística, I.O. y Computación, Universidad de La Laguna, 38271 Tenerife, La Laguna, Spain;Universidad Nacional de San Luis, Ejército de los Andes 950, San Luis, Argentina
Venue:
Parallel Computing
Year:
2004

Citing 12
Cited 1

A bridging model for parallel computation

Communications of the ACM
Parallel branch-and-bound

Advances in parallel algorithms
A static parameter based performance prediction tool for parallel programs

ICS '93 Proceedings of the 7th international conference on Supercomputing
Computer organization & design: the hardware/software interface

Computer organization & design: the hardware/software interface
On the versatility of parallel sorting by regular sampling

Parallel Computing
BSPlib: The BSP programming library

Parallel Computing
Visualizing the Performance of Parallel Programs

IEEE Software
The Paderborn University BSP (PUB) Library - Design, Implementation and Performance

IPPS '99/SPDP '99 Proceedings of the 13th International Symposium on Parallel Processing and the 10th Symposium on Parallel and Distributed Processing
DiP: A Parallel Program Development Environment

Euro-Par '96 Proceedings of the Second International Euro-Par Conference on Parallel Processing-Volume II
The PALLAS Parallel Programming Environment

Proceedings of the 4th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
A Portable Programming Interface for Performance Evaluation on Modern Processors

International Journal of High Performance Computing Applications
Predictability of bulk synchronous programs using MPI

EURO-PDP'00 Proceedings of the 8th Euromicro conference on Parallel and distributed processing

Parallel execution time prediction of the multitask parallel programs

Performance Evaluation

Quantified Score

Hi-index	0.00

Visualization

Abstract

This work presents a new approach to the relation between theoretical complexity models and performance analysis and tuning. The analysis of an algorithm produces a complexity function that gives an approach to the asymptotic number of operations performed by the algorithm. The time spent on these operations depends on the software-hardware platform being used. Usually such platforms are described, from the performance point of view, through a number of parameters. Those parameters are evaluated by a benchmarking program. Though for a given available platform, the algorithmic constants associated with the complexity formula can be computed using multidimensional linear regression, there is still the problem of predicting the performance when the platform is not available. We introduce the concept of Universal Instruction Class and derive from it a set of equations relating the values of the algorithmic constants with the platform parameters. Due to the hierarchical design of current memory systems, the performance behavior of most algorithms varies in a small number of large regions corresponding to small size, medium size and large size inputs. The constants involved in the complexity formula usually have different values for these regions. Assuming we have a complexity formula for the memory resources, it is possible to find a partition of the input size space and the different values of the algorithmic constants. This way, though the complexity formula is the same, the family of constants provides the adaptability of the formula to the different stationary uses of the memory.