Performance-based parallel loop self-scheduling on heterogeneous multicore PC clusters

Authors:
Chao-Tung Yang;Jen-Hsiang Chang;Chao-Chin Wu
Affiliations:
High-Performance Computing Laboratory, Department of Computer Science, Tunghai University, Taichung, Taiwan ROC;High-Performance Computing Laboratory, Department of Computer Science, Tunghai University, Taichung, Taiwan ROC;Department of Computer Science and Information Engineering, National Changhua University of Education, Changhua, Taiwan
Venue:
HPCA'09 Proceedings of the Second international conference on High Performance Computing and Applications
Year:
2009

Citing 5
Cited 0

Guided self-scheduling: A practical scheduling scheme for parallel supercomputers

IEEE Transactions on Computers
Factoring: a method for scheduling parallel loops

Communications of the ACM
A Class of Loop Self-Scheduling for Heterogeneous Clusters

CLUSTER '01 Proceedings of the 3rd IEEE International Conference on Cluster Computing
An Enhanced Parallel Loop Self-Scheduling Scheme for Cluster Environments

The Journal of Supercomputing
On development of an efficient parallel loop self-scheduling for grid computing environments

Parallel Computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

In recent years, Multicore computers have been widely included in cluster systems. They adopt shared memory architectures. However, previous researches on parallel loop self-scheduling did not consider the feature of multicore computers. It is more suitable for shared-memory multiprocessors to adopt OpenMP for parallel programming. In this paper, we propose a performance-based approach that partitions loop iterations according to the performance weighting of cluster nodes. Because the iterations assigned to one MPI process will be processed in parallel by OpenMP threads running by the processor cores in the same computational node, the number of loop iterations to be allocated to one computational node at each scheduling step also depends on the number of processor cores in that node. Experimental results show that the proposed approach performs better than previous schemes.