Effective Cross-Platform, Multilevel Parallelism via Dynamic Adaptive Execution
IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
Performance Evaluation of the Omni OpenMP Compiler
ISHPC '00 Proceedings of the Third International Symposium on High Performance Computing
The Omni OpenMP Compiler on the Distributed Shared Memory of Cenju-4
WOMPAT '01 Proceedings of the International Workshop on OpenMP Applications and Tools: OpenMP Shared Memory Parallel Programming
Algorithms for memory hierarchies: advanced lectures
Algorithms for memory hierarchies: advanced lectures
Hi-index | 0.00 |
Multiprocessors and high performance networks offer the opportunity to construct CLUster of Multi Processors (CLUMPs) and use them as parallel computing platforms. The distinctive feature of the CLUMPs over traditional parallel computers is their hybrid memory model (message passing between the nodes and shared memory inside the nodes). In this paper, we investigate the performance characteristics of a cluster of biprocessor PCs for the NAS 2.3 parallel Benchmark using a programming model based on MPI for message passing between biprocessor nodes and OpenMP for shared memory inside biprocessor nodes. The paper provides several contributions. These include: a) Speed-up measurements of a cluster of biprocessor PCs over a cluster of uniprocessor PCs using the hybrid memory model b) A detailed analysis of the speed-up results from a breakdown of the benchmarks execution time and c) A performance comparison of a commodity CLUMP with some high performance parallel computers.