Cedar Fortran and its compiler
CONPAR 90 Proceedings of the joint international conference on Vector and parallel processing
Efficient run-time support for irregular block-structured applications
Journal of Parallel and Distributed Computing - Special issue on irregular problems in supercomputing applications
A Programming Methodology for Dual-Tier Multicomputers
IEEE Transactions on Software Engineering - Special issue on architecture-independent languages and software tools for parallel processing
Communication overlap in multi-tier parallel algorithms
SC '98 Proceedings of the 1998 ACM/IEEE conference on Supercomputing
Run-Time Support for Multi-tier Programming of Block-Structured Applications on SMP Clusters
ISCOPE '97 Proceedings of the Scientific Computing in Object-Oriented Parallel Environments
Message Proxies for Efficient, Protected Communication on SMP Clusters
HPCA '97 Proceedings of the 3rd IEEE Symposium on High-Performance Computer Architecture
A taxonomy of programming models for symmetric multiprocessors and SMP clusters
PMMP '95 Proceedings of the conference on Programming Models for Massively Parallel Computers
Minimizing overhead in parallel algorithms through overlapping communication/computation
Minimizing overhead in parallel algorithms through overlapping communication/computation
A programming model for block-structured scientific calculations on smp clusters
A programming model for block-structured scientific calculations on smp clusters
SCALLOP: A Highly Scalable Parallel Poisson Solver in Three Dimensions
Proceedings of the 2003 ACM/IEEE conference on Supercomputing
Hiding Communication Latency with Non-SPMD, Graph-Based Execution
ICCS '09 Proceedings of the 9th International Conference on Computational Science: Part I
Hi-index | 0.00 |
Multi-tier platforms are hierarchically organized multicomputers with multiprocessor nodes. Compared with previous-generation single-tier systems based on uniprocessor nodes, they present a more complex array of performance tradeoffs. We describe performance programming techniques targeted to finite difference methods running on two large scale multi-tier computers manufactured by IBM: NPACI's Blue Horizon and ASCI Blue-Pacific Combined Technology Refresh. Our techniques resulted in performance improvements ranging from 10% to 17% over a traditional single-tier SPMD implementation.