Fat-trees: universal networks for hardware-efficient supercomputing
IEEE Transactions on Computers
A bridging model for parallel computation
Communications of the ACM
Parallel sorting by regular sampling
Journal of Parallel and Distributed Computing
The network architecture of the Connection Machine CM-5 (extended abstract)
SPAA '92 Proceedings of the fourth annual ACM symposium on Parallel algorithms and architectures
The formal semantics of programming languages: an introduction
The formal semantics of programming languages: an introduction
LogP: towards a realistic model of parallel computation
PPOPP '93 Proceedings of the fourth ACM SIGPLAN symposium on Principles and practice of parallel programming
Scalable parallel geometric algorithms for coarse grained multicomputers
SCG '93 Proceedings of the ninth annual symposium on Computational geometry
A new deterministic parallel sorting algorithm with an experimental evaluation
Journal of Experimental Algorithmics (JEA)
A calculus of functional BSP programs
Science of Computer Programming - Special issue on mathematics of program construction
Theory and Algorithms for Parallel Computation
Euro-Par '98 Proceedings of the 4th International Euro-Par Conference on Parallel Processing
The Paderborn University BSP (PUB) library
Parallel Computing
Parallelism in random access machines
STOC '78 Proceedings of the tenth annual ACM symposium on Theory of computing
LogGP: Incorporating Long Messages into the LogP Model --- One step closer towards a realistic model for parallel computation
Heterogeneous Chip Multiprocessors
Computer
HLogGP: a new parallel computational model for heterogeneous clusters
CCGRID '04 Proceedings of the 2004 IEEE International Symposium on Cluster Computing and the Grid
Introduction to the cell multiprocessor
IBM Journal of Research and Development - POWER5 and packaging
Introduction to the cell broadband engine architecture
IBM Journal of Research and Development
Overview of the IBM Blue Gene/P project
IBM Journal of Research and Development
Entering the petaflop era: the architecture and performance of Roadrunner
Proceedings of the 2008 ACM/IEEE conference on Supercomputing
A bridging model for multi-core computing
Journal of Computer and System Sciences
A programming model for BSP with partitioned synchronisation
Formal Aspects of Computing
Hi-index | 0.00 |
This paper introduces the Scatter-Gather parallel-programming and parallel execution model in the form of a simple imperative Scatter-Gather Language (SGL). Its design is based on past experience with bulk-synchronous parallel (BSP) programming and BSP language design. SGL's novel features are motivated by the last decade's move towards multi-level and heterogeneous parallel architectures involving multi-core processors, graphics accelerators and hierarchical routing networks in the largest multiprocessing systems. The design of SGL is coherent with L. Valiant's multi-BSP while offering a programming interface that is even simpler than the primitives of bulk-synchronous parallel ML (BSML). SGL appears to cover a large subset of all BSP algorithms while avoiding complex message-passing programming. Like all BSP-inspired systems, it supports predictable, portable and scalable performance. Moreover, SGL's explicit data distribution will allow automatic or programmable load-balancing.