With recent advances in high-speed networks, distributed heterogeneous computing has emerged as an attractive computational paradigm. Wide-area grid infrastructures will enable distributed applications, such as video conferencing and distributed interactive simulation, to seamlessly integrate collections of heterogeneous workstations, multiprocessors, and mobile nodes. The underlying network is typically a collection of heterogeneous links built on different networking technologies. Such heterogeneous networks are also typical of local-area workstation clusters, which are increasingly used as alternatives to parallel computing systems. This paper introduces a framework for developing efficient collective communication schedules over such heterogeneous networks. We focus on application-level communication between the processes of a parallel program. Our framework consists of analytical models of the heterogeneous system, scheduling algorithms for the collective communication pattern, and performance evaluation mechanisms. We show that previous models, which considered node heterogeneity but ignored network heterogeneity, can lead to solutions that are worse than the optimal by an unbounded factor. We then introduce an enhanced communication model and develop three heuristic algorithms for the broadcast and multicast patterns. The completion time of the schedule is chosen as the performance metric. The heuristic algorithms are fastest edge first (FEF), earliest completing edge first (ECEF), and ECEF with look-ahead. For small system sizes, we find the optimal solution using exhaustive search. Our simulation experiments indicate that the performance of our heuristic algorithms is close to optimal. To evaluate the performance of larger systems, we have also developed a simple lower bound on the completion time. Our heuristic algorithms achieve significant performance improvements over previous approaches.
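The abstract names the FEF and ECEF heuristics without defining them, so the following is only a minimal sketch of one plausible reading of those names, not the paper's own formulation. It assumes a hypothetical pairwise cost matrix `cost[i][j]` (time to send the message from node `i` to node `j`), a one-port model in which a sender is busy for the whole transfer, and greedy selection rules inferred from the names: FEF picks the cheapest edge from an informed node to an uninformed one, while ECEF picks the edge whose transfer would finish earliest given when the sender becomes free. The function name `greedy_broadcast` and its parameters are illustrative, and the look-ahead variant is not sketched.

```python
from typing import Dict, List, Sequence, Tuple

# One scheduled transfer: (sender, receiver, start time, finish time).
Transfer = Tuple[int, int, float, float]


def greedy_broadcast(
    cost: Sequence[Sequence[float]], source: int = 0, rule: str = "ECEF"
) -> Tuple[List[Transfer], float]:
    """Build a broadcast schedule greedily and return it with its completion time.

    Assumptions (not taken from the source): cost[i][j] is the full transfer
    time from i to j, and a node can send to only one receiver at a time.
    """
    if rule not in ("FEF", "ECEF"):
        raise ValueError("rule must be 'FEF' or 'ECEF'")
    n = len(cost)
    # ready[i] is the earliest time informed node i can start its next send.
    ready: Dict[int, float] = {source: 0.0}
    schedule: List[Transfer] = []
    while len(ready) < n:
        best = None  # (selection key, sender, receiver)
        for i, ready_i in ready.items():
            for j in range(n):
                if j in ready:
                    continue  # j already has the message
                # FEF ranks by edge cost alone; ECEF ranks by finish time.
                key = cost[i][j] if rule == "FEF" else ready_i + cost[i][j]
                if best is None or key < best[0]:
                    best = (key, i, j)
        _, i, j = best
        start = ready[i]
        finish = start + cost[i][j]
        ready[i] = finish  # sender is busy until the transfer completes
        ready[j] = finish  # receiver may forward once it holds the message
        schedule.append((i, j, start, finish))
    completion = max((t[3] for t in schedule), default=0.0)
    return schedule, completion


if __name__ == "__main__":
    # Hypothetical 4-node cost matrix, chosen so the two rules diverge.
    cost = [
        [0, 1, 5, 6],
        [1, 0, 10, 7],
        [5, 10, 0, 10],
        [6, 7, 10, 0],
    ]
    for rule in ("FEF", "ECEF"):
        schedule, completion = greedy_broadcast(cost, source=0, rule=rule)
        print(rule, completion, schedule)
```

On this made-up matrix the two rules agree on the first two transfers and differ on the last: FEF reuses the source, which is still busy, because its edge to node 3 is cheapest, whereas ECEF uses node 1, which has been idle and so finishes earlier. The example only illustrates how the selection rules can diverge under these assumptions and says nothing about the results reported in the paper.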