PVM: a framework for parallel distributed computing
Concurrency: Practice and Experience
Virtual memory mapped network interface for the SHRIMP multicomputer
ISCA '94 Proceedings of the 21st annual international symposium on Computer architecture
MPI: The Complete Reference
IPPS '96 Proceedings of the 10th International Parallel Processing Symposium
Communication overhead for space science applications on the Beowulf parallel workstation
HPDC '95 Proceedings of the 4th IEEE International Symposium on High Performance Distributed Computing
Direct numerical simulation of turbulence with a PC/linux cluster: fact or fiction?
SC '99 Proceedings of the 1999 ACM/IEEE conference on Supercomputing
The Journal of Supercomputing
Network Survivability Simulation of a Commercially Deployed Dynamic Routing System Protocol
IPDPS '00 Proceedings of the 15 IPDPS 2000 Workshops on Parallel and Distributed Processing
NeuSim: A Modular Neural Networks Simulator for Beowulf Clusters
IWANN '01 Proceedings of the 6th International Work-Conference on Artificial and Natural Neural Networks: Bio-inspired Applications of Connectionism-Part II
Communication performance issues for two cluster computers
ACSC '03 Proceedings of the 26th Australasian computer science conference - Volume 16
International Journal of High Performance Computing Applications
An improved model for predicting HPL performance
GPC'07 Proceedings of the 2nd international conference on Advances in grid and pervasive computing
A parallel algorithm to compute data synopsis
WSEAS Transactions on Information Science and Applications
Modeling message-passing overhead on NCHC formosa PC cluster
GPC'06 Proceedings of the First international conference on Advances in Grid and Pervasive Computing
Hi-index | 0.00 |
Coupling PC-based commodity technology with distributed computing methodologies provides an important advance in the development of single-user dedicated systems. Beowulf is a class of experimental parallel workstations developed to evaluate and characterize the design space of this new operating point in price-performance. A key factor determining the realizable performance under real-world workloads is the means devised for interprocessor communications. A study has been performed to characterize a family of interconnect topologies feasible with low cost mass market network technologies. Behavior sensitivities to packet size and traffic density are determined. Findings are presented which compare more complex segmented topologies to the earlier parallel ``channel bonded'' scheme. It is shown that in many circumstances the more complex topologies perform better, and in some circumstances software routing techniques compare favorably to more expensive hardware switch mechanisms.