Rapid increases in the complexity of algorithms for real-time signal processing applications have led to performance requirements that exceed the capabilities of conventional digital signal processor (DSP) architectures. Many applications, such as autonomous sonar arrays, are distributed in nature and amenable to parallel computing on embedded systems built from multiple networked DSPs. To realize the full potential of such applications, however, a lightweight service for message-passing communication and parallel process coordination is needed, one that provides high throughput and low latency while minimizing processor and memory utilization. This paper presents the design and analysis of such a service, based on the Message Passing Interface (MPI) specification, for unicast and collective communications.