A new methodology named CALMANT (CC-cube Algorithms on Meshes and Tori) is proposed for mapping a class of algorithms, which we call CC-cube algorithms, onto multicomputers with hypercube, mesh, or torus interconnection topologies. The methodology is suitable when the initial problem can be expressed as a set of processes that communicate through a hypercube topology (a CC-cube algorithm). Many important algorithms are of the CC-cube type. CALMANT is based on three techniques: a) the standard embedding, which assigns the processes of the algorithm to the nodes of the mesh multicomputer; b) communication pipelining, which increases the level of communication parallelism inherent in CC-cube algorithms; and c) optimal message-scheduling algorithms, proposed in this work, which avoid conflicts and thereby minimize communication time. Although CALMANT is proposed for multicomputers with different interconnection network topologies, this paper focuses only on the particular case of meshes.
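To make technique a) concrete, here is a minimal Python sketch of one common way to place hypercube processes on a square mesh. The abstract does not define the standard embedding, so the bit-splitting rule below is an assumption: the high half of a node's binary label selects the mesh row and the low half selects the column. The function names `embed` and `mesh_distance` are hypothetical and do not come from the paper.

```python
def embed(node: int, dims: int) -> tuple[int, int]:
    """Map a hypercube node label to (row, col) on a square mesh.

    Assumption: the hypercube has an even number of dimensions `dims`,
    so the mesh is 2^(dims/2) x 2^(dims/2). The high bits of the label
    give the row and the low bits give the column.
    """
    if dims % 2 != 0:
        raise ValueError("this sketch assumes an even number of dimensions")
    half = dims // 2
    row = node >> half                 # high half of the label
    col = node & ((1 << half) - 1)     # low half of the label
    return row, col


def mesh_distance(node: int, dim: int, dims: int) -> int:
    """Manhattan distance on the mesh between `node` and its hypercube
    neighbor across dimension `dim` (the label with bit `dim` flipped)."""
    r1, c1 = embed(node, dims)
    r2, c2 = embed(node ^ (1 << dim), dims)
    return abs(r1 - r2) + abs(c1 - c2)


if __name__ == "__main__":
    dims = 4  # 16-node hypercube placed on a 4 x 4 mesh
    for k in range(dims):
        print(f"dimension {k}: mesh distance {mesh_distance(0, k, dims)}")
```

Under this assumed placement, hypercube neighbors are not always mesh neighbors: an exchange across dimension k travels 2^(k mod dims/2) mesh links. Messages on such multi-hop paths can compete for the same links, which is the kind of conflict that communication pipelining and message scheduling have to manage.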