Fat-trees: universal networks for hardware-efficient supercomputing
IEEE Transactions on Computers
Distributing Hot-Spot Addressing in Large-Scale Multiprocessors
IEEE Transactions on Computers
Dynamically-Allocated Multi-Queue Buffers for VLSI Communication Switches
IEEE Transactions on Computers
High-speed switch scheduling for local-area networks
ACM Transactions on Computer Systems (TOCS)
A New Theory of Deadlock-Free Adaptive Routing in Wormhole Networks
IEEE Transactions on Parallel and Distributed Systems
Using a Multipath Network for Reducing the Effects of Hot Spots
IEEE Transactions on Parallel and Distributed Systems
A new method to make communication latency uniform: distributed routing balancing
ICS '99 Proceedings of the 13th international conference on Supercomputing
The iSLIP scheduling algorithm for input-queued switches
IEEE/ACM Transactions on Networking (TON)
IEEE Transactions on Parallel and Distributed Systems
Deadlock-Free Adaptive Routing in Multicomputer Networks Using Virtual Channels
IEEE Transactions on Parallel and Distributed Systems
k -ary n -trees: High Performance Networks for Massively Parallel Architectures
IPPS '97 Proceedings of the 11th International Symposium on Parallel Processing
IPPS '95 Proceedings of the 9th International Symposium on Parallel Processing
Credit-Flow-Controlled ATM for MP Interconnection: The ATLAS I Single-Chip ATM Switch
HPCA '98 Proceedings of the 4th International Symposium on High-Performance Computer Architecture
BLAM: A High-Performance Routing Algorithm for Virtual Cut-Through Networks
IPDPS '03 Proceedings of the 17th International Symposium on Parallel and Distributed Processing
Self-Tuned Congestion Control for Multiprocessor Networks
HPCA '01 Proceedings of the 7th International Symposium on High-Performance Computer Architecture
HPCA '05 Proceedings of the 11th International Symposium on High-Performance Computer Architecture
Congestion Control in InfiniBand Networks
HOTI '05 Proceedings of the 13th Symposium on High Performance Interconnects
Globally Adaptive Load-Balanced Routing on Tori
IEEE Computer Architecture Letters
RECN-IQ: A Cost-Effective Input-Queued Switch Architecture with Congestion Management
ICPP '07 Proceedings of the 2007 International Conference on Parallel Processing
Congestion management for non-blocking clos networks
Proceedings of the 3rd ACM/IEEE Symposium on Architecture for networking and communications systems
Euro-Par '08 Proceedings of the 14th international Euro-Par conference on Parallel Processing
Optimized InfiniBandTM fat-tree routing for shift all-to-all communication patterns
Concurrency and Computation: Practice & Experience - International Supercomputing Conference (ISC07)
HiPC'08 Proceedings of the 15th international conference on High performance computing
Buffer Management Strategies to Reduce HoL Blocking
IEEE Transactions on Parallel and Distributed Systems
Scalable alternatives to virtual output queuing
ICC'09 Proceedings of the 2009 IEEE international conference on Communications
An efficient strategy for reducing head-of-line blocking in fat-trees
Euro-Par'10 Proceedings of the 16th international Euro-Par conference on Parallel processing: Part II
ICPADS '10 Proceedings of the 2010 IEEE 16th International Conference on Parallel and Distributed Systems
vFtree - A Fat-Tree Routing Algorithm Using Virtual Lanes to Alleviate Congestion
IPDPS '11 Proceedings of the 2011 IEEE International Parallel & Distributed Processing Symposium
Combining Congested-Flow Isolation and Injection Throttling in HPC Interconnection Networks
ICPP '11 Proceedings of the 2011 International Conference on Parallel Processing
Exploring the Scope of the InfiniBand Congestion Control Mechanism
IPDPS '12 Proceedings of the 2012 IEEE 26th International Parallel and Distributed Processing Symposium
Fat-tree routing and node ordering providing contention free traffic for MPI global collectives
Journal of Parallel and Distributed Computing
Extending the BT NAS parallel benchmark to exascale computing
SC '12 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
IEEE Transactions on Parallel and Distributed Systems
Hi-index | 0.00 |
The overall performance of High-Performance Computing applications may depend largely on the performance achieved by the network interconnecting the end-nodes; thus high-speed interconnect technologies like InfiniBand are used to provide high throughput and low latency. Nevertheless, network performance may be degraded due to congestion; thus using techniques to deal with the problems derived from congestion has become practically mandatory. In this paper we propose a straightforward congestion-management method suitable for fat-tree topologies built from InfiniBand components. Our proposal is based on a traffic-flow-to-service-level mapping that prevents, as much as possible with the resources available in current InfiniBand components (basically Virtual Lanes), the negative impact of the two most common problems derived from congestion: head-of-line blocking and buffer-hogging. We also provide a mathematical approach to analyze the efficiency of our proposal and several ones, by means of a set of analytical metrics. In certain traffic scenarios, we observe up to a 68% of the ideal performance gain that could be achieved in HoL-blocking and buffer-hogging prevention.