A new proposal to deal with congestion in InfiniBand-based fat-trees

Authors:
Jesus Escudero-Sahuquillo; Pedro J. Garcia; Francisco J. Quiles; Sven-Arne Reinemo; Tor Skeie; Olav Lysne; Jose Duato
Venue:
Journal of Parallel and Distributed Computing
Year:
2014

Abstract

The overall performance of High-Performance Computing applications may depend largely on the performance of the network interconnecting the end-nodes; high-speed interconnect technologies such as InfiniBand are therefore used to provide high throughput and low latency. Nevertheless, congestion may degrade network performance, so techniques that deal with the problems derived from congestion have become practically mandatory. In this paper we propose a straightforward congestion-management method suitable for fat-tree topologies built from InfiniBand components. Our proposal is based on a traffic-flow-to-service-level mapping that prevents, as far as the resources available in current InfiniBand components (basically Virtual Lanes) allow, the negative impact of the two most common problems derived from congestion: head-of-line (HoL) blocking and buffer-hogging. We also provide a mathematical approach, based on a set of analytical metrics, to analyze the efficiency of our proposal and of several others. In certain traffic scenarios, we observe up to 68% of the ideal performance gain that could be achieved in HoL-blocking and buffer-hogging prevention.
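
The abstract does not spell out the mapping itself, so the following is only an illustrative sketch of the general idea of assigning traffic flows to service levels (SLs), not the mapping proposed in the paper. It assumes a hypothetical destination-modulo rule, eight available SLs, and an identity SL-to-VL table; the names `flow_to_sl` and `queues_by_vl` are invented for this example.

```python
from collections import defaultdict

# Assumption: 8 service levels are available, each mapped one-to-one onto
# a Virtual Lane (identity SL-to-VL table). Real fabrics may offer fewer.
NUM_SLS = 8


def flow_to_sl(src, dst, num_sls=NUM_SLS):
    """Hypothetical mapping: choose the SL from the destination only.

    This is a simple destination-modulo rule used for illustration;
    it is NOT the mapping defined in the paper.
    """
    return dst % num_sls


def queues_by_vl(flows, num_sls=NUM_SLS):
    """Group (src, dst) flows by the VL whose queue they would share."""
    queues = defaultdict(list)
    for src, dst in flows:
        vl = flow_to_sl(src, dst, num_sls)  # identity SL-to-VL assumed
        queues[vl].append((src, dst))
    return dict(queues)


# Three flows converge on a hot-spot destination (node 5); two others do not.
flows = [(0, 5), (1, 5), (2, 5), (3, 6), (4, 13)]
queues = queues_by_vl(flows)

for vl, members in sorted(queues.items()):
    print(f"VL {vl}: {members}")
```

In this sketch the flow to node 6 gets a VL of its own and cannot be blocked behind packets headed to the hot spot, whereas the flow to node 13 still shares a VL with the congested flows. That residual sharing is the kind of limit the abstract alludes to when it says the problems are prevented only as far as the available Virtual Lanes allow.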