An Architecture for Congestion Management in Ethernet Clusters

Authors:
Gary McAlpine;Manoj Wadekar;Tanmay Gupta;Alan Crouch;Don Newell
Affiliations:
Intel Corporation;Intel Corporation;Intel Corporation;Intel Corporation;Intel Corporation
Venue:
IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Workshop 9 - Volume 10
Year:
2005

Abstract

Interconnects for clusters and bladed systems must deliver efficient throughput, low latency, low delay variation, and minimal frame drops. The primary technical issue hindering Ethernet adoption for cluster and blade system interconnects is the way Ethernet switches currently deal with congestion, which can occur frequently under cluster and blade system workloads. The common response to congestion is to drop frames, and the common method of avoiding frame drops is to use very large switch buffers. In this paper, we propose a three-level approach to congestion management that provides efficient throughput, low latency, and low delay variation, and that can eliminate frame drops even with very modestly sized switch buffers. The three levels are: 1) improved link-level control of transient congestion; 2) oversubscription control at layer 2 subnet ingresses; and 3) end-to-end oversubscription control by the higher-layer protocols. We present compelling simulation results showing the incremental benefits provided by each level.
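
The contrast the abstract draws between dropping frames and controlling congestion at the link level can be illustrated with a toy model. The sketch below is not the paper's mechanism or simulator. It assumes a single switch port with a small buffer, a fixed drain rate, and a hypothetical `backpressure` flag that stands in for link-level flow control by making the sender hold frames that do not fit. The function name, parameters, and traffic pattern are all invented for illustration, and only the first of the three levels is modeled.

```python
def simulate(arrivals, capacity, drain_per_tick, backpressure):
    """Toy single-queue switch port (illustrative assumption, not the paper's model).

    arrivals:       frames offered by the upstream sender on each tick
    capacity:       switch buffer size, in frames
    drain_per_tick: frames the egress link forwards per tick
    backpressure:   True  -> frames that do not fit wait at the sender
                    False -> frames that do not fit are dropped (tail drop)

    Returns (delivered, dropped, pending), where pending counts frames
    still waiting at the sender or in the buffer when the run ends.
    """
    queue = 0      # frames currently in the switch buffer
    held = 0       # frames held back at the sender
    delivered = 0
    dropped = 0

    for offered in arrivals:
        # Frames held back earlier are offered again on this tick.
        offered += held
        held = 0

        # Accept only as many frames as the buffer has room for.
        accepted = min(offered, capacity - queue)
        excess = offered - accepted
        queue += accepted

        if backpressure:
            held = excess       # sender pauses; nothing is lost
        else:
            dropped += excess   # buffer overflow; frames are lost

        # The egress link forwards up to drain_per_tick frames.
        sent = min(queue, drain_per_tick)
        queue -= sent
        delivered += sent

    return delivered, dropped, held + queue


if __name__ == "__main__":
    bursts = [6, 0, 0, 6, 0, 0]   # two bursts larger than the buffer
    print("tail drop:   ", simulate(bursts, 4, 2, backpressure=False))
    print("backpressure:", simulate(bursts, 4, 2, backpressure=True))
```

With these made-up inputs, the tail-drop run delivers 8 of the 12 offered frames and drops 4, while the backpressure run delivers all 12 with the same four-frame buffer. The toy ignores the costs of holding frames at the sender, such as added delay and congestion spreading upstream, which the ingress and end-to-end levels described in the abstract are meant to address.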