On the speedup required for combined input- and output-queued switching

  • Authors:
  • Balaji Prabhakar; Nick McKeown

  • Affiliations:
  • Departments of Electrical Engineering and Computer Science, Stanford University, Stanford, CA 94305, USA (both authors)

  • Venue:
  • Automatica (Journal of IFAC)
  • Year:
  • 1999

Abstract

Architectures based on a non-blocking fabric, such as a crosspoint switch, are attractive for use in high-speed LAN switches, IP routers, and ATM switches. When operating at the highest speeds, memory bandwidth limitations dictate that queues be placed at the input of the switch. But it is well known that input-queueing can lead to low throughput and does not allow the control of latency through the switch. This is in contrast to output-queueing, which maximizes throughput and permits accurate control of packet latency through scheduling. We ask the question: can a switch with combined input and output queueing be designed to behave identically to an output-queued switch? In this paper, we prove that if the switch uses virtual output queueing and has an internal speedup of just four, it can behave identically to an output-queued switch, regardless of the nature of the arriving traffic. Our proof is based on a novel scheduling algorithm, called Most Urgent Cell First (MUCFA). We find that with a speedup of four, MUCFA enables perfect emulation of a FIFO output-queued switch, i.e. one in which packets depart in the same order in which they arrived. We extend this result to show that, with a small modification, MUCFA enables perfect emulation of a variety of output scheduling policies, including strict priorities and weighted fair queueing. This result makes possible switches that perform as if they were output-queued, yet use memories that run more slowly.
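
To make the urgency-based scheduling idea concrete, below is a minimal, hypothetical Python sketch of one internal scheduling phase in the spirit of MUCFA. This is not the paper's algorithm: the names (`Cell`, `schedule_phase`, `SPEEDUP`) are illustrative, the greedy matching here stands in for MUCFA's request-grant iteration, and the shadow output-queued departure times are assumed to be supplied by a separate emulation of the shadow switch.

```python
# Hypothetical sketch of urgency-driven scheduling over virtual output queues.
# Assumes cells arrive already stamped with their departure time in a shadow
# output-queued switch; smaller remaining time means more urgent.

from dataclasses import dataclass
from typing import List, Tuple

SPEEDUP = 4  # internal scheduling phases per external time slot (as in the paper)


@dataclass
class Cell:
    input_port: int
    output_port: int
    shadow_departure: int  # departure time in the shadow output-queued switch

    def urgency(self, now: int) -> int:
        # Smaller value = more urgent (closer to its shadow departure time).
        return self.shadow_departure - now


def schedule_phase(voqs: List[List[List[Cell]]], now: int) -> List[Tuple[int, int, Cell]]:
    """One internal phase: greedily match inputs to outputs, always preferring
    the most urgent head-of-line cell. voqs[i][j] is the virtual output queue
    at input i holding cells destined for output j."""
    n = len(voqs)
    # Collect every head-of-line cell as a candidate (urgency, input, output).
    candidates = []
    for i in range(n):
        for j in range(n):
            if voqs[i][j]:
                candidates.append((voqs[i][j][0].urgency(now), i, j))
    candidates.sort()  # most urgent first; ties broken by port numbers

    matched_inputs, matched_outputs, transfers = set(), set(), []
    for _, i, j in candidates:
        if i not in matched_inputs and j not in matched_outputs:
            matched_inputs.add(i)
            matched_outputs.add(j)
            transfers.append((i, j, voqs[i][j].pop(0)))
    return transfers


def run_time_slot(voqs: List[List[List[Cell]]], now: int) -> List[Tuple[int, int, Cell]]:
    """Run SPEEDUP internal phases within a single external time slot,
    returning the cells transferred across the fabric to the output queues."""
    delivered = []
    for _ in range(SPEEDUP):
        delivered.extend(schedule_phase(voqs, now))
    return delivered
```

The greedy pass above illustrates only the ordering principle (serve the cell whose shadow departure is nearest); the emulation guarantee proved in the paper relies on the full MUCFA mechanism together with the speedup of four.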