A combined input and output queued packet-switched system based on PRIZMA switch-on-a-chip technology

Authors:
C. Minkenberg;T. Engbersen
Affiliations:
Zurich Res. Lab., IBM Corp., Ruschlikon, Switzerland;-
Venue:
IEEE Communications Magazine
Year:
2000

Citing 0
Cited 11

Analytical models for replicate-at-send multicasting in shared-memory switches

Performance Evaluation
A Four-Terabit Packet Switch Supporting Long Round-Trip Times

IEEE Micro
Analyzing the Influence of Virtual Lanes on the Performance of InfiniBand Networks

IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
Prizma switch technology

IBM Journal of Research and Development
Modeling, Simulation and Performance Evaluation for a CIOQ Switch Architecture

ANSS '06 Proceedings of the 39th annual Symposium on Simulation
Quality of service provision in combined input and crosspoint queued switches without output queueing match

Computer Communications
A logarithmic scheduling algorithm for high speed input-queued switches

Computer Communications
Design issues in next-generation merchant switch fabrics

IEEE/ACM Transactions on Networking (TON)
Fully hardware based WFQ architecture for high-speed QoS packet scheduling

Integration, the VLSI Journal
Saturating the transceiver bandwidth: switch fabric design on FPGAs

Proceedings of the ACM/SIGDA international symposium on Field Programmable Gate Arrays
The impact of bursty traffic on FPCF packet switch performance

Computer Communications

Quantified Score

Hi-index	0.25

Abstract

A packet-switched system architecture based on the combination of a single-chip output-buffered switch element and input queues that sort arriving packets on a per-output-port basis is proposed. Scheduling is performed in a distributed two-stage approach: independent arbiters at each of the inputs resolve input contention, whereas the output-buffered switch element resolves output contention. As a result of this distribution of functionality, the complexity of each input arbiter is only linearly proportional to the number of output ports N, thus offering better scalability than purely input-buffered approaches that require complex centralized schedulers. Since the input queues are used as the main buffering mechanism, only a relatively small amount of memory (on the order of N^2 packet locations) is required in the shared-memory switch, allowing high-throughput implementations. We present simulation results to demonstrate the high performance and robustness under bursty traffic achieved with the proposed system architecture. A practical implementation in the form of the PRIZMA family of switch chips is outlined, with emphasis on its versatility in scaling in terms of both port speed and number of ports, and its support for quality-of-service mechanisms.
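The two-stage scheduling described in the abstract can be illustrated with a toy, slot-based model. This is a minimal sketch under stated assumptions, not the PRIZMA design: the class name `ToyCIOQSwitch`, the round-robin arbiter policy, the one-packet-per-input-per-slot rule, and the simple "forward only while shared memory has a free location" check are all hypothetical choices made for illustration. What it does take from the abstract is the structure: per-output input queues, one independent arbiter per input whose work is linear in N, and an output-buffered element with on the order of N^2 packet locations.

```python
from collections import deque


class ToyCIOQSwitch:
    """Toy slot-based model of a combined input- and output-queued switch.

    Illustrative only: the arbiter policy and flow control are assumptions,
    not taken from the paper.
    """

    def __init__(self, n_ports, shared_slots=None):
        self.n = n_ports
        # Shared memory on the order of N^2 packet locations (per the abstract).
        self.shared_slots = n_ports * n_ports if shared_slots is None else shared_slots
        # voq[i][j]: packets waiting at input i for output j (sorted per output).
        self.voq = [[deque() for _ in range(n_ports)] for _ in range(n_ports)]
        # One output queue per port inside the output-buffered element.
        self.outq = [deque() for _ in range(n_ports)]
        # Round-robin pointer for each input arbiter (assumed policy).
        self.rr = [0] * n_ports

    def enqueue(self, inp, out, packet):
        """Place an arriving packet in the input queue for its output port."""
        self.voq[inp][out].append(packet)

    def _arbitrate(self, inp):
        """Stage 1: scan only this input's N queues, so the cost is O(N)."""
        for k in range(self.n):
            out = (self.rr[inp] + k) % self.n
            if self.voq[inp][out]:
                self.rr[inp] = (out + 1) % self.n
                return out
        return None

    def step(self):
        """Advance one packet slot; return the (output, packet) pairs sent."""
        free = self.shared_slots - sum(len(q) for q in self.outq)
        # Stage 1: each input independently forwards at most one packet,
        # as long as the shared memory still has a free location.
        for inp in range(self.n):
            if free <= 0:
                break
            out = self._arbitrate(inp)
            if out is not None:
                self.outq[out].append(self.voq[inp][out].popleft())
                free -= 1
        # Stage 2: the output-buffered element resolves output contention
        # by sending one queued packet per output port per slot.
        sent = []
        for out in range(self.n):
            if self.outq[out]:
                sent.append((out, self.outq[out].popleft()))
        return sent
```

In this sketch, two inputs holding packets for the same output both forward in the same slot, and the output queue then drains them one per slot. No central scheduler has to match inputs to outputs, which is the scalability point the abstract makes.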