Analytical models for replicate-at-send multicasting in shared-memory switches
Performance Evaluation
Analyzing the Influence of Virtual Lanes on the Performance of InfiniBand Networks
IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
IBM Journal of Research and Development
Modeling, Simulation and Performance Evaluation for a CIOQ Switch Architecture
ANSS '06 Proceedings of the 39th annual Symposium on Simulation
A logarithmic scheduling algorithm for high speed input-queued switches
Computer Communications
Design issues in next-generation merchant switch fabrics
IEEE/ACM Transactions on Networking (TON)
Fully hardware based WFQ architecture for high-speed QoS packet scheduling
Integration, the VLSI Journal
Saturating the transceiver bandwidth: switch fabric design on FPGAs
Proceedings of the ACM/SIGDA international symposium on Field Programmable Gate Arrays
The impact of bursty traffic on FPCF packet switch performance
Computer Communications
Hi-index | 0.25 |
A packet-switched system architecture based on the combination of a single-chip output-buffered switch element and input queues that sort arriving packets on a per-output-port basis is proposed. Scheduling is performed in a distributed two-stage approach. Independent arbiters at each of the inputs resolve input contention. Whereas the output-buffered switch element resolves output contention. As a result of this distribution of functionality, complexity of the input arbiters is only linearly proportional to the number of output ports N, thus offering better scalability than purely input-buffered approaches that require complex centralized schedulers. Since the input queues are used as the main buffering mechanism, only a relatively small amount of memory (on the order of N2 packet locations) is required in the shared-memory switch, allowing high-throughput implementations. We present simulation results to demonstrate the high performance and robustness under bursty traffic achieved with the proposed system architecture. A practical implementation in the form of the PRIZMA family of switch chips is outlined, with emphasis on its versatility in scaling in terms of both port speed and number of ports, and its support for quality-of-service mechanisms.