High-performance architecture for dynamically updatable packet classification on FPGA

Authors:
Yun R. Qu;Shijie Zhou;Viktor K. Prasanna
Affiliations:
University of Southern California, Los Angeles, California, USA;University of Southern California, Los Angeles, California, USA;University of Southern California, Los Angeles, California, USA
Venue:
ANCS '13 Proceedings of the ninth ACM/IEEE symposium on Architectures for networking and communications systems
Year:
2013

Abstract

Algorithms and FPGA-based implementations for packet classification have been studied over the past decade. Algorithmic solutions have focused on high throughput; however, supporting dynamic updates has been challenging. In this paper, we present a 2-dimensional pipelined architecture for packet classification on FPGA, which achieves high throughput while supporting dynamic updates. Fine-grained processing elements are arranged in a 2-dimensional array; each processing element accesses its designated memory locally, resulting in a scalable architecture. The entire array is both horizontally and vertically pipelined. As a result, it supports a high clock rate that does not deteriorate as the length of the packet header or the size of the rule set increases. The performance of the architecture does not depend on rule set features such as the number of unique values in each field. The architecture also efficiently supports range searches in individual fields. The total memory is proportional to the rule set size. Dynamic updates (modify, delete, and insert operations on the rule set at run-time) are also supported on the self-reconfigurable processing elements with very little impact on the sustained throughput. Experimental results show that, for a 1K 15-tuple rule set, a state-of-the-art FPGA can sustain 190 Gbps throughput with 1 million updates/second. To the best of our knowledge, no other packet classification approach simultaneously supports both high throughput and dynamic updates of the rule set. Our architecture demonstrates 4x energy efficiency while achieving 2x throughput compared to TCAM.
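
The abstract does not describe the internals of a processing element, so the following is only a rough software model of the general idea, not the paper's design. It assumes one row of processing elements per rule and one column per header field, with each element holding an inclusive range in its own local storage and passing a one-bit match result to its neighbor. The names `PE`, `PEArray`, `classify`, `modify`, and `delete` are hypothetical, and the loops run sequentially where hardware would evaluate all elements in parallel across pipeline stages.

```python
from dataclasses import dataclass
from typing import List, Optional, Sequence, Tuple

Range = Tuple[int, int]  # inclusive (low, high); an assumption for this sketch


@dataclass
class PE:
    """Hypothetical processing element holding one range in local storage."""
    low: int
    high: int

    def step(self, field_value: int, match_in: bool) -> bool:
        # Combine the incoming match bit with this element's range test.
        return match_in and self.low <= field_value <= self.high


class PEArray:
    """Rows model rules (lower index = higher priority); columns model fields."""

    def __init__(self, rules: Sequence[Sequence[Range]]) -> None:
        self.grid: List[List[PE]] = [[PE(lo, hi) for lo, hi in r] for r in rules]
        self.valid: List[bool] = [True] * len(self.grid)

    def classify(self, header: Sequence[int]) -> Optional[int]:
        # Return the index of the first valid rule whose ranges all match.
        for idx, row in enumerate(self.grid):
            match = self.valid[idx]
            for pe, value in zip(row, header):
                match = pe.step(value, match)
            if match:
                return idx
        return None

    def modify(self, rule_idx: int, field_idx: int, new_range: Range) -> None:
        # Rewrite a single element's local range; other elements are untouched.
        self.grid[rule_idx][field_idx] = PE(*new_range)

    def delete(self, rule_idx: int) -> None:
        # Invalidate a rule without shifting the remaining rows.
        self.valid[rule_idx] = False
```

In this model an update touches only the storage of the affected elements, which is one plausible reading of why local memory access keeps update cost low; the storage grows linearly with the number of rules and fields.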