A prefiltering approach to regular expression matching for network security systems

Authors:
Tingwen Liu;Yong Sun;Alex X. Liu;Li Guo;Binxing Fang
Affiliations:
Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China, Graduate University of Chinese Academy of Sciences, Beijing, China;National Engineering Laboratory for Information Security Technologies, Beijing, China;Dept. of Computer Science and Engineering, Michigan State University;National Engineering Laboratory for Information Security Technologies, Beijing, China;Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China, National Engineering Laboratory for Information Security Technologies, Beijing, China
Venue:
ACNS'12 Proceedings of the 10th international conference on Applied Cryptography and Network Security
Year:
2012



Abstract

Regular expression (RegEx) matching is widely used in networking and security applications. Despite much effort, it remains a fundamentally difficult problem. DFA-based solutions achieve high throughput but require too much memory to run in high-speed SRAM. NFA-based solutions need little memory but are too slow. In this paper, we propose RegexFilter, a prefiltering approach. The basic idea is to generate the RegEx print of a RegEx set and use it to filter out most unmatched items. There are two key technical challenges: generating the RegEx print and matching against it. Generating the RegEx print is tricky because we must trade off two conflicting goals: filtering effectiveness, meaning that the RegEx print should filter out as many unmatched items as possible, and matching speed, meaning that matching against the RegEx print should be as fast as possible. To address the first challenge, we propose measurement tools for RegEx complexity and filtering effectiveness and use them to guide the generation of the RegEx print. To address the second challenge, we propose a fast RegEx print matching solution using Ternary Content Addressable Memory (TCAM). We implemented our approach and conducted experiments on real-world data sets. Our experimental results show that RegexFilter can speed up the potential throughput of RegEx matching by 21.5 times and 20.3 times for the RegEx sets of the Snort and L7-Filter systems, respectively, at the cost of less than 0.2 Mb of TCAM.
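
The general prefiltering idea in the abstract can be illustrated with a minimal Python sketch. It is not the RegexFilter algorithm: the abstract does not describe how the RegEx print is generated, and the paper matches the print in TCAM rather than in software. The rules, the hand-written prints, and the function names below are hypothetical and chosen only for illustration. The one property the sketch relies on is that each print is looser than its rule, so an input rejected by the print cannot match the full RegEx.

```python
import re

# Hypothetical rule set: (full RegEx, hand-written "print").
# Assumption: each print is deliberately looser than its rule, so every
# input matched by the full RegEx is also matched by the print.
RULES = [
    (re.compile(r"GET /admin/[a-z]+\.php\?id=\d+"), re.compile(r"/admin/")),
    (re.compile(r"user=\w+;.*passwd=\w{8,}"), re.compile(r"passwd=")),
]


def match_all(payload):
    """Baseline: run every full RegEx on the payload."""
    return [i for i, (full, _) in enumerate(RULES) if full.search(payload)]


def match_prefiltered(payload):
    """Run the cheap print first; run the full RegEx only on survivors."""
    hits = []
    for i, (full, rule_print) in enumerate(RULES):
        if rule_print.search(payload) is None:
            continue  # filtered out: the full RegEx cannot match either
        if full.search(payload):
            hits.append(i)
    return hits


if __name__ == "__main__":
    for payload in ["GET /index.html", "GET /admin/login.php?id=42"]:
        print(payload, match_prefiltered(payload))
```

Because a print can produce false positives but never false negatives, the prefiltered matcher returns the same rule indices as the baseline. The throughput gain depends on how many inputs the print rejects and on how cheaply it can be evaluated, which is the trade-off the abstract describes.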