Conservative vs. Optimistic Parallelization of Stateful Network Intrusion Detection

  • Authors:
  • Derek L. Schuff, Yung Ryn Choe, Vijay S. Pai

  • Affiliations:
  • Purdue University, West Lafayette, IN 47907 (dschuff@purdue.edu, yung@purdue.edu, vpai@purdue.edu)

  • Venue:
  • ISPASS '08: Proceedings of the 2008 IEEE International Symposium on Performance Analysis of Systems and Software
  • Year:
  • 2008

Abstract

This paper presents and experimentally analyzes the performance of three parallelization strategies for the popular open-source Snort network intrusion detection system (NIDS): two conservative variants and one optimistic scheme. The conservative strategy parallelizes inspection at the level of TCP/IP flows, since any potential inter-packet dependences are confined to a single flow. Flows are partitioned among threads, and each flow is processed in order by one thread. A second variant dynamically reassigns flows between threads to improve load balance, but still requires that only one thread process a given flow at a time. The flow-concurrent scheme performs well on three of the five network packet traces studied, reaching a speedup of up to 4.1 and an inspection rate of 3.1 Gbps on a commodity 8-core server. Dynamic reassignment does not improve scalability because it introduces locking overheads that offset any potential benefit from load balancing. Neither conservative version achieves good performance, however, without enough concurrent network flows. For this case, the paper presents an optimistic parallelization that exploits the observation that not all packets from a flow are actually connected by dependences. This scheme allows a single flow to be processed simultaneously by multiple threads, stalling only if an actual dependence is found. The optimistic version incurs additional overheads that reduce speedup by 25% for traces with ample flow concurrency, but it allows one additional trace to see substantial speedup (2.4 on five cores).
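The conservative flow-concurrent scheme can be illustrated with a minimal sketch: packets are partitioned among worker threads by hashing the TCP/IP flow 5-tuple, so every packet of a given flow is handled in order by a single thread. The worker count, packet fields, and `flow_id` canonicalization below are illustrative assumptions, not the paper's actual implementation.

```python
import queue
import threading

NUM_WORKERS = 4  # assumed worker count; the paper evaluates an 8-core server


def flow_id(pkt):
    # Canonicalize the 5-tuple so both directions of a flow map together.
    a = (pkt["src"], pkt["sport"])
    b = (pkt["dst"], pkt["dport"])
    lo, hi = (a, b) if a <= b else (b, a)
    return (pkt["proto"], lo, hi)


def assign_worker(pkt):
    # Same flow -> same hash -> same worker queue, preserving per-flow order.
    return hash(flow_id(pkt)) % NUM_WORKERS


def worker(q, out):
    while True:
        pkt = q.get()
        if pkt is None:  # sentinel: no more packets
            break
        out.append(pkt["seq"])  # stand-in for stateful inspection work


queues = [queue.Queue() for _ in range(NUM_WORKERS)]
results = [[] for _ in range(NUM_WORKERS)]
threads = [threading.Thread(target=worker, args=(queues[i], results[i]))
           for i in range(NUM_WORKERS)]
for t in threads:
    t.start()

# Eight packets of one TCP flow (proto 6); all hash to the same worker.
packets = [{"src": "10.0.0.1", "dst": "10.0.0.2", "sport": 1234,
            "dport": 80, "proto": 6, "seq": i} for i in range(8)]
for pkt in packets:
    queues[assign_worker(pkt)].put(pkt)
for q in queues:
    q.put(None)
for t in threads:
    t.join()

# Exactly one worker saw the flow, and it processed the packets in order.
per_flow = [r for r in results if r]
assert per_flow == [[0, 1, 2, 3, 4, 5, 6, 7]]
```

With only a single flow, as in this example, all work lands on one thread, which is precisely the load-imbalance case that motivates the paper's optimistic scheme.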