Accelerating the bit-split string matching algorithm using Bloom filters

  • Authors:
  • Kun Huang;Dafang Zhang;Zheng Qin

  • Affiliations:
  • Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, PR China;School of Computer and Communication, Hunan University, Changsha, Hunan Province 410082, PR China and School of Software, Hunan University, Changsha, Hunan Province 410082, PR China;School of Software, Hunan University, Changsha, Hunan Province 410082, PR China

  • Venue:
  • Computer Communications
  • Year:
  • 2010

Quantified Score

Hi-index 0.24

Visualization

Abstract

Deep Packet Inspection (DPI) is essential for network security to scan both packet header and payload to search for predefined signatures. As link rates and traffic volumes of Internet are constantly growing, string matching using Deterministic Finite Automaton (DFA) will be the performance bottleneck of DPI. The recently proposed bit-split string matching algorithm suffers from the unnecessary state transitions problem. The root cause lies in the fact that the bit-split algorithm makes pattern matching in a ''not seeing the forest for trees'' approach, where each tiny DFA only processes a b-bit substring of each input character, but cannot check whether the entire character belongs to the alphabet of original DFA. This paper presents a byte- filtered string matching algorithm, where Bloom filters are used to preprocess each character of every incoming packet to check whether the character belongs to the original alphabet, before performing bit- split string matching. If not, each tiny DFA either makes a transition to its initial state or stops any state transition. Experimental results demonstrate that compared with the bit- split algorithm, our byte-filtered algorithm enormously reduces the string matching time as well as the number of state transitions of tiny DFAs on both synthetic and real signature rule sets.