Regular expressions at their best: a case for rational design

  • Authors:
  • Vincent Le Maout

  • Affiliations:
  • Exalead SA, Paris, France

  • Venue:
  • CIAA'10 Proceedings of the 15th international conference on Implementation and application of automata
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Regular expressions are often an integral part of program customization and many algorithms have been proposed for transforming them into suitable data structures. These algorithms can be divided into two main classes: backtracking or automaton-based algorithms. Surprisingly, the latter class draws less attention than the former, even though automaton-based algorithms represent the oldest and by far the fastest solutions when carefully designed. Only two open-source automatonbased implementations stand out: PCRE and the recent RE2 from Google. We have developed, and present here, a competitive automaton-based regular expression engine on top of the LGPL C++ Automata Standard Template Library (ASTL), whose efficiency and scalability remain unmatched and which distinguishes itself through a unique and rigorous STL-like design.