Shorter regular expressions from finite-state automata

  • Authors:
  • Yo-Sub Han;Derick Wood

  • Affiliations:
  • Department of Computer Science, The Hong Kong University of Science and Technology;Department of Computer Science, The Hong Kong University of Science and Technology

  • Venue:
  • CIAA'05 Proceedings of the 10th international conference on Implementation and Application of Automata
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

We consider the use of state elimination to construct shorter regular expressions from finite-state automata. Although state elimination is an intuitive method for computing regular expressions from finite-state automata, the resulting regular expressions are often very long and complicated. We examine the minimization of finite-state automata to obtain shorter expressions first. Then, we introduce vertical chopping based on bridge states and horizontal chopping based on the structural properties of given finite-state automata. We prove that we should not eliminate bridge states until we eliminate all non-bridge states to obtain shorter regular expressions. In addition, we suggest heuristics for state elimination that lead to shorter regular expressions based on vertical chopping and horizontal chopping. Note that we have omitted almost all proofs in this preliminary version.