Provably Shorter Regular Expressions from Deterministic Finite Automata

Authors:
Hermann Gruber;Markus Holzer
Affiliations:
Institut für Informatik, Ludwig-Maximilians-Universität München, München, Germany D-80538;Institut für Informatik, Technische Universität München, Garching bei München, Germany D-85748
Venue:
DLT '08 Proceedings of the 12th international conference on Developments in Language Theory
Year:
2008

Citing 11
Cited 6

Derivatives of Regular Expressions

Journal of the ACM (JACM)
State complexity of proportional removals

Journal of Automata, Languages and Combinatorics - Third international workshop on descriptional complexity of automata, grammars and related structures
Follow automata

Information and Computation
Obtaining shorter regular expressions from finite-state automata

Theoretical Computer Science
Regular expressions: new results and open problems

Journal of Automata, Languages and Combinatorics
Finite Automata, Digraph Connectivity, and Regular Expression Size

ICALP '08 Proceedings of the 35th international colloquium on Automata, Languages and Programming, Part II
Succinctness of Regular Expressions with Interleaving, Intersection and Counting

MFCS '08 Proceedings of the 33rd international symposium on Mathematical Foundations of Computer Science
Complexity measures for regular expressions

Journal of Computer and System Sciences
Optimal lower bounds on regular expression size using communication complexity

FOSSACS'08/ETAPS'08 Proceedings of the Theory and practice of software, 11th international conference on Foundations of software science and computational structures
Approximation to the smallest regular expression for a given regular language

CIAA'04 Proceedings of the 9th international conference on Implementation and Application of Automata
Acyclic automata with easy-to-find short regular expressions

CIAA'05 Proceedings of the 10th international conference on Implementation and Application of Automata

Implementation of State Elimination Using Heuristics

CIAA '09 Proceedings of the 14th International Conference on Implementation and Application of Automata
Short Regular Expressions from Finite Automata: Empirical Results

CIAA '09 Proceedings of the 14th International Conference on Implementation and Application of Automata
The complexity of regular(-like) expressions

DLT'10 Proceedings of the 14th international conference on Developments in language theory
Succinctness of the Complement and Intersection of Regular Expressions

ACM Transactions on Computational Logic (TOCL)
Series-Parallel Automata and Short Regular Expressions

Fundamenta Informaticae
State Elimination Heuristics for Short Regular Expressions

Fundamenta Informaticae

Quantified Score

Hi-index	0.00

Visualization

Abstract

We study the problem of finding good elimination orderings for the state elimination algorithm, which is one of the most popular algorithms for the conversion of finite automata into equivalent regular expressions. Based on graph separator techniques we are able to describe elimination strategies that remove states in large induced subgraphs that are "simple" like, e.g., independent sets or subgraphs of bounded treewidth, of the underlying automaton, that lead to regular expressions of moderate size. In particular, we show that there is an elimination ordering such that every language over a binary alphabet accepted by an n-state deterministicfinite automaton has alphabetic width at most O(1.742n), which is, to our knowledge, the algorithm with currently the best known performance guarantee. Finally, we apply our technique to the question on the effect of language operations on regular expression size. In case of the intersection operation we prove an upper bound which matches, up to a small factor, a lower bound recently obtained in [9,10], and thus settles an open problem stated in [7].