Rigorous Approximated Determinization of Weighted Automata

  • Authors:
  • Benjamin Aminof;Orna Kupferman;Robby Lampert

  • Affiliations:
  • -;-;-

  • Venue:
  • LICS '11 Proceedings of the 2011 IEEE 26th Annual Symposium on Logic in Computer Science
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

A nondeterministic weighted finite automaton (WFA) maps an input word to a numerical value. Applications of weighted automata include formal verification of quantitative properties, as well as text, speech, and image processing. Many of these applications require the WFAs to be deterministic, or work substantially better when the WFAs are deterministic. Unlike NFAs, which can always bedeterminized, not all WFAs have an equivalent deterministic weighted automaton (DWFA). In \cite{Moh97}, Mohri describes a determinization construction for a subclass of WFA. He also describes a property of WFAs (the {\em twins property}), such that all WFAs that satisfy thetwins property are determinizable and the algorithm terminates on them. Unfortunately, many natural WFAs cannot be determinized. In this paper we study {\em approximated determinization\/} of WFAs. We describe an algorithm that, given a WFA $\A$ and an approximation factor $t \geq 1$, constructs a DWFA $\A'$ that{\em $t$-determinizes\/} $\A$. Formally, for all words $w \in \Sigma^*$, the value of $w$ in $\A'$ is at least its value in $\A$ and at most $t$times its value in $\A$. Our construction involves two new ideas:attributing states in the subset construction by both upper and lower residues, and collapsing attributed subsets whose residues can be tightened. The larger the approximation factor is, the more attributed subsets we can collapse. Thus, $t$-determinization is helpful not only for WFAs that cannot be determinized, but also in cases determinization is possible but results in automata that are too big to handle. In addition, $t$-determinization is useful for reasoning about the competitive ratio of on line algorithms. We also describe a property (the {\em $t$-twins property}) and use it in order to characterize $t$-determinizable WFAs. Finally, we describea polynomial algorithm for deciding whether a given WFA has the $t$-twins property.