Speeding Up HMM Decoding and Training by Exploiting Sequence Repetitions

Authors:
Yury Lifshits; Shay Mozes; Oren Weimann; Michal Ziv-Ukelson
Affiliations:
California Institute of Technology, 1200 E. California Blvd., 91125, Pasadena, CA, USA; Brown University, Department of Computer Science, 02912-1910, Providence, RI, USA; MIT Computer Science and Artificial Intelligence Laboratory, 32 Vassar Street, 02139, Cambridge, MA, USA; Ben Gurion University of the Negev, Computer Science Department, 84105, Beer-Sheva, Israel
Venue:
Algorithmica
Year:
2009



Abstract

We present a method to speed up the dynamic programming algorithms used for solving the HMM decoding and training problems for discrete time-independent HMMs. We discuss the application of our method to Viterbi’s decoding and training algorithms (IEEE Trans. Inform. Theory IT-13:260–269, 1967), as well as to the forward-backward and Baum-Welch (Inequalities 3:1–8, 1972) algorithms. Our approach is based on identifying repeated substrings in the observed input sequence. First, we show how to exploit repetitions of all sufficiently small substrings (this is similar to the Four Russians method). Then, we describe four algorithms based, respectively, on run length encoding (RLE), Lempel-Ziv (LZ78) parsing, grammar-based compression (SLP), and byte pair encoding (BPE). Compared to Viterbi’s algorithm, we achieve speedups of $\Theta(\log n)$ using the Four Russians method, $\Omega(\frac{r}{\log r})$ using RLE, $\Omega(\frac{\log n}{k})$ using LZ78, $\Omega(\frac{r}{k})$ using SLP, and $\Omega(r)$ using BPE, where $k$ is the number of hidden states, $n$ is the length of the observed sequence, and $r$ is its compression ratio (under each compression scheme). Our experimental results demonstrate that our new algorithms are indeed faster in practice. We also discuss a parallel implementation of our algorithms.
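The idea of exploiting repetitions can be illustrated for the RLE case with a minimal Python sketch. It is not the authors' implementation: it assumes that one Viterbi step can be written as a matrix-vector product in the (max, ×) semiring, and that a run of one repeated symbol can therefore be handled by raising that symbol's matrix to a power with repeated squaring. All function names, the `trans[j][i]` indexing convention (transition from state j to state i), and the toy parameters are assumptions made for illustration. The sketch computes only the probability of the most likely state path; the traceback that recovers the path itself is omitted.

```python
from itertools import groupby


def maxtimes_matmul(A, B):
    """(max, *) product of two k x k matrices: C[i][j] = max_l A[i][l] * B[l][j]."""
    k = len(A)
    return [[max(A[i][l] * B[l][j] for l in range(k)) for j in range(k)]
            for i in range(k)]


def maxtimes_matvec(A, v):
    """(max, *) product of a k x k matrix with a length-k vector."""
    k = len(A)
    return [max(A[i][j] * v[j] for j in range(k)) for i in range(k)]


def symbol_matrix(trans, emit, sym):
    """M[i][j] = emit[i][sym] * trans[j][i]: move from state j to i, then emit sym."""
    k = len(trans)
    return [[emit[i][sym] * trans[j][i] for j in range(k)] for i in range(k)]


def matrix_power(M, n):
    """(max, *) power of M for n >= 1, using O(log n) matrix products."""
    result = None
    base = M
    while n > 0:
        if n & 1:
            result = base if result is None else maxtimes_matmul(base, result)
        n >>= 1
        if n > 0:
            base = maxtimes_matmul(base, base)
    return result


def viterbi_prob_plain(trans, emit, init, obs):
    """Baseline: one matrix-vector product per observed symbol."""
    v = list(init)
    for sym in obs:
        v = maxtimes_matvec(symbol_matrix(trans, emit, sym), v)
    return max(v)


def viterbi_prob_rle(trans, emit, init, obs):
    """Same quantity, but each run of a repeated symbol is processed at once."""
    v = list(init)
    for sym, run in groupby(obs):
        length = sum(1 for _ in run)
        run_matrix = matrix_power(symbol_matrix(trans, emit, sym), length)
        v = maxtimes_matvec(run_matrix, v)
    return max(v)


# Toy two-state model (hypothetical numbers) and a highly repetitive sequence.
trans = [[0.7, 0.3], [0.4, 0.6]]   # trans[j][i]: probability of moving j -> i
emit = [[0.9, 0.1], [0.2, 0.8]]    # emit[i][s]: probability that state i emits s
init = [0.5, 0.5]                  # distribution before the first transition
obs = [0] * 6 + [1] * 4 + [0] * 5
print(viterbi_prob_plain(trans, emit, init, obs))
print(viterbi_prob_rle(trans, emit, init, obs))
```

The two functions should agree up to floating-point rounding, because the (max, ×) matrix product is associative for non-negative entries. In this sketch a run of length l costs O(k^3 log l) instead of O(k^2 l), so it only pays off when runs are long relative to k; a practical version would also work in log space to avoid underflow on long sequences.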