Data structures: time, I/Os, entropy, joules!
ESA'10 Proceedings of the 18th annual European conference on Algorithms: Part II
Grammar-based compression in a streaming model
LATA'10 Proceedings of the 4th international conference on Language and Automata Theory and Applications
Hi-index | 0.00 |
Re-Pair is a dictionary-based compression method invented in 1999 by Larssonand Moffat. Although its practical performance has been established through experiments, the method has resisted all attempts of formal analysis. In thispaper we show that Re-Pair compresses a sequence T[1,n] over an alphabet ofsize $\sigma$ and k-th order entropy H_k, to at most 2nH_k+o(n\log\sigma)bits, for any k=o(log_sigma n).