We address the problem of improving variable-length-to-fixed-length codes (VF codes), which have favourable properties for fast compressed pattern matching but achieve only moderate compression ratios. The compression ratio of a VF code depends on the parse tree used as its dictionary. We propose a method that trains a parse tree by scanning an input text repeatedly, and we show experimentally that it rapidly improves the compression ratio of VF codes to the level of state-of-the-art compression methods.
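To make the role of the parse tree concrete, the following is a minimal sketch of a VF code whose dictionary is built by the classical Tunstall construction. It is not the training method proposed here; it only illustrates how the leaves of a parse tree map variable-length phrases to fixed-length codewords. The function names, the padding of the final phrase, and the sample text are assumptions made for illustration.

```python
import heapq
from collections import Counter


def build_tunstall_dictionary(text, codeword_bits):
    """Return the leaf strings of a Tunstall-style parse tree for `text`.

    Assumption: symbol probabilities are estimated from `text` itself.
    """
    counts = Counter(text)
    if len(counts) < 2:
        raise ValueError("need at least two distinct symbols")
    total = len(text)
    probs = {s: c / total for s, c in counts.items()}
    max_leaves = 2 ** codeword_bits
    if len(probs) > max_leaves:
        raise ValueError("alphabet does not fit in the codeword space")
    # Max-heap of leaves, keyed by negated probability.
    heap = [(-p, s) for s, p in probs.items()]
    heapq.heapify(heap)
    # Expanding a leaf removes one leaf and adds len(probs) children.
    while len(heap) + len(probs) - 1 <= max_leaves:
        neg_p, leaf = heapq.heappop(heap)  # most probable leaf
        for s, p in probs.items():
            heapq.heappush(heap, (neg_p * p, leaf + s))
    return sorted(leaf for _, leaf in heap)


def vf_encode(text, leaves):
    """Parse `text` into leaf phrases; return (codeword indices, text length)."""
    index = {leaf: i for i, leaf in enumerate(leaves)}
    max_len = max(map(len, leaves))
    codes, pos = [], 0
    while pos < len(text):
        # Leaves are prefix-free, so the first match is the only match.
        for length in range(1, max_len + 1):
            chunk = text[pos:pos + length]
            if chunk in index:
                codes.append(index[chunk])
                pos += length
                break
        else:
            # The tail stops at an internal node: emit any leaf that
            # extends it; the decoder truncates to the recorded length.
            tail = text[pos:]
            codes.append(next(i for i, leaf in enumerate(leaves)
                              if leaf.startswith(tail)))
            pos = len(text)
    return codes, len(text)


def vf_decode(codes, leaves, original_length):
    """Concatenate the phrases for `codes` and drop any padding."""
    return "".join(leaves[c] for c in codes)[:original_length]


if __name__ == "__main__":
    sample = "abracadabra" * 5
    leaves = build_tunstall_dictionary(sample, codeword_bits=4)
    codes, n = vf_encode(sample, leaves)
    assert vf_decode(codes, leaves, n) == sample
    print(len(sample), "symbols ->", len(codes), "codewords of 4 bits")
```

Because every codeword has the same width, codeword boundaries in the compressed stream are known in advance, which is the property that makes VF codes attractive for compressed pattern matching. The quality of the leaf set alone determines how many symbols each codeword covers, and that is the quantity a better parse tree improves.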