Order Preserving Compression

Authors:
Gennady Antoshenkov;David B. Lomet;James Murray
Affiliations:
-;-;-
Venue:
ICDE '96 Proceedings of the Twelfth International Conference on Data Engineering
Year:
1996

Citing 0
Cited 9

Query optimization in compressed database systems

SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
The evolution of effective B-tree: page organization and techniques: a personal account

ACM SIGMOD Record
Implementing sorting in database systems

ACM Computing Surveys (CSUR)
B-tree indexes, interpolation search, and skew

DaMoN '06 Proceedings of the 2nd international workshop on Data management on new hardware
Integrating compression and execution in column-oriented database systems

Proceedings of the 2006 ACM SIGMOD international conference on Management of data
XQueC: A query-conscious compressed XML database

ACM Transactions on Internet Technology (TOIT)
Dictionary-based order-preserving string compression for main memory column stores

Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
Modern B-Tree Techniques

Foundations and Trends in Databases
Compacting transactional data in hybrid OLTP&OLAP databases

Proceedings of the VLDB Endowment

Quantified Score

Hi-index	0.01

Visualization

Abstract

Order-preserving compression can improve sorting and searching performance, and hence the performance of database systems. We describe a new parsing (tokenization) technique that can be applied to variable-length "keys", producing substantial compression. It can both compress and decompress data, permitting variable lengths for dictionary entries and compressed forms. The key notion is to partition the space of strings into ranges, encoding the common prefix of each range. We illustrate our method with padding character compression for multi-field keys, demonstrating the dramatic gains possible. A specific version of the method has been implemented in Digital's Rdb relational database system to enable effective multi-field compression.