Adaptive Tuple differential coding

  • Authors:
  • Jean-Paul Deveaux;Andrew Rau-Chaplin;Norbert Zeh

  • Affiliations:
  • Faculty of Computer Science, Dalhousie University, Halifax NS Canada;Faculty of Computer Science, Dalhousie University, Halifax NS Canada;Faculty of Computer Science, Dalhousie University, Halifax NS Canada

  • Venue:
  • DEXA'07 Proceedings of the 18th international conference on Database and Expert Systems Applications
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

It is desirable to employ compression techniques in Relational OLAP systems to reduce disk space requirements and increase disk I/O throughput. Tuple Differential Coding (TDC) techniques have been introduced to compress views on a tuple level by storing only the differences between consecutive ordered tuples. These techniques work well for highly regular data in which the differences between tuples are fairly constant but are less effective on real data containing either skew or outliers. In this paper we introduce Adaptive Tuple Differential Coding (ATDC), which employs optimization techniques to analyze blocks of tuples to detect large tuple differences, with the purpose of isolating them to minimize their negative effect on the compression of neighbouring tuples. Our experiments show that this new algorithm provides an increase in compression ratio of 15-30% over TDC on typical real datasets.