Fast and Efficient Compression of Floating-Point Data

  • Authors:
  • Peter Lindstrom; Martin Isenburg

  • Affiliations:
  • IEEE;-

  • Venue:
  • IEEE Transactions on Visualization and Computer Graphics
  • Year:
  • 2006

Visualization

Abstract

Large-scale scientific simulation codes typically run on a cluster of CPUs that write/read time steps to/from a single file system. As data sets are constantly growing in size, this increasingly leads to I/O bottlenecks. When the rate at which data is produced exceeds the available I/O bandwidth, the simulation stalls and the CPUs sit idle. Data compression can alleviate this problem by using some CPU cycles to reduce the amount of data that must be transferred. Most compression schemes, however, are designed to operate offline and seek to maximize compression, not throughput. Furthermore, they often require quantizing floating-point values onto a uniform integer grid, which disqualifies their use in applications where exact values must be retained. We propose a simple scheme for lossless, online compression of floating-point data that transparently integrates into the I/O of many applications. A plug-in scheme for data-dependent prediction makes our scheme applicable to a wide variety of data used in visualization, such as unstructured meshes, point sets, images, and voxel grids. We achieve state-of-the-art compression rates and speeds, the latter in part due to an improved entropy coder. We demonstrate that this significantly accelerates I/O throughput in real simulation runs. Unlike previous schemes, our method also adapts well to variable-precision floating-point and integer data.
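
The general idea of lossless, prediction-based floating-point coding can be illustrated with a minimal sketch. This is not the authors' implementation: the previous-value predictor, the 32-bit width, and all function names below are assumptions chosen for illustration, and the entropy-coding stage is omitted entirely. The sketch maps each float to an integer whose ordering matches the float ordering, predicts each value from the one before it, and keeps the integer residual, which stays small when neighboring values are close.

```python
import struct

MASK32 = 0xFFFFFFFF
SIGN32 = 0x80000000

def float_to_ordered_uint(x):
    """Map a float32 value to a uint32 whose integer order matches float order."""
    (bits,) = struct.unpack("<I", struct.pack("<f", x))
    # Negative floats: flip every bit. Non-negative floats: set the sign bit.
    return bits ^ MASK32 if bits & SIGN32 else bits | SIGN32

def ordered_uint_to_float(u):
    """Invert float_to_ordered_uint exactly."""
    bits = u ^ SIGN32 if u & SIGN32 else u ^ MASK32
    (x,) = struct.unpack("<f", struct.pack("<I", bits))
    return x

def encode(values):
    """Return integer residuals against a previous-value predictor (assumed)."""
    residuals = []
    pred = 0  # hypothetical initial prediction
    for x in values:
        u = float_to_ordered_uint(x)
        residuals.append(u - pred)
        pred = u
    return residuals

def decode(residuals):
    """Rebuild the original float32 values bit for bit from the residuals."""
    values = []
    pred = 0
    for r in residuals:
        u = pred + r
        values.append(ordered_uint_to_float(u))
        pred = u
    return values

def residual_bits(residuals):
    """Magnitude bit length of each residual, a rough proxy for coding cost."""
    return [abs(r).bit_length() for r in residuals]
```

Because the residuals are computed on integers, decoding reproduces every input bit, so no quantization is involved. A real coder would replace the previous-value predictor with one suited to the data (for example, a predictor that uses mesh or grid neighbors) and would entropy-code the residuals rather than store them as plain integers.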