Efficient logging for enterprise workloads on column-oriented in-memory databases

  • Authors:
  • Johannes Wust;Joos-Hendrick Boese;Frank Renkes;Sebastian Blessing;Jens Krueger;Hasso Plattner

  • Affiliations:
  • Hasso Plattner Institute, Potsdam, Germany;SAP AG, Walldorf, Germany;SAP AG, Walldorf, Germany;Hasso Plattner Institute, Potsdam, Germany;Hasso Plattner Institute, Potsdam, Germany;Hasso Plattner Institute, Potsdam, Germany

  • Venue:
  • Proceedings of the 21st ACM international conference on Information and knowledge management
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

The introduction of a 64 bit address space in commodity operating systems and the constant drop in hardware prices made large capacities of main memory in the order of terabytes technically feasible and economically viable. Especially column-oriented in-memory databases are a promising platform to improve data management for enterprise applications. As in-memory databases hold the primary persistence in volatile memory, some form of recovery mechanism is required to prevent potential data loss in case of failures. Two desirable characteristics of any recovery mechanism are (1) that it has a minimal impact on the running system, and (2) that the system recovers quickly and without any data loss after a failure. This paper introduces an efficient logging mechanism for dictionary-compressed column structures that addresses these two characteristics by (1) reducing the overall log size by writing dictionary-compressed values and (2) allowing for parallel writing and reading of log files. We demonstrate the efficiency of our logging approach by comparing the resulting log-file size with traditional logical logging on a workload produced by a productive enterprise system.