Efficient transaction processing in SAP HANA database: the end of a column store myth

  • Authors:
  • Vishal Sikka;Franz Färber;Wolfgang Lehner;Sang Kyun Cha;Thomas Peh;Christof Bornhövd

  • Affiliations:
  • SAP, Palo Alto, CA, USA;SAP, Walldorf, Germany;SAP, Walldorf, Germany;SAP, Seoul, South Korea;SAP, Walldorf, Germany;SAP, Palo Alto, USA

  • Venue:
  • SIGMOD '12 Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

The SAP HANA database is the core of SAP's new data management platform. The overall goal of the SAP HANA database is to provide a generic but powerful system for different query scenarios, both transactional and analytical, on the same data representation within a highly scalable execution environment. Within this paper, we highlight the main features that differentiate the SAP HANA database from classical relational database engines. Therefore, we outline the general architecture and design criteria of the SAP HANA in a first step. In a second step, we challenge the common belief that column store data structures are only superior in analytical workloads and not well suited for transactional workloads. We outline the concept of record life cycle management to use different storage formats for the different stages of a record. We not only discuss the general concept but also dive into some of the details of how to efficiently propagate records through their life cycle and moving database entries from write-optimized to read-optimized storage formats. In summary, the paper aims at illustrating how the SAP HANA database is able to efficiently work in analytical as well as transactional workload environments.