Efficient transaction processing in SAP HANA database: the end of a column store myth

Authors:
Vishal Sikka;Franz Färber;Wolfgang Lehner;Sang Kyun Cha;Thomas Peh;Christof Bornhövd
Affiliations:
SAP, Palo Alto, CA, USA;SAP, Walldorf, Germany;SAP, Walldorf, Germany;SAP, Seoul, South Korea;SAP, Walldorf, Germany;SAP, Palo Alto, USA
Venue:
SIGMOD '12 Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data
Year:
2012

Citing 15
Cited 7

Query evaluation techniques for large databases

ACM Computing Surveys (CSUR)
Bruce Lindsay speaks out: on System R, benchmarking, life as an IBM fellow, the power of DBAs in the old days, why performance still matters, Heisenbugs, why he still writes code, singing pigs, and more

ACM SIGMOD Record
Data mining with the SAP NetWeaver BI accelerator

VLDB '06 Proceedings of the 32nd international conference on Very large data bases
P*TIME: highly scalable OLTP DBMS for managing update-intensive stream workload

VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
The end of an architectural era: (it's time for a complete rewrite)

VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Reducing outer joins

The VLDB Journal — The International Journal on Very Large Data Bases
A common database approach for OLTP and OLAP using an in-memory column database

Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
SIMD-scan: ultra fast in-memory table scan using on-chip vector processing units

Proceedings of the VLDB Endowment
A plan for OLAP

Proceedings of the 13th International Conference on Extending Database Technology
How to juggle columns: an entropy-based approach for table compression

Proceedings of the Fourteenth International Database Engineering & Applications Symposium
Engineering basic algorithms of an in-memory text search engine

ACM Transactions on Information Systems (TOIS)
Speeding up queries in column stores: a case for compression

DaWaK'10 Proceedings of the 12th international conference on Data warehousing and knowledge discovery
In-Memory Data Management: An Inflection Point for Enterprise Applications

In-Memory Data Management: An Inflection Point for Enterprise Applications
Hybrid data-flow graphs for procedural domain-specific query languages

SSDBM'11 Proceedings of the 23rd international conference on Scientific and statistical database management
SAP HANA database: data management for modern business applications

ACM SIGMOD Record

Data stream warehousing

Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
SynopSys: large graph analytics in the SAP HANA database through summarization

First International Workshop on Graph Data Management Experiences and Systems
Big data challenge: a data management perspective

Frontiers of Computer Science: Selected Publications from Chinese Universities
Lazy data structure maintenance for main-memory analytics over sliding windows

Proceedings of the sixteenth international workshop on Data warehousing and OLAP
Scuba: diving into data at facebook

Proceedings of the VLDB Endowment
SAP HANA: the evolution from a modern main-memory data platform to an enterprise application platform

Proceedings of the VLDB Endowment
Design and evaluation of storage organizations for read-optimized main memory databases

Proceedings of the VLDB Endowment

Quantified Score

Hi-index	0.00

Visualization

Abstract

The SAP HANA database is the core of SAP's new data management platform. The overall goal of the SAP HANA database is to provide a generic but powerful system for different query scenarios, both transactional and analytical, on the same data representation within a highly scalable execution environment. Within this paper, we highlight the main features that differentiate the SAP HANA database from classical relational database engines. Therefore, we outline the general architecture and design criteria of the SAP HANA in a first step. In a second step, we challenge the common belief that column store data structures are only superior in analytical workloads and not well suited for transactional workloads. We outline the concept of record life cycle management to use different storage formats for the different stages of a record. We not only discuss the general concept but also dive into some of the details of how to efficiently propagate records through their life cycle and moving database entries from write-optimized to read-optimized storage formats. In summary, the paper aims at illustrating how the SAP HANA database is able to efficiently work in analytical as well as transactional workload environments.