Adapt: adaptive database schema design for multi-tenant applications

Authors:
Jiacai Ni;Guoliang Li;Jun Zhang;Lei Li;Jianhua Feng
Affiliations:
Tsinghua University, Beijing, China;Tsinghua University, Beijing, China;IBM China Development Lab, Beijing, China;IBM China Development Lab, Beijing, China;Tsinghua University, Beijing, China
Venue:
Proceedings of the 21st ACM international conference on Information and knowledge management
Year:
2012

Citing 7
Cited 0

Multi-tenant databases for software as a service: schema-mapping techniques

Proceedings of the 2008 ACM SIGMOD international conference on Management of data
Supporting Database Applications as a Service

ICDE '09 Proceedings of the 2009 IEEE International Conference on Data Engineering
The design of the force.com multitenant internet application development platform

Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
Zephyr: live migration in shared nothing databases for elastic cloud platforms

Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
Extensibility and Data Sharing in evolving multi-tenant databases

ICDE '11 Proceedings of the 2011 IEEE 27th International Conference on Data Engineering
Data privacy preservation during schema evolution for multi-tenancy applications in cloud computing

WISM'11 Proceedings of the 2011 international conference on Web information systems and mining - Volume Part I
Towards Multi-tenant Performance SLOs

ICDE '12 Proceedings of the 2012 IEEE 28th International Conference on Data Engineering

Quantified Score

Hi-index	0.00

Visualization

Abstract

Multi-tenant data management is a major application of software as a Service (SaaS). Many companies outsource their data to a third party which hosts a multi-tenant database system to provide data management service. The system should have high performance, low space and excellent scalability. One big challenge is to devise a high-quality database schema. Independent Tables Shared Instances and Shared Tables Shared Instances are two state-of-the-art methods. However, the former has poor scalability, while the latter achieves good scalability at the expense of poor performance and high space overhead. In this paper, we trade-off between the two methods and propose an adaptive database schema design approach to achieve good scalability and high performance with low space. To this end, we identify the important attributes and use them to generate a base table. For other attributes, we construct supplementary tables. We propose a cost-based model to adaptively generate the tables above. Our method has the following advantages. First, our method achieves high scalability. Second, our method can trade-off performance and space requirement. Third, our method can be easily applied to existing databases (e.g., MySQL) with minor revisions. Fourth, our method can adapt to any schemas and query workloads. Experimental results show our method achieves high performance and good scalability with low space and outperforms state-of-the-art method.