Autonomous database partitioning using data mining on single computers and cluster computers

Authors:
Liangzhe Li;Le Gruenwald
Affiliations:
University of Oklahoma, Norman, OK;University of Oklahoma, Norman, OK
Venue:
Proceedings of the 16th International Database Engineering & Applications Sysmposium
Year:
2012

Citing 14
Cited 1

Vertical partitioning algorithms for database design

ACM Transactions on Database Systems (TODS)
Vertical partitioning for database design: a graphical algorithm

SIGMOD '89 Proceedings of the 1989 ACM SIGMOD international conference on Management of data
Parallel database systems: the future of high performance database systems

Communications of the ACM
MPI: a message passing interface

Proceedings of the 1993 ACM/IEEE conference on Supercomputing
Efficient mining of association rules using closed itemset lattices

Information Systems
Fundamentals of Computer Alori

Fundamentals of Computer Alori
A Transaction-Based Approach to Vertical Partitioning for Relational Database Systems

IEEE Transactions on Software Engineering
OLAP Query Evaluation in a Database Cluster: A Performance Study on Intra-Query Parallelism

ADBIS '02 Proceedings of the 6th East European Conference on Advances in Databases and Information Systems
Integrating vertical and horizontal partitioning into automated physical database design

SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
Database tuning advisor for microsoft SQL server 2005: demo

Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Self-tuning database systems: a decade of progress

VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Efficient use of the query optimizer for automated physical design

VLDB '07 Proceedings of the 33rd international conference on Very large data bases
The 3rd international workshop on self-managing database systems (SMDB'08)

ICDEW '08 Proceedings of the 2008 IEEE 24th International Conference on Data Engineering Workshop
ElasTraS: an elastic transactional data store in the cloud

HotCloud'09 Proceedings of the 2009 conference on Hot topics in cloud computing

Self-managing online partitioner for databases (SMOPD): a vertical database partitioning system with a fully automatic online approach

Proceedings of the 17th International Database Engineering & Applications Symposium

Quantified Score

Hi-index	0.00

Visualization

Abstract

One of the most important metrics in measuring the performance of a database system is query response time, which is composed of I/O time and CPU time. I/O time is decided by the amount of data read/write from/to disks and how the data is located on disks. CPU time is decided by how the database system performs the query operations. So if we want to reduce the query response time we can reduce either I/O time or CPU time, or both of them. We know retrieving data from disks is much slower than retrieving data from main memory. Hence, one of the common ways to reduce I/O times is clustering data on disks so that queries will access only relevant data. This paper introduces an efficient algorithm, called AutoClust, for automatic database attribute clustering (or also called automatic database vertical partitioning) for single computers as well as cluster computers. It is based on closed item sets mined from queries and their attributes using association rule mining. The paper then presents experimental results comparing the performance of AutoClust with that of a baseline algorithm on both single computers and cluster computers using the TPC-H benchmark running on major commercial database systems. The experiments show that AutoClust has better query costs for both types of computers.