Task scheduling for GPU accelerated OLAP systems

Authors:
Lubomir Riha;Colin Shea;Maria Malik;Tarek El-Ghazawi
Affiliations:
The George Washington University, Washington, DC;The George Washington University, Washington, DC;The George Washington University, Washington, DC;The George Washington University, Washington, DC
Venue:
Proceedings of the 2011 Conference of the Center for Advanced Studies on Collaborative Research
Year:
2011

Citing 14
Cited 1

Sensitivity-driven co-synthesis of distributed embedded systems

ISSS '95 Proceedings of the 8th international symposium on System synthesis
An array-based algorithm for simultaneous multidimensional aggregates

SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data
A hardware-software cosynthesis technique based on heterogeneous multiprocessor scheduling

CODES '99 Proceedings of the seventh international workshop on Hardware/software codesign
Bottom-up computation of sparse and Iceberg CUBE

SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
Techniques for mapping tasks to machines in heterogeneous computing systems

Journal of Systems Architecture: the EUROMICRO Journal - Heterogeneous distributed and parallel architectures: hardware, software and design tools
Efficient Organization of Large Multidimensional Arrays

Proceedings of the Tenth International Conference on Data Engineering
Process Partitioning for Distributed Embedded Systems

CODES '96 Proceedings of the 4th International Workshop on Hardware/Software Co-Design
A Comparison Study of Static Mapping Heuristics for a Class of Meta-Tasks on Heterogeneous Computing Systems

HCW '99 Proceedings of the Eighth Heterogeneous Computing Workshop
Computing Multidimensional Aggregates in Parallel

ICPADS '98 Proceedings of the 1998 International Conference on Parallel and Distributed Systems
Top-Down Computation of Partial ROLAP Data Cubes

HICSS '04 Proceedings of the Proceedings of the 37th Annual Hawaii International Conference on System Sciences (HICSS'04) - Track 8 - Volume 8
Fast computation of database operations using graphics processors

SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
Exploring graphics processing units as parallel coprocessors for online aggregation

DOLAP '10 Proceedings of the ACM 13th international workshop on Data warehousing and OLAP
Comparing GPU and CPU in OLAP cubes creation

SOFSEM'11 Proceedings of the 37th international conference on Current trends in theory and practice of computer science
MOGAC: a multiobjective genetic algorithm for hardware-software cosynthesis of distributed embedded systems

IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems

An Adaptive Hybrid OLAP Architecture with optimized memory access patterns

Cluster Computing

Quantified Score

Hi-index	0.01

Visualization

Abstract

Processing data related to business intelligence is an ever important and complex task set. One approach to answering multi-faceted analytical queries quickly is online analytical processing, or OLAP. OLAP allows for quick query response times thanks to its use of n-dimensional models referred to as OLAP cubes. As with all data laden systems, OLAP systems are dealing with ever-increasing dimensionality of the data cube while expecting system responsiveness to be maintained. As queries become more complex and the dimensionality of the cube grows ever larger, runtime required to aggregate queries increases. To coalesce these requirements, without impacting the apparent dimensionality of the cube, an agile method for reporting must be found. In this paper we propose a task-scheduling algorithm for GPU accelerated OLAP systems. This scheduling algorithm looks to balance the GPU and CPU load to meet a minimally acceptable completion time for OLAP queries. A partial in memory cube is formed using highest-level general queries. To ensure fast response time of aggregations, the cube is restricted in dimensionality. If a query requires data outside of the dimensional cube, or the time to search the cube is greater than the time to execute a raw aggregation, the task is scheduled on the GPU for processing. Our evaluation of the heterogeneous task scheduler shows a performance increase of 8.5x over a CPU only OLAP system.