Surveying the landscape: an in-depth analysis of spatial database workloads

Authors:
Bogdan Simion;Suprio Ray;Angela Demke Brown
Affiliations:
University of Toronto;University of Toronto;University of Toronto
Venue:
Proceedings of the 20th International Conference on Advances in Geographic Information Systems
Year:
2012

Abstract

Spatial databases are increasingly important for a wide variety of real-world applications, such as land surveying, urban planning, cartography and location-based services. However, spatial database workload properties are not well understood. For example, it is unknown to what degree one spatial application resembles another in terms of resource demand, or how that demand will change as more concurrent queries (i.e., more users) are added. We show that spatial workloads have a different CPU execution profile than well-studied decision support workloads, as represented by TPC-H. We present a framework to automatically classify spatial queries and characterize spatial workload mixes. We first analyze the resource consumption (i.e., computation and I/O) of a representative set of spatial queries and classify them into five distinct categories. Next, we create five homogeneous spatial workloads, each composed of queries from one of these classes. We then vary database-specific parameters (e.g., the buffer pool size) and workload-specific parameters (e.g., the query mix) to characterize a workload in terms of CPU utilization and I/O activity trends. We study workloads simulating real-world spatial database applications and show how our framework can classify them and predict resource utilization trends under various settings. This can provide clues to the database administrator regarding which resources are heavily contended and can guide resource upgrades. We further validate our approach by applying it to a much larger dataset and to a second DBMS.
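
The classification step described in the abstract can be illustrated with a small sketch. This is not the authors' framework: the abstract does not say which features or algorithm they use, nor does it name the five categories. The sketch assumes each query has a measured CPU time and I/O wait time, and it bins queries into five equal-width classes by CPU share. The labels, function names, query names and sample numbers are all hypothetical.

```python
from collections import defaultdict

# Hypothetical labels: the abstract does not name the paper's five categories.
LABELS = ["io-heavy", "io-leaning", "balanced", "cpu-leaning", "cpu-heavy"]


def cpu_share(cpu_s, io_s):
    """Fraction of a query's measured time that was spent on the CPU."""
    total = cpu_s + io_s
    if total <= 0:
        raise ValueError("query must have positive measured time")
    return cpu_s / total


def classify(cpu_s, io_s):
    """Map a query to one of five equal-width bins of CPU share (an assumption)."""
    share = cpu_share(cpu_s, io_s)
    # A share of exactly 1.0 would index past the end, so clamp to the last bin.
    index = min(int(share * len(LABELS)), len(LABELS) - 1)
    return LABELS[index]


def group_queries(measurements):
    """Group query names by class; measurements maps name -> (cpu_s, io_s)."""
    groups = defaultdict(list)
    for name, (cpu_s, io_s) in measurements.items():
        groups[classify(cpu_s, io_s)].append(name)
    return dict(groups)


# Made-up measurements in seconds, for illustration only.
sample = {
    "polygon_overlaps_polygon": (9.0, 1.0),
    "point_in_bbox": (1.0, 9.0),
    "line_intersects_polygon": (5.0, 5.0),
}

if __name__ == "__main__":
    for label, names in group_queries(sample).items():
        print(label, names)
```

Each group produced this way could serve as a homogeneous workload of the kind the abstract describes, to be replayed while varying parameters such as buffer pool size. A real study would more likely derive the classes from measured profiles than from fixed thresholds.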