On Workload Characterization of Relational Database Environments
IEEE Transactions on Software Engineering
Cluster analysis and workload classification
ACM SIGMETRICS Performance Evaluation Review
Memory system characterization of commercial workloads
Proceedings of the 25th annual international symposium on Computer architecture
Performance characterization of a Quad Pentium Pro SMP using OLTP workloads
Proceedings of the 25th annual international symposium on Computer architecture
An analysis of database workload performance on simultaneous multithreaded processors
Proceedings of the 25th annual international symposium on Computer architecture
Characterizing Web user sessions
ACM SIGMETRICS Performance Evaluation Review
Automatically classifying database workloads
Proceedings of the eleventh international conference on Information and knowledge management
Characterization of database access pattern for analytic prediction of buffer hit probability
The VLDB Journal — The International Journal on Very Large Data Bases
DBMSs on a Modern Processor: Where Does Time Go?
VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
PROMISE: Predicting Query Behavior to Enable Predictive Caching Strategies for OLAP Systems
DaWaK 2000 Proceedings of the Second International Conference on Data Warehousing and Knowledge Discovery
Workload Characterization Issues and Methodologies
Performance Evaluation: Origins and Directions
How java programs interact with virtual machines at the microarchitectural level
OOPSLA '03 Proceedings of the 18th annual ACM SIGPLAN conference on Object-oriented programing, systems, languages, and applications
Characteristics of production database workloads and the TPC benchmarks
IBM Systems Journal - End-to-end security
Developing a characterization of business intelligence workloads for sizing new database systems
Proceedings of the 7th ACM international workshop on Data warehousing and OLAP
Towards workload-aware dbmss: identifying workload type and predicting its change
Towards workload-aware dbmss: identifying workload type and predicting its change
Resource Selection for Autonomic Database Tuning
ICDEW '05 Proceedings of the 21st International Conference on Data Engineering Workshops
VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
Dynamic resource allocation for database servers running on virtual storage
FAST '09 Proccedings of the 7th conference on File and storage technologies
Jackpine: A benchmark to evaluate spatial database performance
ICDE '11 Proceedings of the 2011 IEEE 27th International Conference on Data Engineering
A parallel spatial data analysis infrastructure for the cloud
Proceedings of the 21st ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems
Hi-index | 0.00 |
Spatial databases are increasingly important for a wide variety of real-world applications, such as land surveying, urban planning, cartography and location-based services. However, spatial database workload properties are not well-understood. For example, it is unknown to what degree one spatial application resembles another in terms of resource demand, or how the demand will change as more concurrent queries (i.e., more users) are added. We show that spatial workloads have a different CPU execution profile than well-studied decision support workloads, as represented by TPC-H. We present a framework to automatically classify spatial queries and characterize spatial workload mixes. We first analyze the resource consumption (i.e., computation and I/O) of a representative set of spatial queries, which are then classified into five distinct categories. Next, we create five homogeneous spatial workloads, each composed of queries from one of these classes. We then vary database-specific parameters (e.g., the buffer pool size) and workload specific parameters (e.g., the query mix), to characterize a workload in terms of CPU utilization and I/O activity trends. We study workloads simulating real-world spatial database applications and show how our framework can classify them and predict resource utilization trends under various settings. This can provide clues to the database administrator regarding which resources are heavily contended and can guide resource upgrades. We further validate our approach by applying it to a much larger dataset, and to a second DBMS.