ACM Transactions on Database Systems (TODS)
The design and analysis of spatial data structures
The design and analysis of spatial data structures
Data cache management using frequency-based replacement
SIGMETRICS '90 Proceedings of the 1990 ACM SIGMETRICS conference on Measurement and modeling of computer systems
Query evaluation techniques for large databases
ACM Computing Surveys (CSUR)
Intelligent caching: selecting, representing, and reusing data in an information server
CIKM '94 Proceedings of the third international conference on Information and knowledge management
Multiple-query optimization at algorithm-level
Data & Knowledge Engineering
Workload scheduling for multiple query processing
Information Processing Letters
An overview of query optimization in relational systems
PODS '98 Proceedings of the seventeenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
Capacity planning for Web performance: metrics, models, and methods
Capacity planning for Web performance: metrics, models, and methods
IO-lite: a unified I/O buffering and caching system
OSDI '99 Proceedings of the third symposium on Operating systems design and implementation
MOCHA: a self-extensible database middleware system for distributed data sources
SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Efficient and extensible algorithms for multi query optimization
SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
The working set model for program behavior
Communications of the ACM
The state of the art in distributed query processing
ACM Computing Surveys (CSUR)
Materialized view selection and maintenance using multi-query optimization
SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Numerical Methods for Scientists and Engineers
Numerical Methods for Scientists and Engineers
Improving Proxy Cache Performance: Analysis of Three Replacement Policies
IEEE Internet Computing
Visualization of Large Data Sets with the Active Data Repository
IEEE Computer Graphics and Applications
Common Subexpression Processing in Multiple-Query Processing
IEEE Transactions on Knowledge and Data Engineering
Semantic Caching and Query Processing
IEEE Transactions on Knowledge and Data Engineering
Titan: A High-Performance Remote Sensing Database
ICDE '97 Proceedings of the Thirteenth International Conference on Data Engineering
Batch Scheduling in Parallel Database Systems
Proceedings of the Ninth International Conference on Data Engineering
Optimizing Queries with Materialized Views
ICDE '95 Proceedings of the Eleventh International Conference on Data Engineering
Multiple Query Processing in Deductive Databases using Query Graphs
VLDB '86 Proceedings of the 12th International Conference on Very Large Data Bases
Semantic Data Caching and Replacement
VLDB '96 Proceedings of the 22th International Conference on Very Large Data Bases
Query Scheduling in Multi Query Optimization
IDEAS '01 Proceedings of the International Database Engineering & Applications Symposium
The Volcano Optimizer Generator: Extensibility and Efficient Search
Proceedings of the Ninth International Conference on Data Engineering
Active Proxy-G: optimizing the query execution process in the grid
Proceedings of the 2002 ACM/IEEE conference on Supercomputing
Dynamic Querying of Streaming Data with the dQUOB System
IEEE Transactions on Parallel and Distributed Systems
A high performance multi-perspective vision studio
ICS '03 Proceedings of the 17th annual international conference on Supercomputing
Armada: A Parallel File System for Computational Grids
CCGRID '01 Proceedings of the 1st International Symposium on Cluster Computing and the Grid
A Component Based Services Architecture for Building Distributed Applications
HPDC '00 Proceedings of the 9th IEEE International Symposium on High Performance Distributed Computing
Dynamic Caching of Query Results for Decision Support Systems
SSDBM '99 Proceedings of the 11th International Conference on Scientific and Statistical Database Management
The EELRU adaptive replacement algorithm
Performance Evaluation
Multiple Query Optimization by Cache-Aware Middleware Using Query Teamwork
ICDE '02 Proceedings of the 18th International Conference on Data Engineering
(R) Prefetching and Caching for Query Scheduling in a Special Class of Distributed Applications
ICPP '96 Proceedings of the Proceedings of the 1996 International Conference on Parallel Processing - Volume 3
Parallel aggregation on multi-dimensional scientific datasets
Parallel aggregation on multi-dimensional scientific datasets
Multiple query optimization support for data analysis applications
Multiple query optimization support for data analysis applications
IEEE Transactions on Knowledge and Data Engineering
Simultaneous optimization of complex mining tasks with a knowledgeable cache
Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
Cost-aware WWW proxy caching algorithms
USITS'97 Proceedings of the USENIX Symposium on Internet Technologies and Systems on USENIX Symposium on Internet Technologies and Systems
Dynamic function placement for data-intensive cluster computing
ATEC '00 Proceedings of the annual conference on USENIX Annual Technical Conference
Elastic Cloud Caches for Accelerating Service-Oriented Computations
Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis
Site-autonomous distributed semantic caching
Proceedings of the 2011 ACM Symposium on Applied Computing
Making a case for distributed file systems at Exascale
Proceedings of the third international workshop on Large-scale system and application performance
Proceedings of the 11th IFIP WG 6.1 international conference on Distributed applications and interoperable systems
Managing a fragmented XML data cube with oracle and timesten
Proceedings of the fifteenth international workshop on Data warehousing and OLAP
Hi-index | 0.00 |
In this paper, we present a multi-query optimization framework based on the concept of active semantic caching. The framework permits the identification and transparent reuse of data and computation in the presence of multiple queries (or query batches) that specify user-defined operators and aggregations originating from scientific data-analysis applications. We show how query scheduling techniques, coupled with intelligent cache replacement policies, can further improve the performance of query processing by leveraging the active semantic caching operators. We also propose a methodology for functionally decomposing complex queries in terms of primitives so that multiple reuse sites are exposed to the query optimizer, to increase the amount of reuse. The optimization framework and the database system implemented with it are designed to be efficient irrespective of the underlying parallel and/or distributed machine configuration. We present experimental results highlighting the performance improvements obtained by our methods using real scientific data-analysis applications on multiple parallel and distributed processing configurations (e.g., single symmetric multiprocessor (SMP) machine, cluster of SMP nodes, and a Grid computing configuration).