Cost-based query scrambling for initial delays

Authors:
Tolga Urhan;Michael J. Franklin;Laurent Amsaleg
Affiliations:
University of Maryland;University of Maryland;IRISA
Venue:
SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Year:
1998

Citing 21
Cited 74

Join processing in database systems with large main memories

ACM Transactions on Database Systems (TODS)
R* optimizer validation and performance evaluation for local queries

SIGMOD '86 Proceedings of the 1986 ACM SIGMOD international conference on Management of data
Query optimization by simulated annealing

SIGMOD '87 Proceedings of the 1987 ACM SIGMOD international conference on Management of data
Randomized algorithms for optimizing large join queries

SIGMOD '90 Proceedings of the 1990 ACM SIGMOD international conference on Management of data
Query optimization for parallel execution

SIGMOD '92 Proceedings of the 1992 ACM SIGMOD international conference on Management of data
Query evaluation techniques for large databases

ACM Computing Surveys (CSUR)
Modern database systems: the object model, interoperability, and beyond

Modern database systems: the object model, interoperability, and beyond
Pegasus: a heterogeneous information management system

Modern database systems
Optimization of dynamic query evaluation plans

SIGMOD '94 Proceedings of the 1994 ACM SIGMOD international conference on Management of data
Reducing multidatabase query response time by tree balancing

SIGMOD '95 Proceedings of the 1995 ACM SIGMOD international conference on Management of data
The Garlic project

SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
Dynamic Query Operator Scheduling for Wide-Area Remote Access

Distributed and Parallel Databases
Querying the World Wide Web

DIS '96 Proceedings of the fourth international conference on on Parallel and distributed information systems
Scrambling query plans to cope with unexpected delays

DIS '96 Proceedings of the fourth international conference on on Parallel and distributed information systems
Access path selection in a relational database management system

SIGMOD '79 Proceedings of the 1979 ACM SIGMOD international conference on Management of data
InterViso: dealing with the complexity of federated database access

The VLDB Journal — The International Journal on Very Large Data Bases
Dynamic Query Optimization in Rdb/VMS

Proceedings of the Ninth International Conference on Data Engineering
The Case for Enhanced Abstract Data Types

VLDB '97 Proceedings of the 23rd International Conference on Very Large Data Bases
Parametric Query Optimization

VLDB '92 Proceedings of the 18th International Conference on Very Large Data Bases
Of Objects and Databases: A Decade of Turmoil

VLDB '96 Proceedings of the 22th International Conference on Very Large Data Bases
Scaling heterogeneous databases and the design of Disco

ICDCS '96 Proceedings of the 16th International Conference on Distributed Computing Systems (ICDCS '96)

Dynamic Query Operator Scheduling for Wide-Area Remote Access

Distributed and Parallel Databases
Least expected cost query optimization: an exercise in utility

PODS '99 Proceedings of the eighteenth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
An adaptive query execution system for data integration

SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
Eddies: continuously adaptive query processing

SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
The state of the art in distributed query processing

ACM Computing Surveys (CSUR)
Fault-tolerant, load-balancing queries in telegraph

SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Characterizing memory requirements for queries over continuous data streams

Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Rate-based query optimization for streaming information sources

Proceedings of the 2002 ACM SIGMOD international conference on Management of data
Continuously adaptive continuous queries over streams

Proceedings of the 2002 ACM SIGMOD international conference on Management of data
Efficient evaluation of queries in a mediator for WebSources

Proceedings of the 2002 ACM SIGMOD international conference on Management of data
Partial results for online query processing

Proceedings of the 2002 ACM SIGMOD international conference on Management of data
Join and multi-join processing in data integration systems

Data & Knowledge Engineering
The cougar approach to in-network query processing in sensor networks

ACM SIGMOD Record
Efficient and Adaptive Processing of Multiple Continuous Queries

EDBT '02 Proceedings of the 8th International Conference on Extending Database Technology: Advances in Database Technology
Introducing QoS to Electronic Commerce Applications

ISEC '01 Proceedings of the Second International Symposium on Topics in Electronic Commerce
Cost Models DO Matter: Providing Cost Information for Diverse Data Sources in a Federated System

VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
Probabilistic Optimization of Top N Queries

VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
LEO - DB2's LEarning Optimizer

Proceedings of the 27th International Conference on Very Large Data Bases
Scheduling Jobs Before Shut-Down

SWAT '00 Proceedings of the 7th Scandinavian Workshop on Algorithm Theory
Query Processing in Self-Profiling Composable Peer-to-Peer Mediator Databases

EDBT '02 Proceedings of the Worshops XMLDM, MDDE, and YRWS on XML-Based Data Management and Multimedia Engineering-Revised Papers
Adaptive Query Processing: A Survey

BNCOD 19 Proceedings of the 19th British National Conference on Databases: Advances in Databases
Developing Evolutionary Cost Models for Query Optimization in a Dynamic Multidatabase Environment

On the Move to Meaningful Internet Systems, 2002 - DOA/CoopIS/ODBASE 2002 Confederated International Conferences DOA, CoopIS and ODBASE 2002
Challenges in Ubiquitous Data Management

Informatics - 10 Years Back. 10 Years Ahead.
Support for Mobile Location-Aware Applications in MAGNET

Revised Papers from the NODe 2002 Web and Database-Related Workshops on Web, Web-Services, and Database Systems
An XML query engine for network-bound data

The VLDB Journal — The International Journal on Very Large Data Bases
Learning response time for WebSources using query feedback and application in query optimization

The VLDB Journal — The International Journal on Very Large Data Bases
Answering queries using views: A survey

The VLDB Journal — The International Journal on Very Large Data Bases
Chain: operator scheduling for memory minimization in data stream systems

Proceedings of the 2003 ACM SIGMOD international conference on Management of data
Quality of service in an information economy

ACM Transactions on Internet Technology (TOIT)
PSoup: a system for streaming queries over streaming data

The VLDB Journal — The International Journal on Very Large Data Bases
Optimizing Recursive Information Gathering Plans in EMERAC

Journal of Intelligent Information Systems
Evolutionary techniques for updating query cost models in a dynamic multidatabase environment

The VLDB Journal — The International Journal on Very Large Data Bases
Adapting to source properties in processing data integration queries

SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
Adaptive ordering of pipelined stream filters

SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
Static optimization of conjunctive queries with sliding windows over infinite streams

SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
Robust query processing through progressive optimization

SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
LEO: An autonomic query optimizer for DB2

IBM Systems Journal
Exploiting k-constraints to reduce memory overhead in continuous queries over data streams

ACM Transactions on Database Systems (TODS)
Operator scheduling in data stream systems

The VLDB Journal — The International Journal on Very Large Data Bases
Self-monitoring query execution for adaptive query processing

Data & Knowledge Engineering
Adaptive Caching for Continuous Queries

ICDE '05 Proceedings of the 21st International Conference on Data Engineering
Adaptive Processing of Top-k Queries in XML

ICDE '05 Proceedings of the 21st International Conference on Data Engineering
TinyDB: an acquisitional query processing system for sensor networks

ACM Transactions on Database Systems (TODS) - Special Issue: SIGMOD/PODS 2003
DAIMON: data integration for a mobile network

Proceedings of the 4th ACM international workshop on Data engineering for wireless and mobile access
Proactive re-optimization

Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Early hash join: a configurable algorithm for the efficient and early production of join results

VLDB '05 Proceedings of the 31st international conference on Very large data bases
Adaptive query processing in mobile environment

MPAC '05 Proceedings of the 3rd international workshop on Middleware for pervasive and ad-hoc computing
Optimizing Cyclic Join View Maintenance over Distributed Data Sources

IEEE Transactions on Knowledge and Data Engineering
Data integration: the teenage years

VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Adaptive rank-aware query optimization in relational databases

ACM Transactions on Database Systems (TODS)
Incremental Evaluation of Sliding-Window Queries over Data Streams

IEEE Transactions on Knowledge and Data Engineering
HybMig: A Hybrid Approach to Dynamic Plan Migration for Continuous Queries

IEEE Transactions on Knowledge and Data Engineering
Streaming queries over streaming data

VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
Maximizing the output rate of multi-way join queries over streaming information sources

VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
Lifting the burden of history from adaptive query processing

VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Adaptive query processing

Foundations and Trends in Databases
Query optimization via contention space partitioning and cost error controlling for dynamic multidatabase systems

Distributed and Parallel Databases
Optimizing complex queries with multiple relation instances

Proceedings of the 2008 ACM SIGMOD international conference on Management of data
Using slice join for efficient evaluation of multi-way joins

Data & Knowledge Engineering
A strategy to develop adaptive and interactive query brokers

IDEAS '08 Proceedings of the 2008 international symposium on Database engineering & applications
Optimizer plan change management: improved stability and performance in Oracle 11g

Proceedings of the VLDB Endowment
Architecture of a Database System

Foundations and Trends in Databases
Query optimizers: time to rethink the contract?

Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
Automation everywhere: autonomics and data management

BNCOD'07 Proceedings of the 24th British national conference on Databases
RRPJ: result-rate based progressive relational join

DASFAA'07 Proceedings of the 12th international conference on Database systems for advanced applications
Combining artificial intelligence and databases for data integration

Artificial intelligence today
Preference query evaluation over expensive attributes

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Run-time adaptivity for search computing

Search computing
ANAPSID: an adaptive query processing engine for SPARQL endpoints

ISWC'11 Proceedings of the 10th international conference on The semantic web - Volume Part I
Progressive query optimization for federated queries

EDBT'06 Proceedings of the 10th international conference on Advances in Database Technology
A foundation for the replacement of pipelined physical join operators in adaptive query processing

EDBT'06 Proceedings of the 2006 international conference on Current Trends in Database Technology
Chapter 12: panta rhei: flexible execution engine for search computing queries

Search Computing
Optimizing adaptive multi-route query processing via time-partitioned indices

Journal of Computer and System Sciences
Shark: SQL and rich analytics at scale

Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data

Quantified Score

Hi-index	0.00

Visualization

Abstract

Remote data access from disparate sources across a wide-area network such as the Internet is problematic due to the unpredictable nature of the communications medium and the lack of knowledge about the load and potential delays at remote sites. Traditional, static, query processing approaches break down in this environment because they are unable to adapt in response to unexpected delays. Query scrambling has been proposed to address this problem. Scrambling modifies query execution plans on-the-fly when delays are encountered during runtime. In its original formulation, scrambling was based on simple heuristics, which although providing good performance in many cases, were also shown to be susceptible to problems resulting from bad scrambling decisions. In this paper we address these shortcomings by investigating ways to exploit query optimization technology to aid in making intelligent scrambling choices. We propose three different approaches to using query optimization for scrambling. These approaches vary, for example, in whether they optimize for total work or response-time, and whether they construct partial or complete alternative plans. Using a two-phase randomized query optimizer, a distributed query processing simulator, and a workload derived from queries of the TPCD benchmark, we evaluate these different approaches and compare their ability to cope with initial delays in accessing remote sources. The results show that cost-based scrambling can effectively hide initial delays, but that in the absence of good predictions of expected delay durations, there are fundamental tradeoffs between risk aversion and effectiveness.