Partial results for online query processing

Authors:
Vijayshankar Raman;Joseph M. Hellerstein
Affiliations:
IBM Almaden Research Center, San Jose, CA;University of California, Berkeley, CA
Venue:
Proceedings of the 2002 ACM SIGMOD international conference on Management of data
Year:
2002

Citing 21
Cited 17

Orienteering in an information landscape: how information seekers get from here to there

CHI '93 Proceedings of the INTERACT '93 and CHI '93 Conference on Human Factors in Computing Systems
Online aggregation

SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data
Efficient mid-query re-optimization of sub-optimal query execution plans

SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Cost-based query scrambling for initial delays

SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
An adaptive query execution system for data integration

SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
An algorithm for ordering subgoals in NAIL?

Proceedings of the seventh ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
Eddies: continuously adaptive query processing

SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
WSQ/DSQ: a practical approach for combined querying of databases and the Web

SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
NiagaraCQ: a scalable continuous query system for Internet databases

SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Online query processing: a tutorial

SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Partial Answers for Unavailable Data Sources

FQAS '98 Proceedings of the Third International Conference on Flexible Query Answering Systems
Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Total

ICDE '96 Proceedings of the Twelfth International Conference on Data Engineering
Optimizing Queries Across Diverse Data Sources

VLDB '97 Proceedings of the 23rd International Conference on Very Large Data Bases
Online Feedback for Nested Aggregate Queries with Multi-Threading

VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
Crawling the Hidden Web

Proceedings of the 27th International Conference on Very Large Data Bases
Dynamic Pipeline Scheduling for Improving Interactive Query Performance

Proceedings of the 27th International Conference on Very Large Data Bases
Approximate Query Processing: Taming the TeraBytes

Proceedings of the 27th International Conference on Very Large Data Bases
Online dynamic reordering

The VLDB Journal — The International Journal on Very Large Data Bases
Query processing and optimization in Oracle Rdb

The VLDB Journal — The International Journal on Very Large Data Bases
Query Formulation from High-Level Concepts for Relational Databases

UIDIS '99 Proceedings of the 1999 User Interfaces to Data Intensive Systems
Interactive query processing

Interactive query processing

Dynamic sample selection for approximate query processing

Proceedings of the 2003 ACM SIGMOD international conference on Management of data
Efficient dynamic mining of constrained frequent sets

ACM Transactions on Database Systems (TODS)
Sophia: an Information Plane for networked systems

ACM SIGCOMM Computer Communication Review
Self-monitoring query execution for adaptive query processing

Data & Knowledge Engineering
Fault-tolerance in the Borealis distributed stream processing system

Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Progressive optimization in a shared-nothing parallel database

Proceedings of the 2007 ACM SIGMOD international conference on Management of data
GridDB: a data-centric overlay for scientific grids

VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Lifting the burden of history from adaptive query processing

VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Fault-tolerance in the borealis distributed stream processing system

ACM Transactions on Database Systems (TODS)
Adaptive query processing

Foundations and Trends in Databases
Speculative plan execution for information gathering

Artificial Intelligence
A strategy to develop adaptive and interactive query brokers

IDEAS '08 Proceedings of the 2008 international symposium on Database engineering & applications
An expressive language and efficient execution system for software agents

Journal of Artificial Intelligence Research
Tweets as data: demonstration of TweeQL and Twitinfo

Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
Processing and visualizing the data in tweets

ACM SIGMOD Record
Progressive query optimization for federated queries

EDBT'06 Proceedings of the 10th international conference on Advances in Database Technology
Realtime analysis of information diffusion in social media

Proceedings of the VLDB Endowment

Quantified Score

Hi-index	0.00

Visualization

Abstract

Traditional query processors generate full, accurate query results, either in batch or in pipelined fashion. We argue that this strict model is too rigid for exploratory queries over diverse and distributed data sources, such as sources on the Internet. Instead, we propose a looser model of querying in which a user submits a broad initial query outline, and the system continually generates partial result tuples that may contain values for only some of the output fields. The user can watch these partial results accumulate at the user interface, and accordingly refine the query by specifying their interest in different kinds of partial results.After describing our querying model and user interface, we present a query processing architecture for this model which is implemented in the Telegraph dataflow system. Our architecture is designed to generate partial results quickly, and to adapt query execution to changing user interests. The crux of this architecture is a dataflow operator that supports two kinds of reorderings: reordering of intermediate tuples within a dataflow, and reordering of query plan operators through which tuples flow. We study reordering policies that optimize for the quality of partial results delivered over time, and experimentally demonstrate the benefits of our architecture in this context.