Randomized computations on large data sets: tight lower bounds

  • Authors:
  • Martin Grohe; André Hernich; Nicole Schweikardt

  • Affiliations:
  • Humboldt-Universität, Berlin, Germany (all three authors)

  • Venue:
  • Proceedings of the Twenty-Fifth ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems (PODS)
  • Year:
  • 2006

Abstract

We study the randomized version of a computation model (introduced in [9, 10]) that restricts random access to external memory and internal memory space. Essentially, this model can be viewed as a powerful version of a data stream model: like other data stream models, it puts no cost on sequential scans of external memory, but in addition, like other external memory models (and unlike streaming models), it admits several large external memory devices that can be read and written in parallel.

We obtain tight lower bounds for the decision problems set equality, multiset equality, and checksort. More precisely, we show that any randomized Monte Carlo algorithm with one-sided bounded error for these problems must perform Ω(log N) random accesses to external memory devices, provided that the internal memory size is at most O(N^(1/4)/log N), where N denotes the size of the input data.

From the lower bound for the set equality problem we can infer lower bounds on the worst-case data complexity of query evaluation for the languages XQuery, XPath, and relational algebra on streaming data. More precisely, we show that there exist queries in XQuery, XPath, and relational algebra such that any (randomized) Las Vegas algorithm that evaluates these queries must perform Ω(log N) random accesses to external memory devices, provided that the internal memory size is at most O(N^(1/4)/log N).
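To make the multiset equality problem concrete, the Python sketch below shows the classic scan-only Monte Carlo test via polynomial fingerprinting. It is illustrative only, not a construction from the paper (which proves lower bounds); the names `fingerprint` and `multisets_probably_equal` and the choice of prime are ours. Note the direction of its one-sided error: it may wrongly accept unequal multisets but never rejects equal ones; lower bounds of the kind proved in the paper hinge on exactly which side the error falls on.

```python
import random

# A Mersenne prime large enough that the collision probability N/P is tiny.
P = (1 << 61) - 1

def fingerprint(stream, x):
    """Evaluate prod_{a in stream} (x - a) mod P in one sequential scan,
    using O(1) internal memory."""
    acc = 1
    for a in stream:
        acc = acc * (x - a) % P
    return acc

def multisets_probably_equal(stream_a, stream_b):
    """One-sided-error multiset equality test.

    Two multisets are equal iff prod(x - a) and prod(x - b) agree as
    polynomials, so evaluating both at a random point mod P never rejects
    equal multisets, and wrongly accepts unequal ones with probability
    at most N/P (the difference polynomial has at most N roots mod P).
    """
    x = random.randrange(P)
    return fingerprint(stream_a, x) == fingerprint(stream_b, x)

# A permutation of the same values always passes; a differing value
# fails with high probability.
print(multisets_probably_equal([3, 1, 4, 1, 5], [1, 5, 4, 3, 1]))  # True
print(multisets_probably_equal([3, 1, 4, 1, 5], [1, 5, 4, 3, 2]))  # False (w.h.p.)
```

Since this test performs only sequential scans and no random accesses, it sits outside the regime of the Ω(log N) bound, illustrating why the error side of a Monte Carlo algorithm matters in this model.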