Reverse nearest neighbor aggregates over data streams

Authors:
Flip Korn;S. Muthukrishnan;Divesh Srivastava
Affiliations:
AT&T Labs-Research;AT&T Labs-Research;AT&T Labs-Research
Venue:
VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
Year:
2002

Abstract

Reverse Nearest Neighbor (RNN) queries have been studied for finite, stored data sets and are of interest for decision support. However, in many applications such as fixed wireless telephony access and sensor-based highway traffic monitoring, the data arrives in a stream and cannot be stored. Exploratory analysis on this data stream can be formalized naturally using the notion of RNN aggregates (RNNAs), which involve the computation of some aggregate (such as COUNT or MAX DISTANCE) over the set of reverse nearest neighbor "clients" associated with each "server". In this paper, we introduce and investigate the problem of computing three types of RNNA queries over data streams of "client" locations: (i) Max-RNNA: given K servers, return the maximum RNNA over all clients to their closest servers; (ii) List-RNNA: given K servers, return a list of RNNAs over all clients to each of the K servers; and (iii) Opt-RNNA: find a subset of at most K servers for which their RNNAs are below a given threshold. While exact computation of these queries is not possible in the data stream model, we present efficient algorithms to approximately answer these RNNA queries over data streams with error guarantees. We provide analytical proofs of constant factor approximations for many RNNA queries, and complement our analyses with experimental evidence of the accuracy of our techniques.
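
To make the List-RNNA and Max-RNNA definitions concrete, the following is a minimal brute-force sketch in Python. It is only an illustration of the definitions on a small in-memory set of points, not the approximation algorithms of the paper. The function names, the 2-D Euclidean setting, the tie-breaking rule, and the sample data are all assumptions made for this example. Opt-RNNA is not covered.

```python
import math
from typing import List, Tuple

Point = Tuple[float, float]  # assumed 2-D locations, for illustration only


def nearest_server(client: Point, servers: List[Point]) -> int:
    """Index of the server closest to `client` (ties go to the lowest index)."""
    return min(range(len(servers)), key=lambda i: math.dist(client, servers[i]))


def list_rnna(clients: List[Point], servers: List[Point]) -> Tuple[List[int], List[float]]:
    """List-RNNA by brute force: per-server COUNT and MAX DISTANCE over the
    clients whose nearest server is that server."""
    counts = [0] * len(servers)
    max_dists = [0.0] * len(servers)
    for c in clients:
        i = nearest_server(c, servers)
        counts[i] += 1
        max_dists[i] = max(max_dists[i], math.dist(c, servers[i]))
    return counts, max_dists


def max_rnna(clients: List[Point], servers: List[Point]) -> Tuple[int, float]:
    """Max-RNNA by brute force: the largest COUNT and the largest MAX DISTANCE
    taken over all servers."""
    counts, max_dists = list_rnna(clients, servers)
    return max(counts), max(max_dists)


# Hypothetical sample data: two servers and four clients on a line.
servers = [(0.0, 0.0), (10.0, 0.0)]
clients = [(1.0, 0.0), (2.0, 0.0), (9.0, 0.0), (4.0, 0.0)]

print(list_rnna(clients, servers))  # ([3, 1], [4.0, 1.0])
print(max_rnna(clients, servers))   # (3, 4.0)
```

Here three clients have the first server as their nearest server and one has the second, so the COUNT aggregates are 3 and 1, and the farthest client assigned to the first server lies at distance 4.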