Finding the K highest-ranked answers in a distributed network

Authors:
Demetrios Zeinalipour-Yazti;Zografoula Vagena;Vana Kalogeraki;Dimitrios Gunopulos;Vassilis J. Tsotras;Michail Vlachos;Nick Koudas;Divesh Srivastava
Affiliations:
University of Cyprus, Dept. of Computer Science, 75 Kallipoleos Str., P.O. Box 20537, CY-1678, Nicosia, Cyprus;Microsoft Research Cambridge, Cambridge, United Kingdom;AUEB, Athens, Greece and UC - Riverside, Riverside, CA, United States;University of Athens, Athens, Greece;UC - Riverside, Riverside, CA, United States;IBM Research Zurich, Rueschlikon, Switzerland;University of Toronto, Toronto, ON, Canada;AT&T Research Labs, Florham Park, NJ, United States
Venue:
Computer Networks: The International Journal of Computer and Telecommunications Networking
Year:
2009

Citing 37
Cited 1

Combining fuzzy information from multiple systems (extended abstract)

PODS '96 Proceedings of the fifteenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
Fuzzy queries in multimedia database systems

PODS '98 Proceedings of the seventeenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
Space/time trade-offs in hash coding with allowable errors

Communications of the ACM
Optimal aggregation algorithms for middleware

PODS '01 Proceedings of the twentieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Evaluating Top-k Selection Queries

VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
Probabilistic Optimization of Top N Queries

VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
Optimizing Multi-Feature Queries for Image Databases

VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Region proximity in metric spaces and its use for approximate similarity search

ACM Transactions on Information Systems (TOIS)
Query Processing Issues in Image(Multimedia) Databases

ICDE '99 Proceedings of the 15th International Conference on Data Engineering
Distributed top-k monitoring

Proceedings of the 2003 ACM SIGMOD international conference on Management of data
The design of an acquisitional query processor for sensor networks

Proceedings of the 2003 ACM SIGMOD international conference on Management of data
Adaptive filters for continuous queries over distributed data streams

Proceedings of the 2003 ACM SIGMOD international conference on Management of data
Evaluating Top-k Queries over Web-Accessible Databases

ICDE '02 Proceedings of the 18th International Conference on Data Engineering
PlanetLab: an overlay testbed for broad-coverage services

ACM SIGCOMM Computer Communication Review
Habitat monitoring with sensor networks

Communications of the ACM - Wireless sensor networks
Evaluating top-k queries over web-accessible databases

ACM Transactions on Database Systems (TODS)
Efficient top-K query calculation in distributed networks

Proceedings of the twenty-third annual ACM symposium on Principles of distributed computing
Answering similarity queries in peer-to-peer networks

Proceedings of the 13th international World Wide Web conference on Alternate track papers & posters
Supporting top-k join queries in relational databases

The VLDB Journal — The International Journal on Very Large Data Bases
Progressive Distributed Top-k Retrieval in Peer-to-Peer Networks

ICDE '05 Proceedings of the 21st International Conference on Data Engineering
Finding (Recently) Frequent Items in Distributed Data Streams

ICDE '05 Proceedings of the 21st International Conference on Data Engineering
TAG: a Tiny AGgregation service for Ad-Hoc sensor networks

OSDI '02 Proceedings of the 5th symposium on Operating systems design and implementationCopyright restrictions prevent ACM from being able to make the PDFs for this conference available for downloading
RankSQL: query algebra and optimization for relational top-k queries

Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Topk Queries across Multiple Private Databases

ICDCS '05 Proceedings of the 25th IEEE International Conference on Distributed Computing Systems
The threshold join algorithm for top-k queries in distributed sensor networks

DMSN '05 Proceedings of the 2nd international workshop on Data management for sensor networks
KLEE: a framework for distributed top-k query algorithms

VLDB '05 Proceedings of the 31st international conference on Very large data bases
A Sampling-Based Approach to Optimizing Top-k Queries in Sensor Networks

ICDE '06 Proceedings of the 22nd International Conference on Data Engineering
A geometric approach to monitoring threshold functions over distributed data streams

Proceedings of the 2006 ACM SIGMOD international conference on Management of data
Continuous monitoring of top-k queries over sliding windows

Proceedings of the 2006 ACM SIGMOD international conference on Management of data
Data Acquisition in Sensor Networks with Large Memories

ICDEW '05 Proceedings of the 21st International Conference on Data Engineering Workshops
Distributed spatio-temporal similarity search

CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
Microhash: an efficient index structure for fash-based sensor devices

FAST'05 Proceedings of the 4th conference on USENIX Conference on File and Storage Technologies - Volume 4
Top-k query evaluation with probabilistic guarantees

VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
A survey of top-k query processing techniques in relational database systems

ACM Computing Surveys (CSUR)
KSpot: Effectively Monitoring the K Most Important Events in a Wireless Sensor Network

ICDE '09 Proceedings of the 2009 IEEE International Conference on Data Engineering
MINT Views: Materialized In-Network Top-k Views in Sensor Networks

MDM '07 Proceedings of the 2007 International Conference on Mobile Data Management
Efficient processing of distributed top-k queries

DEXA'05 Proceedings of the 16th international conference on Database and Expert Systems Applications

Energy-efficient top-k query processing in wireless sensor networks

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we present an algorithm for finding the k highest-ranked (or Top-k) answers in a distributed network. A Top-K query returns the subset of most relevant answers, in place of all answers, for two reasons: (i) to minimize the cost metric that is associated with the retrieval of all answers; and (ii) to improve the recall and the precision of the answer-set, such that the user is not overwhelmed with irrelevant results. Our study focuses on multi-hop distributed networks in which the data is accessible by traversing a network of nodes. Such a setting captures very well the computation framework of emerging Sensor Networks, Peer-to-Peer Networks and Vehicular Networks. We present the Threshold Join Algorithm (TJA), an efficient algorithm that utilizes a non-uniform threshold on the queried attribute in order to minimize the transfer of data when a query is executed. Additionally, TJA resolves queries in the network rather than in a centralized fashion which further minimizes the consumption of bandwidth and delay. We performed an extensive experimental evaluation of our algorithm using a real testbed of 75 workstations along with a trace-driven experimental methodology. Our results indicate that TJA requires an order of magnitude less communication than the state-of-the-art, scales well with respect to the parameter k and the network topology.