Communication-efficient distributed monitoring of thresholded counts

Authors:
Ram Keralapura;Graham Cormode;Jeyashankher Ramamirtham
Affiliations:
UC Davis;Bell Labs;Bell Labs
Venue:
Proceedings of the 2006 ACM SIGMOD international conference on Management of data
Year:
2006

Abstract

Monitoring is a primary concern in current and next-generation networked systems. For example, sensor networks monitor their surroundings for applications such as tracking atmospheric conditions, wildlife behavior, and troop movements. Similarly, monitoring in data networks is critical not only for accounting and management, but also for detecting anomalies and attacks. Such monitoring applications are inherently continuous and distributed, and must be designed to minimize the communication overhead that they introduce. In this context we introduce and study a fundamental class of problems called "thresholded counts": we must return the aggregate frequency count of an event that is continuously monitored by distributed nodes, to within a user-specified accuracy, whenever the actual count exceeds a given threshold value.

In this paper we propose to address the problem of thresholded counts by setting local thresholds at each monitoring node and initiating communication only when the locally observed data exceeds these local thresholds. We explore algorithms in two categories: static and adaptive thresholds. In the static case, we consider thresholds based on a linear combination of two alternative strategies, and show that there exists an optimal blend of the two strategies that minimizes communication overhead. We further show that this optimal blend can be found using a steepest descent search. In the adaptive case, we propose algorithms that adjust the local thresholds based on the observed distribution of updates. We use extensive simulations to verify the accuracy of our algorithms, validate our theoretical results, and evaluate their performance. We find that both approaches yield significant savings over the naive approach of centralized processing.
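The local-threshold idea in the abstract can be illustrated with a minimal sketch. This is not the paper's algorithm: it assumes a single uniform, static local threshold for every node, unit-sized events, and a coordinator that simply sums the last reported counts. The names `Node`, `Coordinator`, `simulate`, and `local_threshold` are hypothetical and chosen only for illustration.

```python
class Node:
    """A monitoring node that reports only after enough unreported events."""

    def __init__(self, local_threshold):
        self.local_threshold = local_threshold  # assumed uniform and static
        self.count = 0            # true local count
        self.last_reported = 0    # count most recently sent to the coordinator

    def observe(self):
        """Record one event; return the new count if a report is due."""
        self.count += 1
        if self.count - self.last_reported >= self.local_threshold:
            self.last_reported = self.count
            return self.count
        return None


class Coordinator:
    """Keeps the last count reported by each node and counts messages."""

    def __init__(self, num_nodes):
        self.reported = [0] * num_nodes
        self.messages = 0

    def receive(self, node_id, count):
        self.reported[node_id] = count
        self.messages += 1

    def estimate(self):
        return sum(self.reported)


def simulate(events, num_nodes, local_threshold):
    """Replay a sequence of node ids, one event per entry.

    Returns (true_count, coordinator_estimate, messages_sent).
    """
    nodes = [Node(local_threshold) for _ in range(num_nodes)]
    coordinator = Coordinator(num_nodes)
    for node_id in events:
        report = nodes[node_id].observe()
        if report is not None:
            coordinator.receive(node_id, report)
    true_count = sum(node.count for node in nodes)
    return true_count, coordinator.estimate(), coordinator.messages


if __name__ == "__main__":
    # Round-robin events over 4 nodes; centralized processing would
    # need one message per event.
    events = [i % 4 for i in range(1003)]
    print(simulate(events, num_nodes=4, local_threshold=10))
```

Under these assumptions, each node holds back at most `local_threshold - 1` unreported events, so the coordinator's estimate trails the true count by at most `num_nodes * (local_threshold - 1)`, while the message count drops by roughly a factor of `local_threshold` compared with forwarding every event. Choosing or adapting the thresholds to meet a user-specified accuracy is the subject of the paper and is not attempted here.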