The filter-placement problem and its application to minimizing information multiplicity

Authors:
Dóra Erdös;Vatche Ishakian;Andrei Lapets;Evimaria Terzi;Azer Bestavros
Affiliations:
Boston University, Boston, MA;Boston University, Boston, MA;Boston University, Boston, MA;Boston University, Boston, MA;Boston University, Boston, MA
Venue:
Proceedings of the VLDB Endowment
Year:
2012

Citing 20
Cited 0

Building efficient wireless sensor networks with low-level naming

SOSP '01 Proceedings of the eighteenth ACM symposium on Operating systems principles
Mining the network value of customers

Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
Comparison of broadcasting techniques for mobile ad hoc networks

Proceedings of the 3rd ACM international symposium on Mobile ad hoc networking & computing
Simultaneous optimization for concave costs: single sink aggregation or single source buy-at-bulk

SODA '03 Proceedings of the fourteenth annual ACM-SIAM symposium on Discrete algorithms
The Impact of Data Aggregation in Wireless Sensor Networks

ICDCSW '02 Proceedings of the 22nd International Conference on Distributed Computing Systems
Mining knowledge-sharing sites for viral marketing

Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Efficient Time Series Matching by Wavelets

ICDE '99 Proceedings of the 15th International Conference on Data Engineering
Maximizing the spread of influence through a social network

Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
WALRUS: A Similarity Retrieval Algorithm for Image Databases

IEEE Transactions on Knowledge and Data Engineering
Facility location: distributed approximation

Proceedings of the twenty-fourth annual ACM symposium on Principles of distributed computing
Cost-effective outbreak detection in networks

Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
An Information Theoretic Framework for Field Monitoring Using Autonomously Mobile Sensors

DCOSS '08 Proceedings of the 4th IEEE international conference on Distributed Computing in Sensor Systems
Distributed Algorithm for En Route Aggregation Decision in Wireless Sensor Networks

IEEE Transactions on Mobile Computing
Meme-tracking and the dynamics of the news cycle

Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Operator Placement for Snapshot Multi-predicate Queries in Wireless Sensor Networks

MDM '09 Proceedings of the 2009 Tenth International Conference on Mobile Data Management: Systems, Services and Middleware
Near-optimal observation selection using submodular functions

AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 2
Fast shortest path distance estimation in large networks

Proceedings of the 18th ACM conference on Information and knowledge management
Distributed facility location algorithms for flexible configuration of wireless sensor networks

DCOSS'07 Proceedings of the 3rd IEEE international conference on Distributed computing in sensor systems
What is Twitter, a social network or a news media?

Proceedings of the 19th international conference on World wide web
Efficient video similarity measurement with video signature

IEEE Transactions on Circuits and Systems for Video Technology

Quantified Score

Hi-index	0.00

Visualization

Abstract

In many information networks, data items -- such as updates in social networks, news flowing through interconnected RSS feeds and blogs, measurements in sensor networks, route updates in ad-hoc networks -- propagate in an uncoordinated manner: nodes often relay information they receive to neighbors, independent of whether or not these neighbors received the same information from other sources. This uncoordinated data dissemination may result in significant, yet unnecessary communication and processing overheads, ultimately reducing the utility of information networks. To alleviate the negative impacts of this information multiplicity phenomenon, we propose that a subset of nodes (selected at key positions in the network) carry out additional information filtering functionality. Thus, nodes are responsible for the removal (or significant reduction) of the redundant data items relayed through them. We refer to such nodes as filters. We formally define the Filter Placement problem as a combinatorial optimization problem, and study its computational complexity for different types of graphs. We also present polynomial-time approximation algorithms and scalable heuristics for the problem. Our experimental results, which we obtained through extensive simulations on synthetic and real-world information flow networks, suggest that in many settings a relatively small number of filters are fairly effective in removing a large fraction of redundant information.