Detecting epidemics using highly noisy data

Authors:
Chris Milling;Constantine Caramanis;Shie Mannor;Sanjay Shakkottai
Affiliations:
UT Austin, Austin, TX, USA;UT Austin, Austin, TX, USA;The Technion, Haifa, Israel;UT Austin, Austin, TX, USA
Venue:
Proceedings of the fourteenth ACM international symposium on Mobile ad hoc networking and computing
Year:
2013

Citing 7
Cited 0

Random Graph Dynamics (Cambridge Series in Statistical and Probabilistic Mathematics)

Random Graph Dynamics (Cambridge Series in Statistical and Probabilistic Mathematics)
Detecting sources of computer viruses in networks: theory and experiment

Proceedings of the ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Inferring Networks of Diffusion and Influence

ACM Transactions on Knowledge Discovery from Data (TKDD)
Learning the graph of epidemic cascades

Proceedings of the 12th ACM SIGMETRICS/PERFORMANCE joint international conference on Measurement and Modeling of Computer Systems
Network forensics: random infection vs spreading epidemic

Proceedings of the 12th ACM SIGMETRICS/PERFORMANCE joint international conference on Measurement and Modeling of Computer Systems
Rumors in a Network: Who's the Culprit?

IEEE Transactions on Information Theory
Information diffusion and external influence in networks

Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining

Quantified Score

Hi-index	0.00

Visualization

Abstract

From Cholera, AIDS/HIV, and Malaria, to rumors and viral video, understanding the causative network behind an epidemic's spread has repeatedly proven critical for managing the spread (controlling or encouraging, as the case may be). Our current approaches to understand and predict epidemics rely on the scarce, but exact/reliable, expert diagnoses. This paper proposes a different way forward: use more readily available but also more noisy data with {\em many false negatives and false positives}, to determine the causative network of an epidemic. Specifically, we consider an epidemic that spreads according to one of two networks. At some point in time we see a small random subsample (perhaps a vanishingly small fraction) of those infected, along with an order-wise similar number of false positives. We derive sufficient conditions for this problem to be detectable, and provide an efficient algorithm that solves the hypothesis testing problem. We apply this model to two settings. In the first setting, we simply want to distinguish between random illness (a complete graph) and an epidemic (spread along a structured graph). In the second, we have a superposition of both of these, and we wish to detect which is the strongest component.