Detection of emerging space-time clusters

Authors:
Daniel B. Neill;Andrew W. Moore;Maheshkumar Sabhnani;Kenny Daniel
Affiliations:
Carnegie Mellon University, Pittsburgh, PA;Carnegie Mellon University, Pittsburgh, PA;Carnegie Mellon University, Pittsburgh, PA;Carnegie Mellon University, Pittsburgh, PA
Venue:
Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
Year:
2005


Abstract

We propose a new class of spatio-temporal cluster detection methods designed for the rapid detection of emerging space-time clusters. We focus on the motivating application of prospective disease surveillance: detecting space-time clusters of disease cases resulting from an emerging disease outbreak. Automatic, real-time detection of outbreaks can enable rapid epidemiological response, potentially reducing rates of morbidity and mortality. Building on prior work on spatial and space-time scan statistics, our methods combine time series analysis (to determine how many cases we expect to observe for a given spatial region in a given time interval) with new "emerging cluster" space-time scan statistics (to decide whether an observed increase in cases in a region is significant), enabling fast and accurate detection of emerging outbreaks. We evaluate these methods on two types of simulated outbreaks: aerosol release of inhalational anthrax (e.g., from a bioterrorist attack) and FLOO ("Fictional Linear Onset Outbreak"), injected into actual baseline data (Emergency Department records and over-the-counter drug sales data from Allegheny County). We demonstrate that our methods are successful in rapidly detecting both outbreak types while keeping the number of false positives low, and show that our new "emerging cluster" scan statistics consistently outperform the standard "persistent cluster" scan statistics approach.