This paper presents an algorithm for early detection of disease outbreaks that searches a database of emergency department cases for anomalous patterns. Traditional anomaly detection techniques are unsatisfactory for this problem because they identify individual data points that are rare due to particular combinations of features. Applied to our scenario, these algorithms discover isolated outliers, particularly strange events such as someone accidentally shooting their ear, that are not indicative of a new outbreak. Instead, we would like to detect anomalous patterns: groups with specific characteristics whose recent pattern of illness is anomalous relative to historical patterns. We propose a rule-based anomaly detection algorithm that characterizes each anomalous pattern with a rule. The significance of each rule is evaluated using Fisher's Exact Test and a randomization test. We compare our algorithm against a standard detection algorithm by measuring the number of false positives and the timeliness of detection, using simulated data produced by a simulator that creates the effects of an epidemic on a city. The results indicate that our algorithm has significantly better detection times for common significance thresholds while having a slightly higher false positive rate.
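The rule-scoring idea described above can be illustrated with a minimal sketch. It is not the paper's implementation. It assumes single-attribute rules of the form attribute = value, a two-sided Fisher's Exact Test on the 2x2 table of recent versus baseline counts, and a randomization test that shuffles the recent/baseline labels to compensate for having searched over many rules. All function names, the record layout, and the toy data are hypothetical.

```python
from math import comb
import random


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's Exact Test p-value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2, col1, n = a + b, c + d, a + c, a + b + c + d
    denom = comb(n, col1)

    def prob(x):
        # Hypergeometric probability of seeing x in the top-left cell.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_obs = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    # Sum all tables at least as unlikely as the observed one.
    total = sum(prob(x) for x in range(lo, hi + 1) if prob(x) <= p_obs * (1 + 1e-9))
    return min(1.0, total)


def rule_p_value(records, is_recent, attr, value):
    """Score the hypothetical rule `attr == value`: recent vs. baseline counts."""
    a = sum(1 for r, t in zip(records, is_recent) if t and r[attr] == value)
    b = sum(1 for r, t in zip(records, is_recent) if t and r[attr] != value)
    c = sum(1 for r, t in zip(records, is_recent) if not t and r[attr] == value)
    d = sum(1 for r, t in zip(records, is_recent) if not t and r[attr] != value)
    return fisher_exact_two_sided(a, b, c, d)


def best_rule(records, is_recent):
    """Return (p_value, (attr, value)) for the lowest-scoring single-attribute rule."""
    candidates = sorted({(k, v) for r in records for k, v in r.items()})
    return min((rule_p_value(records, is_recent, k, v), (k, v)) for k, v in candidates)


def randomization_p_value(records, is_recent, n_shuffles=200, seed=0):
    """Fraction of label shuffles whose best rule scores at least as well as observed."""
    rng = random.Random(seed)
    observed, rule = best_rule(records, is_recent)
    labels = list(is_recent)
    hits = 0
    for _ in range(n_shuffles):
        rng.shuffle(labels)  # break any link between time period and features
        if best_rule(records, labels)[0] <= observed:
            hits += 1
    return rule, observed, hits / n_shuffles


# Toy data (assumed): 40 baseline cases with evenly spread symptoms,
# then 20 recent cases dominated by respiratory complaints.
symptoms = ["rash", "gi", "respiratory", "other"]
ages = ["adult", "child"]
baseline = [{"symptom": symptoms[i % 4], "age": ages[i % 2]} for i in range(40)]
recent = [{"symptom": "respiratory" if i < 16 else "gi", "age": ages[i % 2]} for i in range(20)]
records = baseline + recent
is_recent = [False] * len(baseline) + [True] * len(recent)

rule, raw_p, compensated_p = randomization_p_value(records, is_recent)
print(rule, raw_p, compensated_p)
```

The raw p-value of the best rule is optimistic because it is the minimum over every candidate rule, which is why the sketch reports the shuffled-label fraction alongside it.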