ACM Computing Surveys (CSUR)
Detection of unique temporal segments by information theoretic meta-clustering
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
ADMA'10 Proceedings of the 6th international conference on Advanced data mining and applications: Part I
Anomaly detection in categorical datasets using bayesian networks
AICI'11 Proceedings of the Third international conference on Artificial intelligence and computational intelligence - Volume Part II
Latent feature encoding using dyadic and relational data
Proceedings of the 20th ACM international conference on Information and knowledge management
Detection of variable length anomalous subsequences in data streams
International Journal of Intelligent Information and Database Systems
Review: A review of novelty detection
Signal Processing
Hi-index | 0.00 |
Identifying atypical objects is one of the traditional topics in machine learning. Recently, novel approaches, e.g., Minority Detection and One-class clustering, have explored further to identify clusters of atypical objects which strongly contrast from the rest of the data in terms of their distribution or density. This paper analyzes such tasks from an information theoretic perspective. Based on Information Bottleneck formalization, these tasks interpret to increasing the averaged atypicalness of the clusters while reducing the complexity of the clustering. This formalization yields a unifying view of the new approaches as well as the classic outlier detection. We also present a scalable minimization algorithm which exploits the localized form of the cost function over individual clusters. The proposed algorithm is evaluated using simulated datasets and a text classification benchmark, in comparison with an existing method.