Clustering based one-class classification for compliance verification of the comprehensive nuclear-test-ban treaty

Authors:
Shiven Sharma;Colin Bellinger;Nathalie Japkowicz
Affiliations:
SITE, University of Ottawa, Ottawa, Canada;SITE, University of Ottawa, Ottawa, Canada;SITE, University of Ottawa, Ottawa, Canada
Venue:
Canadian AI'12 Proceedings of the 25th Canadian conference on Advances in Artificial Intelligence
Year:
2012

Citing 9
Cited 1

Machine Learning for the Detection of Oil Spills in Satellite Radar Images

Machine Learning - Special issue on applications of machine learning and the knowledge discovery process
Supervised versus unsupervised binary-learning by feedforward neural networks

Machine Learning
One-class svms for document classification

The Journal of Machine Learning Research
Statistical Comparisons of Classifiers over Multiple Data Sets

The Journal of Machine Learning Research
One-Class Classification by Combining Density and Class Probability Estimation

ECML PKDD '08 Proceedings of the 2008 European Conference on Machine Learning and Knowledge Discovery in Databases - Part I
The WEKA data mining software: an update

ACM SIGKDD Explorations Newsletter
Evaluating Learning Algorithms: A Classification Perspective

Evaluating Learning Algorithms: A Classification Perspective
On simulating episodic events against a background of noise-like non-episodic events

Proceedings of the 2010 Summer Computer Simulation Conference
On the pattern recognition and classification of stochastically episodic events

Transactions on Compuational Collective Intelligence VI

Clustering-based ensembles for one-class classification

Information Sciences: an International Journal

Quantified Score

Hi-index	0.00

Visualization

Abstract

Monitoring the levels of radioxenon isotopes in the atmosphere has been proposed as a means of verifying the Comprehensive Nuclear-Test-Ban Treaty (CTBT). This translates into a classification problem, whereby the measured concentrations either belong to an explosion class or a background class. Instances drawn from the explosions class are extremely rare, if not non-existent. Therefore, the resulting dataset is extremely imbalanced, and inherently suited for one-class classification. Further exacerbating the problem is the fact that the background distribution can be extremely complex, and thus, modelling it using one-class learning is difficult. In order to improve upon the previous classification results, we investigate the augmentation of one-class learning methods with clustering. The purpose of clustering is to convert a complex distribution into simpler distributions, the clusters, over which more effective models can be built. The resulting model, built from one-class learners trained over the clusters, performs more effectively than a model that is built over the original distribution. This thesis is empirically tested on three different data domains; in particular, a number of artificial datasets, datasets from the UCI repository, and data modelled after the extremely challenging CTBT. The results offer credence to the fact that there is an improvement in performance when clustering is used with one-class classification on complex distributions.