On the influence of class information in the two-stage clustering of a human brain tumour dataset

Authors:
Raúl Cruz-Barbosa;Alfredo Vellido
Affiliations:
Universitat Politécnica de Catalunya, Barcelona, Spain and Universidad Tecnológica de la Mixteca, Huajuapan, Oaxaca, México;Universidad Tecnológica de la Mixteca, Huajuapan, Oaxaca, México
Venue:
MICAI'07 Proceedings of the artificial intelligence 6th Mexican international conference on Advances in artificial intelligence
Year:
2007

Citing 6
Cited 0

GTM: the generative topographic mapping

Neural Computation
Unsupervised Learning of Finite Mixture Models

IEEE Transactions on Pattern Analysis and Machine Intelligence
Self-Organizing Maps

Self-Organizing Maps
Missing data imputation through GTM as a mixture of t-distributions

Neural Networks
On the improvement of brain tumour data clustering using class information

Proceedings of the 2006 conference on STAIRS 2006: Proceedings of the Third Starting AI Researchers' Symposium
Clustering of the self-organizing map

IEEE Transactions on Neural Networks

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper analyzes, through clustering and visualization, Magnetic Resonance Spectra corresponding to a complex human brain tumour dataset. Clustering is performed as a two-stage process, in which the first stage model is Generative Topographic Mapping (GTM). In semi-supervised settings, class information can be added to refine the clustering process. A class information-enriched variant of GTM, class-GTM, is used here for a first cluster description of the data. The number of clusters used by GTM is usually large for visualization purposes and does not necessarily correspond to the overall class structure. Consequently, in a second stage, clusters are agglomerated using the K-means algorithm with different initialization strategies, some of them defined ad hoc for the GTM models. We aim to evaluate how and under what circumstances the use of class information influences tumour cluster-wise class separability in the final result of the two-stage clustering process.