Producing accurate interpretable clusters from high-dimensional data

Authors:
Derek Greene;Pádraig Cunningham
Affiliations:
University of Dublin, Trinity College, Dublin 2, Ireland;University of Dublin, Trinity College, Dublin 2, Ireland
Venue:
PKDD'05 Proceedings of the 9th European conference on Principles and Practice of Knowledge Discovery in Databases
Year:
2005

Citing 5
Cited 3

Concept decompositions for large sparse text data using clustering

Machine Learning
Co-clustering documents and words using bipartite spectral graph partitioning

Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
Document clustering based on non-negative matrix factorization

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Cluster ensembles --- a knowledge reuse framework for combining multiple partitions

The Journal of Machine Learning Research
Soft clustering criterion functions for partitional document clustering: a summary of results

Proceedings of the thirteenth ACM international conference on Information and knowledge management

Identification of gene transcript signatures predictive for estrogen receptor and lymph node status using a stepwise forward selection artificial neural network modelling approach

Artificial Intelligence in Medicine
A Matrix Factorization Approach for Integrating Multiple Data Views

ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part I
Fuzzy semi-supervised co-clustering for text documents

Fuzzy Sets and Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

The primary goal of cluster analysis is to produce clusters that accurately reflect the natural groupings in the data. A second objective is to identify features that are descriptive of the clusters. In addition to these requirements, we often wish to allow objects to be associated with more than one cluster. In this paper we present a technique, based on the spectral co-clustering model, that is effective in meeting these objectives. Our evaluation on a range of text clustering problems shows that the proposed method yields accuracy superior to that afforded by existing techniques, while producing cluster descriptions that are amenable to human interpretation.