An automatic method to determine the number of clusters using decision-theoretic rough set

Authors:
Hong Yu;Zhanguo Liu;Guoyin Wang
Affiliations:
-;-;-
Venue:
International Journal of Approximate Reasoning
Year:
2014

Citing 42
Cited 6

A Validity Measure for Fuzzy Clustering

IEEE Transactions on Pattern Analysis and Machine Intelligence
A decision-theoretic roguth set model

Methodologies for intelligent systems, 5
Techniques of Cluster Algorithms in Data Mining

Data Mining and Knowledge Discovery
Clustering validity checking methods: part II

ACM SIGMOD Record
A unified framework for model-based clustering

The Journal of Machine Learning Research
Cluster Analysis for Gene Expression Data: A Survey

IEEE Transactions on Knowledge and Data Engineering
A Method of Web Search Result Clustering Based on Rough Sets

WI '05 Proceedings of the 2005 IEEE/WIC/ACM International Conference on Web Intelligence
How Many Clusters? An Information-Theoretic Perspective

Neural Computation
Virtual Clusters for Grid Communities

CCGRID '06 Proceedings of the Sixth IEEE International Symposium on Cluster Computing and the Grid
An objective approach to cluster validation

Pattern Recognition Letters
A local-density based spatial clustering algorithm with noise

Information Systems
A cluster validity index for fuzzy clustering

Information Sciences: an International Journal
Defining clusters from a hierarchical cluster tree

Bioinformatics
Attribute reduction in decision-theoretic rough set models

Information Sciences: an International Journal
Probabilistic rough set approximations

International Journal of Approximate Reasoning
Hierarchical Adaptive Clustering

Informatica
Clustering high dimensional data: A graph-based relaxed optimization approach

Information Sciences: an International Journal
Rough Ensemble Classifier: A Comparative Study

WILF '09 Proceedings of the 8th International Workshop on Fuzzy Logic and Applications
Rough Cluster Quality Index Based on Decision Theory

IEEE Transactions on Knowledge and Data Engineering
Semi-supervised Rough Cost/Benefit Decisions

Fundamenta Informaticae - Fundamentals of Knowledge Technology
Decision-theoretic rough set models

RSKT'07 Proceedings of the 2nd international conference on Rough sets and knowledge technology
Bayesian decision theory for dominance-based rough set approach

RSKT'07 Proceedings of the 2nd international conference on Rough sets and knowledge technology
Virtual Organization Clusters: Self-provisioned clouds on the grid

Future Generation Computer Systems
Minimum spanning tree based split-and-merge: A hierarchical clustering method

Information Sciences: an International Journal
Probabilistic model criteria with decision-theoretic rough sets

Information Sciences: an International Journal
Determination of the threshold value β of variable precision rough set by fuzzy algorithms

International Journal of Approximate Reasoning
Attribute reduction in decision-theoretic rough set model: a further investigation

RSKT'11 Proceedings of the 6th international conference on Rough sets and knowledge technology
Automatically determining the number of clusters using decision-theoretic rough set

RSKT'11 Proceedings of the 6th international conference on Rough sets and knowledge technology
Determining the number of clusters using information entropy for mixed data

Pattern Recognition
Least squares quantization in PCM

IEEE Transactions on Information Theory
An information-theoretic interpretation of thresholds in probabilistic rough sets

RSKT'12 Proceedings of the 7th international conference on Rough Sets and Knowledge Technology
Multiple criteria decision analysis with game-theoretic rough sets

RSKT'12 Proceedings of the 7th international conference on Rough Sets and Knowledge Technology
Autonomous Knowledge-oriented Clustering Using Decision-Theoretic Rough Set Theory

Fundamenta Informaticae - Rough Sets and Knowledge Technology (RSKT 2010)
Modelling Multi-agent Three-way Decisions with Decision-theoretic Rough Sets

Fundamenta Informaticae - Rough Sets and Knowledge Technology (RSKT 2010)
Soft clustering -- Fuzzy and rough approaches and their extensions and derivatives

International Journal of Approximate Reasoning
In Search of Effective Granulization with DTRS for Ternary Classification

International Journal of Cognitive Informatics and Natural Intelligence
Analyzing uncertainties of probabilistic rough set regions with game-theoretic rough sets

International Journal of Approximate Reasoning
Generalized probabilistic approximations of incomplete data

International Journal of Approximate Reasoning
Incorporating logistic regression to decision-theoretic rough sets for classifications

International Journal of Approximate Reasoning
Multigranulation decision-theoretic rough sets

International Journal of Approximate Reasoning
An axiomatic characterization of probabilistic rough sets

International Journal of Approximate Reasoning
On an optimization representation of decision-theoretic rough set model

International Journal of Approximate Reasoning

Multigranulation decision-theoretic rough sets

International Journal of Approximate Reasoning
Qualitative and quantitative combinations of crisp and rough clustering schemes using dominance relations

International Journal of Approximate Reasoning
An extension to Rough c-means clustering based on decision-theoretic Rough Sets model

International Journal of Approximate Reasoning
On an optimization representation of decision-theoretic rough set model

International Journal of Approximate Reasoning
Feature selection with test cost constraint

International Journal of Approximate Reasoning
Quantitative information architecture, granular computing and rough set models in the double-quantitative approximation space of precision and grade

Information Sciences: an International Journal

Quantified Score

Hi-index	0.00

Visualization

Abstract

Clustering provides a common means of identifying structure in complex data, and there is renewed interest in clustering as a tool for the analysis of large data sets in many fields. Determining the number of clusters in a data set is one of the most challenging and difficult problems in cluster analysis. To combat the problem, this paper proposes an efficient automatic method by extending the decision-theoretic rough set model to clustering. A new clustering validity evaluation function is designed based on the risk calculated by loss functions and possibilities. Then a hierarchical clustering algorithm, ACA-DTRS algorithm, is proposed, which is proved to stop automatically at the perfect number of clusters without manual interference. Furthermore, a novel fast algorithm, FACA-DTRS, is devised based on the conclusion obtained in the validation of the ACA-DTRS algorithm. The performance of algorithms has been studied on some synthetic and real world data sets. The algorithm analysis and the results of comparison experiments show that the new method, without manual parameter specified in advance, is more valid to determine the number of clusters and more efficient in terms of time cost.