Rough Set Based Clustering Using Active Learning Approach

Authors:
Rekha Kandwal;Prerna Mahajan;Ritu Vijay
Affiliations:
Ministry of Earth Sciences and Science and Technology, India;Guru Gobind Singh Indraprastha University, India;Bansthali University, India
Venue:
International Journal of Artificial Life Research
Year:
2011

Citing 7
Cited 0

Clustering transactions using large items

Proceedings of the eighth international conference on Information and knowledge management
Data clustering: a review

ACM Computing Surveys (CSUR)
Extensions to the k-Means Algorithm for Clustering Large Data Sets with Categorical Values

Data Mining and Knowledge Discovery
Clustering Categorical Data: An Approach Based on Dynamical Systems

VLDB '98 Proceedings of the 24rd International Conference on Very Large Data Bases
Feature Selection for Unsupervised Learning

The Journal of Machine Learning Research
Study of a Cluster Algorithm Based on Rough Sets Theory

ISDA '06 Proceedings of the Sixth International Conference on Intelligent Systems Design and Applications - Volume 01
Rough–Fuzzy Collaborative Clustering

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper revisits the problem of active learning and decision making when the cost of labeling incurs cost and unlabeled data is available in abundance. In many real world applications large amounts of data are available but the cost of correctly labeling it prohibits its use. In such cases, active learning can be employed. In this paper the authors propose rough set based clustering using active learning approach. The authors extend the basic notion of Hamming distance to propose a dissimilarity measure which helps in finding the approximations of clusters in the given data set. The underlying theoretical background for this decision is rough set theory. The authors have investigated our algorithm on the benchmark data sets from UCI machine learning repository which have shown promising results.