Online fuzzy medoid based clustering algorithms

Authors:
Nicolas Labroche
Venue:
Neurocomputing
Year:
2014

Abstract

This paper describes two new online fuzzy clustering algorithms based on medoids. The algorithms are designed to handle either very large datasets that do not fit in main memory or data streams in which data are produced continuously. The innovative aspect of our approach is the combination of fuzzy methods, which are well suited to outliers and overlapping clusters, with medoids, together with the introduction of a decay mechanism that adapts more effectively to changes over time in the data streams. Using medoids instead of means makes it possible to handle non-numerical data (e.g. sequences) and improves the interpretability of the cluster centers. Experiments on artificial and real datasets show that the new algorithms are competitive with state-of-the-art clustering algorithms in terms of partition purity, F1 score, and computation time. Finally, experiments on artificial data streams show the benefit of the decay mechanism when the underlying distributions evolve.
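
The abstract does not give the update rules, so the following is only a generic sketch of how fuzzy memberships, medoid selection, and a decay weighting can fit together. It is not the paper's algorithms. It assumes a precomputed dissimilarity matrix `D`, the standard fuzzy c-medoids membership formula with fuzzifier `m`, and an exponential decay `2 ** (-lam * age)`; the function names, `m = 2.0`, and `lam = 0.1` are all hypothetical choices made for illustration.

```python
import numpy as np

def decay_weights(ages, lam=0.1):
    """Assumed exponential decay: a point of age 1/lam counts half as much."""
    return np.power(2.0, -lam * np.asarray(ages, dtype=float))

def fuzzy_memberships(D, medoids, m=2.0, eps=1e-12):
    """Membership of every point in every cluster (each row sums to 1)."""
    d = D[:, medoids] + eps              # dissimilarity to each medoid
    inv = d ** (-1.0 / (m - 1.0))        # closer medoids get larger values
    return inv / inv.sum(axis=1, keepdims=True)

def update_medoids(D, U, weights, m=2.0):
    """Pick, per cluster, the point with the lowest weighted fuzzy cost.

    cost[j, k] = sum_i weights[i] * U[i, k]**m * D[i, j]
    Note: this sketch does not prevent two clusters from choosing
    the same medoid.
    """
    cost = D.T @ (weights[:, None] * U ** m)
    return cost.argmin(axis=0).tolist()

def fit(D, medoids, weights, m=2.0, max_iter=20):
    """Alternate membership and medoid updates until the medoids are stable."""
    medoids = list(medoids)
    for _ in range(max_iter):
        U = fuzzy_memberships(D, medoids, m)
        new_medoids = update_medoids(D, U, weights, m)
        if new_medoids == medoids:
            break
        medoids = new_medoids
    return medoids, fuzzy_memberships(D, medoids, m)

# Toy data: two well-separated groups on a line, oldest point first.
x = np.array([0.0, 1.0, 2.0, 10.0, 11.0, 12.0])
D = np.abs(x[:, None] - x[None, :])      # any dissimilarity would do here
weights = decay_weights(ages=[5, 4, 3, 2, 1, 0], lam=0.1)
medoids, U = fit(D, medoids=[0, 3], weights=weights)
print("medoid indices:", medoids)
print("memberships:\n", U.round(3))
```

Because only the dissimilarity matrix is used, the same sketch applies to non-numerical objects such as sequences, provided a dissimilarity between them is available. In a streaming setting the ages would grow as new data arrive, so older points contribute less to the choice of medoids.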