On relational data versions of c-means algorithms
Pattern Recognition Letters - Special issue on fuzzy set technology in pattern recognition
BIRCH: an efficient data clustering method for very large databases
SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
Pattern Recognition with Fuzzy Objective Function Algorithms
Pattern Recognition with Fuzzy Objective Function Algorithms
Categorizing Visitors Dynamically by Fast and Robust Clustering of Access Logs
WI '01 Proceedings of the First Asia-Pacific Conference on Web Intelligence: Research and Development
A Generalization-Based Approach to Clustering of Web Usage Sessions
WEBKDD '99 Revised Papers from the International Workshop on Web Usage Analysis and User Profiling
Clustering Data Streams: Theory and Practice
IEEE Transactions on Knowledge and Data Engineering
ACM SIGMOD Record
Approximate clustering in very large relational data: Research Articles
International Journal of Intelligent Systems
Density-based clustering for real-time stream data
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
A framework for clustering evolving data streams
VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
A framework for projected clustering of high dimensional data streams
VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Top 10 algorithms in data mining
Knowledge and Information Systems
Adapting the right measures for K-means clustering
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Robust partitional clustering by outlier and density insensitive seeding
Pattern Recognition Letters
C-DenStream: Using Domain Knowledge on a Data Stream
DS '09 Proceedings of the 12th International Conference on Discovery Science
Partially supervised clustering for image segmentation
Pattern Recognition
Scalable Fuzzy Algorithms for Data Management and Analysis: Methods and Design
Scalable Fuzzy Algorithms for Data Management and Analysis: Methods and Design
ICDMW '10 Proceedings of the 2010 IEEE International Conference on Data Mining Workshops
An effective evaluation measure for clustering on evolving data streams
Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Clustering Algorithm for High Dimensional Data Stream over Sliding Windows
TRUSTCOM '11 Proceedings of the 2011IEEE 10th International Conference on Trust, Security and Privacy in Computing and Communications
The ClusTree: indexing micro-clusters for anytime stream mining
Knowledge and Information Systems
Fuzzy clustering with partial supervision
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Low-complexity fuzzy relational clustering algorithms for Web mining
IEEE Transactions on Fuzzy Systems
IEEE Transactions on Fuzzy Systems
Algorithms of fuzzy clustering with partial supervision
Pattern Recognition Letters
Fuzzily Connected Multimodel Systems Evolving Autonomously From Data Streams
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Hi-index | 0.01 |
This paper describes two new online fuzzy clustering algorithms based on medoids. These algorithms have been developed to deal with either very large datasets that do not fit in main memory or data streams in which data are produced continuously. The innovative aspect of our approach is the combination of fuzzy methods, which are well adapted to outliers and overlapping clusters, with medoids and the introduction of a decay mechanism to adapt more effectively to changes over time in the data streams. The use of medoids instead of means allows to deal with non-numerical data (e.g. sequences...) and improves the interpretability of the cluster centers. Experiments conducted on artificial and real datasets show that our new algorithms are competitive with state-of-the-art clustering algorithms in terms of purity of the partition, F1 score and computation times. Finally, experiments conducted on artificial data streams show the benefit of our decay mechanism in the case of evolving distributions.