On Clustering Validation Techniques

Authors:
Maria Halkidi;Yannis Batistakis;Michalis Vazirgiannis
Affiliations:
Department of Informatics, Athens University of Economics & Business, Patision 76, 10434, Athens, Greece (Hellas). mhalk@aueb.gr;Department of Informatics, Athens University of Economics & Business, Patision 76, 10434, Athens, Greece (Hellas). yannis@aueb.gr;Department of Informatics, Athens University of Economics & Business, Patision 76, 10434, Athens, Greece (Hellas). mvazirg@aueb.gr
Venue:
Journal of Intelligent Information Systems
Year:
2001

Citing 18
Cited 253

Unsupervised Optimal Fuzzy Clustering

IEEE Transactions on Pattern Analysis and Machine Intelligence
A Validity Measure for Fuzzy Clustering

IEEE Transactions on Pattern Analysis and Machine Intelligence
The Fuzzy C Quadratic Shell clustering algorithm and the detection of second-degree curves

Pattern Recognition Letters
Applied multivariate techniques

Applied multivariate techniques
Validating fuzzy partitions obtained through c-shells clustering

Pattern Recognition Letters - Special issue on fuzzy set technology in pattern recognition
BIRCH: an efficient data clustering method for very large databases

SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
Advances in knowledge discovery and data mining

Advances in knowledge discovery and data mining
CURE: an efficient clustering algorithm for large databases

SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
A new cluster validity index for the fuzzy c-mean

Pattern Recognition Letters
Data clustering: a review

ACM Computing Surveys (CSUR)
Data mining: concepts and techniques

Data mining: concepts and techniques
Machine Learning

Machine Learning
Data Mining Techniques: For Marketing, Sales, and Customer Support

Data Mining Techniques: For Marketing, Sales, and Customer Support
Incremental Clustering for Mining in a Data Warehousing Environment

VLDB '98 Proceedings of the 24rd International Conference on Very Large Data Bases
WaveCluster: A Multi-Resolution Clustering Approach for Very Large Spatial Databases

VLDB '98 Proceedings of the 24rd International Conference on Very Large Data Bases
Efficient and Effective Clustering Methods for Spatial Data Mining

VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
STING: A Statistical Information Grid Approach to Spatial Data Mining

VLDB '97 Proceedings of the 23rd International Conference on Very Large Data Bases
ROCK: A Robust Clustering Algorithm for Categorical Attributes

ICDE '99 Proceedings of the 15th International Conference on Data Engineering

Interactive methods for taxonomy editing and validation

Proceedings of the eleventh international conference on Information and knowledge management
VizCluster and its Application on Classifying Gene Expression Data

Distributed and Parallel Databases
On Data Clustering Analysis: Scalability, Constraints, and Validation

PAKDD '02 Proceedings of the 6th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining
Eureka!: A Tool for Interactive Knowledge Discovery

DEXA '02 Proceedings of the 13th International Conference on Database and Expert Systems Applications
Cluster validation techniques for genome expression data

Signal Processing - Special issue: Genomic signal processing
Clustering of streaming time series is meaningless

DMKD '03 Proceedings of the 8th ACM SIGMOD workshop on Research issues in data mining and knowledge discovery
Web Usage Mining as a Tool for Personalization: A Survey

User Modeling and User-Adapted Interaction
Clustering of Time Series Subsequences is Meaningless: Implications for Previous and Future Research

ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
THESUS: Organizing Web document collections based on link semantics

The VLDB Journal — The International Journal on Very Large Data Bases
Clustering time series from ARMA models with clipped data

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Cluster Analysis for Gene Expression Data: A Survey

IEEE Transactions on Knowledge and Data Engineering
A unified framework for image database clustering and content-based retrieval

Proceedings of the 2nd ACM international workshop on Multimedia databases
Clustering Time Series with Clipped Data

Machine Learning
A personalized search engine based on web-snippet hierarchical clustering

WWW '05 Special interest tracks and posters of the 14th international conference on World Wide Web
Near-duplicate detection for eRulemaking

dg.o '05 Proceedings of the 2005 national conference on Digital government research
Effective and Efficient Distributed Model-Based Clustering

ICDM '05 Proceedings of the Fifth IEEE International Conference on Data Mining
Collaborative multi-strategy classification: application to per-pixel analysis of images

MDM '05 Proceedings of the 6th international workshop on Multimedia data mining: mining integrated media and complex data
Adherence clustering: an efficient method for mining market-basket clusters

Information Systems
MACLAW: A modular approach for clustering with local attribute weighting

Pattern Recognition Letters - Special issue: Evolutionary computer vision and image understanding
Towards mapping library and information science

Information Processing and Management: an International Journal - Special issue: Informetrics
Clustering analysis for data samples with multiple labels

DBA'06 Proceedings of the 24th IASTED international conference on Database and applications
Clustering quality measures for data samples with multiple labels

DBA'06 Proceedings of the 24th IASTED international conference on Database and applications
ST-DBSCAN: An algorithm for clustering spatial-temporal data

Data & Knowledge Engineering
Model-based evaluation of clustering validation measures

Pattern Recognition
An aggregated clustering approach using multi-ant colonies algorithms

Pattern Recognition
A fuzzy extension of the Rand index and other related indexes for clustering and classification assessment

Pattern Recognition Letters
Machines in the conversation: detecting themes and trends in informal communication streams

IBM Systems Journal
A threshold criterion, auto-detection and its use in MST-based clustering

Intelligent Data Analysis
Validation and interpretation of Web users' sessions clusters

Information Processing and Management: an International Journal
Inference and evaluation of the multinomial mixture model for text clustering

Information Processing and Management: an International Journal
Role classification of hosts within enterprise networks based on connection patterns

ATEC '03 Proceedings of the annual conference on USENIX Annual Technical Conference
Estimating the concentration of optically active constituents of sea water by Takagi-Sugeno models with quadratic rule consequents

Pattern Recognition
Enhancing the Effectiveness of Clustering with Spectra Analysis

IEEE Transactions on Knowledge and Data Engineering
Graph-based sequence clustering through multiobjective evolutionary algorithms for web recommender systems

Proceedings of the 9th annual conference on Genetic and evolutionary computation
Collaborative multi-step mono-level multi-strategy classification

Multimedia Tools and Applications
Generating and Browsing Multiple Taxonomies Over a Document Collection

Journal of Management Information Systems
MMR: An algorithm for clustering categorical data using Rough Set Theory

Data & Knowledge Engineering
A cluster validity measure with a hybrid parameter search method for the support vector clustering algorithm

Pattern Recognition
Comparison between two coevolutionary feature weighting algorithms in clustering

Pattern Recognition
Extensions of vector quantization for incremental clustering

Pattern Recognition
Extracting Relevant Attribute Values for Improved Search

IEEE Internet Computing
Assessment of self-organizing map variants for clustering with application to redistribution of emotional speech patterns

Neurocomputing
Randomized metric induction and evolutionary conceptual clustering for semantic knowledge bases

Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Clustering for unsupervised relation identification

Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Merging distributed database summaries

Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Clustering support for automated tracing

Proceedings of the twenty-second IEEE/ACM international conference on Automated software engineering
New modifications and applications of fuzzy C-means methodology

Computational Statistics & Data Analysis
A personalized search engine based on Web-snippet hierarchical clustering

Software—Practice & Experience
Enhanced P2P services providing multimedia content

Advances in Multimedia
A density-based cluster validity approach using multi-representatives

Pattern Recognition Letters
Cluster validity measurement for arbitrary shaped clusters

AIKED'06 Proceedings of the 5th WSEAS International Conference on Artificial Intelligence, Knowledge Engineering and Data Bases
Cluster validity measurement techniques

AIKED'06 Proceedings of the 5th WSEAS International Conference on Artificial Intelligence, Knowledge Engineering and Data Bases
Multi-objective clustering ensemble

International Journal of Hybrid Intelligent Systems - Hybridization of Intelligent Systems
An overview of clustering methods

Intelligent Data Analysis
Relation discovery from web data for competency management

Web Intelligence and Agent Systems
Incremental clustering of mixed data based on distance hierarchy

Expert Systems with Applications: An International Journal
Image clustering based on a shared nearest neighbors approach for tagged collections

CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
Discovering correlated spatio-temporal changes in evolving graphs

Knowledge and Information Systems
Building rules on top of ontologies for the semantic web with inductive logic programming

Theory and Practice of Logic Programming
Innovation in the cluster validating techniques

Fuzzy Optimization and Decision Making
On the Missing Link Between Frequent Pattern Discovery and Concept Formation

Inductive Logic Programming
A Bounded Index for Cluster Validity

MLDM '07 Proceedings of the 5th international conference on Machine Learning and Data Mining in Pattern Recognition
Multi-level Clustering in Sarcoidosis: A Preliminary Study

AIME '07 Proceedings of the 11th conference on Artificial Intelligence in Medicine
Web Usage Mining in Noisy and Ambiguous Environments: Exploring the Role of Concept Hierarchies, Compression, and Robust User Profiles

From Web to Social Web: Discovering and Deploying User and Content Profiles
Categorical Data Clustering Using the Combinations of Attribute Values

ICCSA '08 Proceedings of the international conference on Computational Science and Its Applications, Part II
Evolutionary Clustering in Description Logics: Controlling Concept Formation and Drift in Ontologies

DEXA '08 Proceedings of the 19th international conference on Database and Expert Systems Applications
Automatic image pixel clustering with an improved differential evolution

Applied Soft Computing
Identification of association rules between clusters

CSTST '08 Proceedings of the 5th international conference on Soft computing as transdisciplinary science and technology
Identifying clusters of user behavior in intranet search engine log files

Journal of the American Society for Information Science and Technology
A Robust Methodology for Comparing Performances of Clustering Validity Criteria

SBIA '08 Proceedings of the 19th Brazilian Symposium on Artificial Intelligence: Advances in Artificial Intelligence
On the efficiency of evolutionary fuzzy clustering

Journal of Heuristics
Intrusion detection alarms reduction using root cause analysis and clustering

Computer Communications
BotMiner: clustering analysis of network traffic for protocol- and structure-independent botnet detection

SS'08 Proceedings of the 17th conference on Security symposium
A comprehensive validity index for clustering

Intelligent Data Analysis
A new method for hierarchical clustering combination

Intelligent Data Analysis
On comparing two sequences of numbers and its applications to clustering analysis

Information Sciences: an International Journal
Information Granulation: A Medical Case Study

Transactions on Rough Sets IX
Clustering of document collection - A weighting approach

Expert Systems with Applications: An International Journal
Hybrid intelligent vision-based car-like vehicle backing systems design

Expert Systems with Applications: An International Journal
Multidimensional cluster stability analysis from a Brazilian Bradyrhizobium sp. RFLP/PCR data set

Journal of Computational and Applied Mathematics
Arif Index for Predicting the Classification Accuracy of Features and Its Application in Heart Beat Classification Problem

PAKDD '09 Proceedings of the 13th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining
Rule induction for forecasting method selection: Meta-learning the characteristics of univariate time series

Neurocomputing
A survey of Web clustering engines

ACM Computing Surveys (CSUR)
Document analysis and visualization with zero-inflated poisson

Data Mining and Knowledge Discovery
A comparison of extrinsic clustering evaluation metrics based on formal constraints

Information Retrieval
Robust Division in Clustering of Streaming Time Series

Proceedings of the 2008 conference on ECAI 2008: 18th European Conference on Artificial Intelligence
Metric-based stochastic conceptual clustering for ontologies

Information Systems
Metric-based stochastic conceptual clustering for ontologies

Information Systems
Hybrid clustering for validation and improvement of subject-classification schemes

Information Processing and Management: an International Journal
A novel measure for validating clustering results applied to road traffic

Proceedings of the Third International Workshop on Knowledge Discovery from Sensor Data
A novel measure for validating clustering results applied to road traffic

Proceedings of the Third International Workshop on Knowledge Discovery from Sensor Data
An Approach to Web-Scale Named-Entity Disambiguation

MLDM '09 Proceedings of the 6th International Conference on Machine Learning and Data Mining in Pattern Recognition
A new clustering approach using similarity of patterns texture

Intelligent Data Analysis
Mixture-model cluster analysis using information theoretical criteria

Intelligent Data Analysis
Fuzzy Clustering for Categorical Spaces

ISMIS '09 Proceedings of the 18th International Symposium on Foundations of Intelligent Systems
Performance evaluation of density-based clustering methods

Information Sciences: an International Journal
Separation index and partial membership for clustering

Computational Statistics & Data Analysis
A novel HMM-based clustering algorithm for the analysis of gene expression time-course data

Computational Statistics & Data Analysis
A survey of evolutionary algorithms for clustering

IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews
Using graph partitioning to discover regions of correlated spatio-temporal change in evolving graphs

Intelligent Data Analysis
Collaborative clustering with background knowledge

Data & Knowledge Engineering
Framework for evaluating clustering algorithms in duplicate detection

Proceedings of the VLDB Endowment
A hybrid approach for supplier cluster analysis

Computers & Mathematics with Applications
Unsupervised Fuzzy Clustering for the Segmentation and Annotation of Upwelling Regions in Sea Surface Temperature Images

DS '09 Proceedings of the 12th International Conference on Discovery Science
Clustering of Retrieved Images by Integrating Perceptual Signal Features within Keyword-Based Image Search Engines

PCM '09 Proceedings of the 10th Pacific Rim Conference on Multimedia: Advances in Multimedia Information Processing
Unsupervised classification of polarimetric SAR image with dynamic clustering: An image processing approach

Advances in Engineering Software
Adherence clustering: an efficient method for mining market-basket clusters

Information Systems
Feature-based cluster validation for high-dimensional data

AIA '08 Proceedings of the 26th IASTED International Conference on Artificial Intelligence and Applications
Kernel-induced fuzzy clustering of image pixels with an improved differential evolution algorithm

Information Sciences: an International Journal
Data-Fusion in Clustering Microarray Data: Balancing Discovery and Interpretability

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Multiobjective genetic algorithm-based fuzzy clustering of categorical attributes

IEEE Transactions on Evolutionary Computation
A clustering validity assessment index

PAKDD'03 Proceedings of the 7th Pacific-Asia conference on Advances in knowledge discovery and data mining
A new HMM-based ensemble generation method for numeral recognition

MCS'07 Proceedings of the 7th international conference on Multiple classifier systems
QC4: a clustering evaluation method

PAKDD'07 Proceedings of the 11th Pacific-Asia conference on Advances in knowledge discovery and data mining
Towards adaptive web mining: histograms and contexts in text data clustering

IDA'07 Proceedings of the 7th international conference on Intelligent data analysis
Generalized external indexes for comparing data partitions with overlapping categories

Pattern Recognition Letters
Methods to bicluster validation and comparison in microarray data

IDEAL'07 Proceedings of the 8th international conference on Intelligent data engineering and automated learning
Semi-fuzzy splitting in online divisive-agglomerative clustering

EPIA'07 Proceedings of the aritficial intelligence 13th Portuguese conference on Progress in artificial intelligence
Weighted partition consensus via kernels

Pattern Recognition
Segmentation of upwelling regions in sea surface temperature images via unsupervised fuzzy clustering

IDEAL'09 Proceedings of the 10th international conference on Intelligent data engineering and automated learning
A K-means approach based on concept hierarchical tree for search results clustering

FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 1
Stochastic approximation driven particle swarm optimization

IIT'09 Proceedings of the 6th international conference on Innovations in information technology
Approximate clustering of time series using compact model-based descriptions

DASFAA'08 Proceedings of the 13th international conference on Database systems for advanced applications
Fuzzy Clustering for Semantic Knowledge Bases

Fundamenta Informaticae - Methodologies for Intelligent Systems
Exploiting tree structure of a web page for clustering

International Journal of Knowledge and Web Intelligence
Fractional particle swarm optimization in multidimensional search space

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
SEP/COP: An efficient method to find the best partition in hierarchical clustering based on a new cluster validity index

Pattern Recognition
Flocking based approach for data clustering

Natural Computing: an international journal
Impact of object extraction methods on classification performance in surface inspection systems

Machine Vision and Applications - Integrated Imaging and Vision Techniques for Industrial Inspection
F-statistics algorithm for gene clustering evaluation

Proceedings of the First ACM International Conference on Bioinformatics and Computational Biology
Behavioral clustering of HTTP-based malware and signature generation using malicious network traces

NSDI'10 Proceedings of the 7th USENIX conference on Networked systems design and implementation
On combining multiple clusterings: an overview and a new perspective

Applied Intelligence
Stability-based validation of bicluster solutions

Pattern Recognition
Determining the most proper number of cluster in fuzzy clustering by using artificial neural networks

Expert Systems with Applications: An International Journal
Finding irregularly shaped clusters based on entropy

ICDM'10 Proceedings of the 10th industrial conference on Advances in data mining: applications and theoretical aspects
Mining hot clusters of similar anomalies for system management

PRICAI'10 Proceedings of the 11th Pacific Rim international conference on Trends in artificial intelligence
Unifying content and context similarities of the textual and visual information in an image clustering framework

PCM'10 Proceedings of the 11th Pacific Rim conference on Advances in multimedia information processing: Part I
On consensus clustering validation

SSPR&SPR'10 Proceedings of the 2010 joint IAPR international conference on Structural, syntactic, and statistical pattern recognition
Distributed antipole clustering for efficient data search and management in Euclidean and metric spaces

IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
Towards a standard methodology to evaluate internal cluster validity indices

Pattern Recognition Letters
Stochastic approximation driven particle swarm optimization with simultaneous perturbation -Who will guide the guide?

Applied Soft Computing
An improved rough clustering using discernibility based initial seed computation

ADMA'10 Proceedings of the 6th international conference on Advanced data mining and applications: Part I
Subspace clustering for indexing high dimensional data: a main memory index based on local reductions and individual multi-representations

Proceedings of the 14th International Conference on Extending Database Technology
L2GClust: local-to-global clustering of stream sources

Proceedings of the 2011 ACM Symposium on Applied Computing
A methodology to find clusters in the data based on Shannon's entropy and genetic algorithms

ACELAE'11 Proceedings of the 10th WSEAS international conference on communications, electrical & computer engineering, and 9th WSEAS international conference on Applied electromagnetics, wireless and optical communications
A semantic approach to ETL technologies

Data & Knowledge Engineering
Automatic hierarchical clustering algorithm for remote sensing data

Pattern Recognition and Image Analysis
Enhancing grid-density based clustering for high dimensional data

Journal of Systems and Software
Population-based artificial immune system clustering algorithm

ICARIS'11 Proceedings of the 10th international conference on Artificial immune systems
The minimum code length for clustering using the gray code

ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part III
Clustering of multiple microarray experiments using information integration

ITBAM'11 Proceedings of the Second international conference on Information technology in bio- and medical informatics
A graph partitioning approach to SOM clustering

IDEAL'11 Proceedings of the 12th international conference on Intelligent data engineering and automated learning
Grouping alternating schemata in semantic valence dictionary of polish verbs

TSD'11 Proceedings of the 14th international conference on Text, speech and dialogue
Unsupervised video surveillance

ACCV'10 Proceedings of the 2010 international conference on Computer vision - Volume Part I
A classification of cluster validity indexes based on membership degree and applications

WISM'11 Proceedings of the 2011 international conference on Web information systems and mining - Volume Part I
CarGene: Characterisation of sets of genes based on metabolic pathways analysis

International Journal of Data Mining and Bioinformatics
On measuring forgery quality in online signatures

Pattern Recognition
Model order selection for multiple cooperative swarms clustering using stability analysis

Information Sciences: an International Journal
MiniMax ε-stable cluster validity index for Type-2 fuzziness

Information Sciences: an International Journal
Comparative analysis of power consumption in university buildings using envSOM

IDA'11 Proceedings of the 10th international conference on Advances in intelligent data analysis X
Effectivity of internal validation techniques for gene clustering

ISBMDA'06 Proceedings of the 7th international conference on Biological and Medical Data Analysis
A p2p architecture for multimedia content retrieval

MMM'07 Proceedings of the 13th international conference on Multimedia Modeling - Volume Part I
DClusterE: A Framework for Evaluating and Understanding Document Clustering Using Visualization

ACM Transactions on Intelligent Systems and Technology (TIST)
Contextual maps for browsing huge document collections

ISMIS'06 Proceedings of the 16th international conference on Foundations of Intelligent Systems
Text data clustering by contextual graphs

DS'06 Proceedings of the 9th international conference on Discovery Science
Integration of ant colony SOM and k-means for clustering analysis

KES'06 Proceedings of the 10th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part I
A model for the distribution design of distributed databases and an approach to solve large instances

IWDC'05 Proceedings of the 7th international conference on Distributed Computing
Distribution design in distributed databases using clustering to solve large instances

ISPA'05 Proceedings of the Third international conference on Parallel and Distributed Processing and Applications
Modified adaptive resonance theory network for mixed data based on distance hierarchy

ICCS'06 Proceedings of the 6th international conference on Computational Science - Volume Part IV
Clustering and selecting suppliers based on simulated annealing algorithms

Computers & Mathematics with Applications
CPCQ: Contrast pattern based clustering quality index for categorical data

Pattern Recognition
Overcoming browser cookie churn with clustering

Proceedings of the fifth ACM international conference on Web search and data mining
Entropy on covers

Data Mining and Knowledge Discovery
A two-stage genetic algorithm for automatic clustering

Neurocomputing
NNCluster: an efficient clustering algorithm for road network trajectories

DASFAA'10 Proceedings of the 15th international conference on Database Systems for Advanced Applications - Volume Part II
Dynamic data clustering using stochastic approximation driven multi-dimensional particle swarm optimization

EvoApplicatons'10 Proceedings of the 2010 international conference on Applications of Evolutionary Computation - Volume Part I
User-driven fuzzy clustering: on the road to semantic classification

RSFDGrC'05 Proceedings of the 10th international conference on Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing - Volume Part I
Parallel implementation of information retrieval clustering models

VECPAR'04 Proceedings of the 6th international conference on High Performance Computing for Computational Science
Model-Based cluster analysis for web users sessions

ISMIS'05 Proceedings of the 15th international conference on Foundations of Intelligent Systems
Kernel k-means for categorical data

IDA'05 Proceedings of the 6th international conference on Advances in Intelligent Data Analysis
Genetic algorithms for feature weighting: evolution vs. coevolution and darwin vs. lamarck

MICAI'05 Proceedings of the 4th Mexican international conference on Advances in Artificial Intelligence
A coevolutionary approach for clustering with feature weighting application to image analysis

EC'05 Proceedings of the 3rd European conference on Applications of Evolutionary Computing
Improving retrievability with improved cluster-based pseudo-relevance feedback selection

Expert Systems with Applications: An International Journal
HS-measure: a hybrid clustering validity measure to interpret road traffic data

Proceedings of the 5th International ICST Conference on Performance Evaluation Methodologies and Tools
Clustering of web sessions using levenshtein metric

ICDM'04 Proceedings of the 4th international conference on Advances in Data Mining: applications in Image Mining, Medicine and Biotechnology, Management and Environmental Control, and Telecommunications
Fuzzy distance based hierarchical clustering calculated using the a∗ algorithm

IWCIA'06 Proceedings of the 11th international conference on Combinatorial Image Analysis
Clustering similarity comparison using density profiles

AI'06 Proceedings of the 19th Australian joint conference on Artificial Intelligence: advances in Artificial Intelligence
A non-parametric method for data clustering with optimal variable weighting

IDEAL'06 Proceedings of the 7th international conference on Intelligent Data Engineering and Automated Learning
Segmentation of very high resolution remote sensing imagery of urban areas using particle swarm optimization algorithm

ICIAR'10 Proceedings of the 7th international conference on Image Analysis and Recognition - Volume Part I
Clustering of heterogeneously typed data with soft computing - a case study

MICAI'11 Proceedings of the 10th international conference on Artificial Intelligence: advances in Soft Computing - Volume Part II
A new grouping genetic algorithm for clustering problems

Expert Systems with Applications: An International Journal
A robust adaptive clustering analysis method for automatic identification of clusters

Pattern Recognition
Multi-dimensional representations of laparoscopic simulations for SANETs

EUROCAST'11 Proceedings of the 13th international conference on Computer Aided Systems Theory - Volume Part II
A BIRCH-Based clustering method for large time series databases

PAKDD'11 Proceedings of the 15th international conference on New Frontiers in Applied Data Mining
A new efficient and unbiased approach for clustering quality evaluation

PAKDD'11 Proceedings of the 15th international conference on New Frontiers in Applied Data Mining
Classification of textual E-mail spam using data mining techniques

Applied Computational Intelligence and Soft Computing
Combining evaluation metrics via the unanimous improvement ratio and its application to clustering tasks

Journal of Artificial Intelligence Research
Automated computational delimitation of SST upwelling areas using fuzzy clustering

Computers & Geosciences
An efficient incremental method for generating equivalence groups of search results in information retrieval and queries

Knowledge-Based Systems
k-Means clustering of asymmetric data

HAIS'12 Proceedings of the 7th international conference on Hybrid Artificial Intelligent Systems - Volume Part I
GANC: Greedy agglomerative normalized cut for graph clustering

Pattern Recognition
A two-leveled symbiotic evolutionary algorithm for clustering problems

Applied Intelligence
Validating cluster structures in data mining tasks

Proceedings of the 2012 Joint EDBT/ICDT Workshops
Hypergraph based geometric biclustering algorithm

Pattern Recognition Letters
A framework for Multi-Agent Based Clustering

Autonomous Agents and Multi-Agent Systems
GAUR: a method to detect Sybil groups in peer-to-peer overlays

International Journal of Grid and Utility Computing
Hierarchical cluster algorithm for remote sensing data of earth

Pattern Recognition and Image Analysis
Automatic aspect discrimination in data clustering

Pattern Recognition
Generation of a clustering ensemble based on a gravitational self-organising map

Neurocomputing
An effective unsupervised network anomaly detection method

Proceedings of the International Conference on Advances in Computing, Communications and Informatics
Mining complex activities in the wild via a single smartphone accelerometer

Proceedings of the Sixth International Workshop on Knowledge Discovery from Sensor Data
Text categorization using an ensemble classifier based on a mean co-association matrix

MLDM'12 Proceedings of the 8th international conference on Machine Learning and Data Mining in Pattern Recognition
ciForager: Incrementally discovering regions of correlated change in evolving graphs

ACM Transactions on Knowledge Discovery from Data (TKDD)
Design and implementation of an intelligent automatic question answering system based on data mining

ICSI'12 Proceedings of the Third international conference on Advances in Swarm Intelligence - Volume Part II
A constructive particle swarm algorithm for fuzzy clustering

IDEAL'12 Proceedings of the 13th international conference on Intelligent Data Engineering and Automated Learning
An extensive comparative study of cluster validity indices

Pattern Recognition
Unsupervised translation sense clustering

NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
A visual analytics framework for cluster analysis of DNA microarray data

Expert Systems with Applications: An International Journal
Towards hierarchical clustering

CSR'07 Proceedings of the Second international conference on Computer Science: theory and applications
Discretization in gene expression data analysis: a selected survey

Proceedings of the Second International Conference on Computational Science, Engineering and Information Technology
A novel soft set approach in selecting clustering attribute

Knowledge-Based Systems
Model-based clustering of high-dimensional data: Variable selection versus facet determination

International Journal of Approximate Reasoning
Optimal clustering in the context of overlapping cluster analysis

Information Sciences: an International Journal
Clustering criteria in multiobjective data clustering

PPSN'12 Proceedings of the 12th international conference on Parallel Problem Solving from Nature - Volume Part II
Center-Wise intra-inter silhouettes

SUM'12 Proceedings of the 6th international conference on Scalable Uncertainty Management
VAMO: towards a fully automated malware clustering validity analysis

Proceedings of the 28th Annual Computer Security Applications Conference
Iterative evolutionary subspace clustering

ICONIP'12 Proceedings of the 19th international conference on Neural Information Processing - Volume Part I
MicroClAn: Microarray clustering analysis

Journal of Parallel and Distributed Computing
An evolutionary computational model applied to cluster analysis of DNA microarray data

Expert Systems with Applications: An International Journal
GPU accelerated genetic clustering

SEAL'12 Proceedings of the 9th international conference on Simulated Evolution and Learning
How people describe their place: identifying predominant types of place descriptions

Proceedings of the 1st ACM SIGSPATIAL International Workshop on Crowdsourced and Volunteered Geographic Information
Evaluation of malware clustering based on its dynamic behaviour

AusDM '08 Proceedings of the 7th Australasian Data Mining Conference - Volume 87
Scalable fine-grained behavioral clustering of HTTP-based malware

Computer Networks: The International Journal of Computer and Telecommunications Networking
Structure inference for linked data sources using clustering

Proceedings of the Joint EDBT/ICDT 2013 Workshops
Ranking and selection of unsupervised learning marketing segmentation

Knowledge-Based Systems
A general evaluation measure for document organization tasks

Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
On the combination of relative clustering validity criteria

Proceedings of the 25th International Conference on Scientific and Statistical Database Management
A semi-supervised feature selection method using a non-parametric technique with pairwise instance constraints

Journal of Information Science
Expert system for clustering prokaryotic species by their metabolic features

Expert Systems with Applications: An International Journal
Cluster ensemble selection based on relative validity indexes

Data Mining and Knowledge Discovery
Visualizing clusters in artificial neural networks using Morse theory

Advances in Artificial Neural Systems
How Many Clusters: A Validation Index for Arbitrary-Shaped Clusters

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Is data clustering in adversarial settings secure?

Proceedings of the 2013 ACM workshop on Artificial intelligence and security
OClustR: A new graph-based algorithm for overlapping clustering

Neurocomputing
Clustering Household Electricity Use Profiles

Proceedings of Workshop on Machine Learning for Sensory Data Analysis
Heterogeneous graph-based intent learning with queries, web pages and Wikipedia concepts

Proceedings of the 7th ACM international conference on Web search and data mining
Cluster methods for assessing research performance: exploring Spanish computer science

Scientometrics
A fuzzy support vector machine algorithm for classification based on a novel PIM fuzzy clustering method

Neurocomputing
Evolutionary k-means for distributed data sets

Neurocomputing
A proposed IPC-based clustering method for exploiting expert knowledge and its application to strategic planning

Journal of Information Science
Asymmetric clustering using the alpha-beta divergence

Pattern Recognition
A comparison of clustering quality indices using outliers and noise

Intelligent Data Analysis
Feature selection for k-means clustering stability: theoretical analysis and an algorithm

Data Mining and Knowledge Discovery

Quantified Score

Hi-index	0.01

Visualization

Abstract

Cluster analysis aims at identifying groups of similar objects and, therefore helps to discover distribution of patterns and interesting correlations in large data sets. It has been subject of wide research since it arises in many application domains in engineering, business and social sciences. Especially, in the last years the availability of huge transactional and experimental data sets and the arising requirements for data mining created needs for clustering algorithms that scale and can be applied in diverse domains.This paper introduces the fundamental concepts of clustering while it surveys the widely known clustering algorithms in a comparative way. Moreover, it addresses an important issue of clustering process regarding the quality assessment of the clustering results. This is also related to the inherent features of the data set under concern. A review of clustering validity measures and approaches available in the literature is presented. Furthermore, the paper illustrates the issues that are under-addressed by the recent algorithms and gives the trends in clustering process.