Random sampling with a reservoir
ACM Transactions on Mathematical Software (TOMS)
Algorithms for clustering data
The design and analysis of spatial data structures
Introduction to algorithms
The R*-tree: an efficient and robust access method for points and rectangles
SIGMOD '90 Proceedings of the 1990 ACM SIGMOD international conference on Management of data
Randomized algorithms
BIRCH: an efficient data clustering method for very large databases
SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
The R+-Tree: A Dynamic Index for Multi-Dimensional Objects
VLDB '87 Proceedings of the 13th International Conference on Very Large Data Bases
Efficient and Effective Clustering Methods for Spatial Data Mining
VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
Parallel Algorithms for Hierarchical Clustering
A framework for measuring changes in data characteristics
PODS '99 Proceedings of the eighteenth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
OPTICS: ordering points to identify the clustering structure
SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
Fast algorithms for projected clustering
SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
Multi-dimensional selectivity estimation using compressed histogram information
SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
Scalable algorithms for mining large databases
KDD '99 Tutorial notes of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining
Entropy-based subspace clustering for mining numerical data
KDD '99 Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining
Evaluating a class of distance-mapping algorithms for data mining and clustering
KDD '99 Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining
Hierarchical parallel coordinates for exploration of large datasets
VIS '99 Proceedings of the conference on Visualization '99: celebrating ten years
Data mining and the Web: past, present and future
Proceedings of the 2nd international workshop on Web information and data management
A multiple-resolution method for edge-centric data clustering
Proceedings of the eighth international conference on Information and knowledge management
Data mining on an OLTP system (nearly) for free
SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Finding generalized projected clusters in high dimensional spaces
SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Density biased sampling: an improved method for data mining and clustering
SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Efficient algorithms for mining outliers from large data sets
SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Approximation algorithms for projective clustering
SODA '00 Proceedings of the eleventh annual ACM-SIAM symposium on Discrete algorithms
Clustering through decision tree construction
Proceedings of the ninth international conference on Information and knowledge management
High performance clustering based on the similarity join
Proceedings of the ninth international conference on Information and knowledge management
Information retrieval on the web
ACM Computing Surveys (CSUR)
Scalability for clustering algorithms revisited
ACM SIGKDD Explorations Newsletter
H-BLOB: a hierarchical visual clustering method using implicit surfaces
Proceedings of the conference on Visualization '00
Outlier detection for high dimensional data
SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Epsilon grid order: an algorithm for the similarity join on massive high-dimensional data
SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Scalable data mining with model constraints
ACM SIGKDD Explorations Newsletter - Special issue on “Scalable data mining algorithms”
Efficient discovery of error-tolerant frequent itemsets in high dimensions
Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
Mining top-n local outliers in large databases
Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
Finding similar images quickly using object shapes
Proceedings of the tenth international conference on Information and knowledge management
Genetic subtyping using cluster analysis
ACM SIGKDD Explorations Newsletter
Proceedings of the 2002 ACM SIGMOD international conference on Management of data
A Monte Carlo algorithm for fast projective clustering
Proceedings of the 2002 ACM SIGMOD international conference on Management of data
An evaluation of sampling methods for data mining with fuzzy C-means
Data mining for design and manufacturing
Why so many clustering algorithms: a position paper
ACM SIGKDD Explorations Newsletter
An efficient and effective algorithm for density biased sampling
Proceedings of the eleventh international conference on Information and knowledge management
Evaluation of hierarchical clustering algorithms for document datasets
Proceedings of the eleventh international conference on Information and knowledge management
COOLCAT: an entropy-based algorithm for categorical clustering
Proceedings of the eleventh international conference on Information and knowledge management
FREM: fast and robust EM clustering for large data sets
Proceedings of the eleventh international conference on Information and knowledge management
Hyper-rectangle based segmentation and clustering of large video data sets
Information Sciences—Informatics and Computer Science: An International Journal - Special issue: Intelligent multimedia computing and networking
Clustering validity checking methods: part II
ACM SIGMOD Record
Squeezer: an efficient algorithm for clustering categorical data
Journal of Computer Science and Technology
On Clustering Validation Techniques
Journal of Intelligent Information Systems
A Decision Criterion for the Optimal Number of Clusters in Hierarchical Clustering
Journal of Global Optimization
Computer
Querying Time Series Data Based on Similarity
IEEE Transactions on Knowledge and Data Engineering
DEMON: Mining and Monitoring Evolving Data
IEEE Transactions on Knowledge and Data Engineering
Finding Interesting Associations without Support Pruning
IEEE Transactions on Knowledge and Data Engineering
Finding Localized Associations in Market Basket Data
IEEE Transactions on Knowledge and Data Engineering
Redefining Clustering for High-Dimensional Applications
IEEE Transactions on Knowledge and Data Engineering
CLARANS: A Method for Clustering Objects for Spatial Data Mining
IEEE Transactions on Knowledge and Data Engineering
An evolutionary technique based on K-means algorithm for optimal clustering in R^N
Information Sciences—Applications: An International Journal
Fast hierarchical clustering and its validation
Data & Knowledge Engineering
Efficiently Determining the Starting Sample Size for Progressive Sampling
ECML '01 Proceedings of the 12th European Conference on Machine Learning
Scalable Model for Extensional and Intensional Descriptions of Unclassified Data
IPDPS '00 Proceedings of the 15 IPDPS 2000 Workshops on Parallel and Distributed Processing
Quality Scheme Assessment in the Clustering Process
PKDD '00 Proceedings of the 4th European Conference on Principles of Data Mining and Knowledge Discovery
A Data Set Oriented Approach for Clustering Algorithm Selection
PKDD '01 Proceedings of the 5th European Conference on Principles of Data Mining and Knowledge Discovery
A Study on the Hierarchical Data Clustering Algorithm Based on Gravity Theory
PKDD '01 Proceedings of the 5th European Conference on Principles of Data Mining and Knowledge Discovery
Multiscale Comparison of Temporal Patterns in Time-Series Medical Databases
PKDD '02 Proceedings of the 6th European Conference on Principles of Data Mining and Knowledge Discovery
Semantic Compression and Pattern Extraction with Fascicles
VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
Local Dimensionality Reduction: A New Approach to Indexing High Dimensional Spaces
VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Indexing the Distance: An Efficient Method to KNN Processing
Proceedings of the 27th International Conference on Very Large Data Bases
C2P: Clustering based on Closest Pairs
Proceedings of the 27th International Conference on Very Large Data Bases
AUTOCLUST+: Automatic Clustering of Point-Data Sets in the Presence of Obstacles
TSDM '00 Proceedings of the First International Workshop on Temporal, Spatial, and Spatio-Temporal Data Mining-Revised Papers
Decontamination of Training Samples for Supervised Pattern Recognition Methods
Proceedings of the Joint IAPR International Workshops on Advances in Pattern Recognition
Revisiting R-Tree Construction Principles
ADBIS '02 Proceedings of the 6th East European Conference on Advances in Databases and Information Systems
Pattern-Oriented Hierarchical Clustering
ADBIS '99 Proceedings of the Third East European Conference on Advances in Databases and Information Systems
Approximate k-Closest-Pairs with Space Filling Curves
DaWaK 2000 Proceedings of the 4th International Conference on Data Warehousing and Knowledge Discovery
DaWaK 2000 Proceedings of the 4th International Conference on Data Warehousing and Knowledge Discovery
CoFD: An Algorithm for Non-distance Based Clustering in High Dimensional Spaces
DaWaK 2000 Proceedings of the 4th International Conference on Data Warehousing and Knowledge Discovery
Self-Tuning Clustering: An Adaptive Clustering Method for Transaction Data
DaWaK 2000 Proceedings of the 4th International Conference on Data Warehousing and Knowledge Discovery
Fully Dynamic Clustering of Metric Data Sets
BNCOD 19 Proceedings of the 19th British National Conference on Databases: Advances in Databases
A Fast Algorithm for Density-Based Clustering in Large Database
PAKDD '99 Proceedings of the Third Pacific-Asia Conference on Methodologies for Knowledge Discovery and Data Mining
A Hybrid Approach to Clustering in Very Large Databases
PAKDD '01 Proceedings of the 5th Pacific-Asia Conference on Knowledge Discovery and Data Mining
Efficient Hierarchical Clustering Algorithms Using Partially Overlapping Partitions
PAKDD '01 Proceedings of the 5th Pacific-Asia Conference on Knowledge Discovery and Data Mining
Scalable Hierarchical Clustering Method for Sequences of Categorical Values
PAKDD '01 Proceedings of the 5th Pacific-Asia Conference on Knowledge Discovery and Data Mining
Enhancing Effectiveness of Outlier Detections for Low Density Patterns
PAKDD '02 Proceedings of the 6th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining
On Data Clustering Analysis: Scalability, Constraints, and Validation
PAKDD '02 Proceedings of the 6th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining
Efficiently Mining Gene Expression Data via Integrated Clustering and Validation Techniques
PAKDD '02 Proceedings of the 6th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining
SNN: A Supervised Clustering Algorithm
Proceedings of the 14th International conference on Industrial and engineering applications of artificial intelligence and expert systems: engineering of intelligent systems
Building an Information and Knowledge Fusion System
Proceedings of the 14th International conference on Industrial and engineering applications of artificial intelligence and expert systems: engineering of intelligent systems
DROLAP - A Dense-Region Based Approach to On-Line Analytical Processing
DEXA '99 Proceedings of the 10th International Conference on Database and Expert Systems Applications
Declustering Spatial Objects by Clustering for Parallel Disks
DEXA '01 Proceedings of the 12th International Conference on Database and Expert Systems Applications
Efficient Automated Mining of Fuzzy Association Rules
DEXA '02 Proceedings of the 13th International Conference on Database and Expert Systems Applications
RecTree: An Efficient Collaborative Filtering Method
DaWaK '01 Proceedings of the Third International Conference on Data Warehousing and Knowledge Discovery
Collective, Hierarchical Clustering from Distributed, Heterogeneous Data
Revised Papers from Large-Scale Parallel Data Mining, Workshop on Large-Scale Parallel KDD Systems, SIGKDD
Reengineering legacy systems for distributed environments
Journal of Systems and Software
A survey on wavelet applications in data mining
ACM SIGKDD Explorations Newsletter
Maintaining variance and k-medians over data stream windows
Proceedings of the twenty-second ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
SyMP: an efficient clustering approach to identify clusters of arbitrary shapes in large data sets
Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
A robust and efficient clustering algorithm based on cohesion self-merging
Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Clustering Data Streams: Theory and Practice
IEEE Transactions on Knowledge and Data Engineering
P-AutoClass: Scalable Parallel Clustering for Mining Large Data Sets
IEEE Transactions on Knowledge and Data Engineering
Approximation algorithms for projective clustering
Journal of Algorithms
A Scalable Parallel Subspace Clustering Algorithm for Massive Data Sets
ICPP '00 Proceedings of the 2000 International Conference on Parallel Processing
Efficiently Detecting Arbitrary Shaped Clusters in Image Databases
ICTAI '99 Proceedings of the 11th IEEE International Conference on Tools with Artificial Intelligence
Interactive Data Analysis on Numeric-Data
IDEAS '99 Proceedings of the 1999 International Symposium on Database Engineering & Applications
Clustering in very large databases based on distance and density
Journal of Computer Science and Technology
PHC: a fast partition and hierarchy-based clustering algorithm
Journal of Computer Science and Technology
Clustering binary data streams with K-means
DMKD '03 Proceedings of the 8th ACM SIGMOD workshop on Research issues in data mining and knowledge discovery
A New Cluster Isolation Criterion Based on Dissimilarity Increments
IEEE Transactions on Pattern Analysis and Machine Intelligence
Efficient Biased Sampling for Approximate Clustering and Outlier Detection in Large Data Sets
IEEE Transactions on Knowledge and Data Engineering
IEEE Transactions on Knowledge and Data Engineering
Graph-based hierarchical conceptual clustering
The Journal of Machine Learning Research
Self-Organizing-Map Based Clustering Using a Local Clustering Validity Index
Neural Processing Letters
Validating and Refining Clusters via Visual Rendering
ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
Cluster rendering of skewed datasets via visualization
Proceedings of the 2003 ACM symposium on Applied computing
Classifying large data sets using SVMs with hierarchical clusters
Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
A Monotonic On-Line Linear Algorithm for Hierarchical Agglomerative Classification
Information Technology and Management
Efficient data mining for calling path patterns in GSM networks
Information Systems
GraphZip: a fast and automatic compression method for spatial data clustering
Proceedings of the 2004 ACM symposium on Applied computing
A Human-Computer Interactive Method for Projected Clustering
IEEE Transactions on Knowledge and Data Engineering
Statistical grid-based clustering over data streams
ACM SIGMOD Record
A novel genetic algorithm for automatic clustering
Pattern Recognition Letters
ItCompress: An Iterative Semantic Compression Algorithm
ICDE '04 Proceedings of the 20th International Conference on Data Engineering
LDC: Enabling Search By Partial Distance In A Hyper-Dimensional Space
ICDE '04 Proceedings of the 20th International Conference on Data Engineering
Leaders-subleaders: an efficient hierarchical clustering algorithm for large data sets
Pattern Recognition Letters
Hypergraph Models and Algorithms for Data-Pattern-Based Clustering
Data Mining and Knowledge Discovery
Space-efficient cubes for OLAP range-sum queries
Decision Support Systems
Clustering objects on a spatial network
SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
MAIDS: mining alarming incidents from data streams
SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
A k-Median Algorithm with Running Time Independent of Data Size
Machine Learning
Document clustering via adaptive subspace iteration
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Efficient Disk-Based K-Means Clustering for Relational Databases
IEEE Transactions on Knowledge and Data Engineering
Diagonal Ordering: a new approach to high-dimensional KNN processing
ADC '04 Proceedings of the 15th Australasian database conference - Volume 27
Fully automatic cross-associations
Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Fast mining of spatial collocations
Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
A top-down approach for density-based clustering using multidimensional indexes
Journal of Systems and Software - Special issue: Performance modeling and analysis of computer systems and networks
HARP: A Practical Projected Clustering Algorithm
IEEE Transactions on Knowledge and Data Engineering
Compression schemes for differential categorical stream clustering
Proceedings of the thirteenth ACM international conference on Information and knowledge management
ClusterMap: labeling clusters in large datasets via visualization
Proceedings of the thirteenth ACM international conference on Information and knowledge management
IEEE Transactions on Knowledge and Data Engineering
Subspace clustering for high dimensional categorical data
ACM SIGKDD Explorations Newsletter
Clustering in Dynamic Spatial Databases
Journal of Intelligent Information Systems
Antipole Tree Indexing to Support Range Search and K-Nearest Neighbor Search in Metric Spaces
IEEE Transactions on Knowledge and Data Engineering
Elastic Translation Invariant Matching of Trajectories
Machine Learning
PODS '04 Proceedings of the twenty-third ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
An effective and efficient algorithm for high-dimensional outlier detection
The VLDB Journal — The International Journal on Very Large Data Bases
Combining Multiple Clusterings Using Evidence Accumulation
IEEE Transactions on Pattern Analysis and Machine Intelligence
Hierarchical Clustering Algorithms for Document Datasets
Data Mining and Knowledge Discovery
A novel grammar-based genetic programming approach to clustering
Proceedings of the 2005 ACM symposium on Applied computing
Factor matrix text filtering and clustering
Journal of the American Society for Information Science and Technology
iDistance: An adaptive B+-tree based indexing method for nearest neighbor search
ACM Transactions on Database Systems (TODS)
Automatic Subspace Clustering of High Dimensional Data
Data Mining and Knowledge Discovery
GCHL: A grid-clustering algorithm for high-dimensional very large spatial data bases
Pattern Recognition Letters
VISTA: validating and refining clusters via visualization
Information Visualization
A Shrinking-Based Clustering Approach for Multidimensional Data
IEEE Transactions on Knowledge and Data Engineering
A framework for mining topological patterns in spatio-temporal databases
Proceedings of the 14th ACM international conference on Information and knowledge management
Efficiently Mining Gene Expression Data via a Novel Parameterless Clustering Method
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Making SVMs Scalable to Large Data Sets using Hierarchical Cluster Indexing
Data Mining and Knowledge Discovery
Labeling Unclustered Categorical Data into Clusters Based on the Important Attribute Values
ICDM '05 Proceedings of the Fifth IEEE International Conference on Data Mining
Parameter-Free Spatial Data Mining Using MDL
ICDM '05 Proceedings of the Fifth IEEE International Conference on Data Mining
CLUMP: A Scalable and Robust Framework for Structure Discovery
ICDM '05 Proceedings of the Fifth IEEE International Conference on Data Mining
Enhancing Data Analysis with Noise Removal
IEEE Transactions on Knowledge and Data Engineering
Unsupervised clustering on dynamic databases
Pattern Recognition Letters
A parallel hybrid web document clustering algorithm and its performance study
The Journal of Supercomputing - Special issue: Parallel and distributed processing and applications
A data mining course for computer science: primary sources and implementations
Proceedings of the 37th SIGCSE technical symposium on Computer science education
Integrating XML data sources using approximate joins
ACM Transactions on Database Systems (TODS)
Hypothesis oriented cluster analysis in data mining by visualization
Proceedings of the working conference on Advanced visual interfaces
Analyzing user's behavior on a video database
MDM '05 Proceedings of the 6th international workshop on Multimedia data mining: mining integrated media and complex data
QROCK: A quick version of the ROCK algorithm for clustering of categorical data
Pattern Recognition Letters
Adherence clustering: an efficient method for mining market-basket clusters
Information Systems
MPM: a hierarchical clustering algorithm using matrix partitioning method for non-numeric data
Journal of Intelligent Information Systems
Indexed-based density biased sampling for clustering applications
Data & Knowledge Engineering
PENS: an algorithm for density-based clustering in peer-to-peer systems
InfoScale '06 Proceedings of the 1st international conference on Scalable information systems
iVIBRATE: Interactive visualization-based framework for clustering large datasets
ACM Transactions on Information Systems (TOIS)
Detecting outliers using transduction and statistical testing
Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Robust information-theoretic clustering
Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
K-means clustering versus validation measures: a data distribution perspective
Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
A scaleable document clustering approach for large document corpora
Information Processing and Management: an International Journal
Finding centric local outliers in categorical/numerical spaces
Knowledge and Information Systems
LinkClus: efficient clustering via heterogeneous semantic links
VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Effective document clustering for large heterogeneous law firm collections
ICAIL '05 Proceedings of the 10th international conference on Artificial intelligence and law
ST-DBSCAN: An algorithm for clustering spatial-temporal data
Data & Knowledge Engineering
Gradual model generator for single-pass clustering
Pattern Recognition
Fast similarity join for multi-dimensional data
Information Systems
A clustering algorithm based on maximal θ-distant subtrees
Pattern Recognition
Novel multi-centroid, multi-run sampling schemes for K-medoids-based algorithms
International Journal of Knowledge-based and Intelligent Engineering Systems
pPOP: Fast yet accurate parallel hierarchical clustering using partitioning
Data & Knowledge Engineering
Classification of large data sets with mixture models via sufficient EM
Computational Statistics & Data Analysis
Towards higher disk head utilization: extracting free bandwidth from busy disk drives
OSDI'00 Proceedings of the 4th conference on Symposium on Operating System Design & Implementation - Volume 4
Exploiting parallelism to support scalable hierarchical clustering
Journal of the American Society for Information Science and Technology
Quality-Aware Sampling and Its Applications in Incremental Data Mining
IEEE Transactions on Knowledge and Data Engineering
Clustering methodologies for identifying country core competencies
Journal of Information Science
A new data clustering approach: Generalized cellular automata
Information Systems
A k-mean clustering algorithm for mixed numeric and categorical data
Data & Knowledge Engineering
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
A Sketch Algorithm for Estimating Two-Way and Multi-Way Associations
Computational Linguistics
Algorithms for clustering high dimensional and distributed data
Intelligent Data Analysis
Evolutionary model selection in unsupervised learning
Intelligent Data Analysis
Mining association rules using clustering
Intelligent Data Analysis
RIC: Parameter-free noise-robust clustering
ACM Transactions on Knowledge Discovery from Data (TKDD)
Top-Down Parameter-Free Clustering of High-Dimensional Categorical Data
IEEE Transactions on Knowledge and Data Engineering
A framework for clustering evolving data streams
VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
A shrinking-based approach for multi-dimensional data analysis
VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
A framework for projected clustering of high dimensional data streams
VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Proceedings of the ACM first Ph.D. workshop in CIKM
LEGClust—A Clustering Algorithm Based on Layered Entropic Subgraphs
IEEE Transactions on Pattern Analysis and Machine Intelligence
Nugget discovery in visual exploration environments by query consolidation
Proceedings of the sixteenth ACM conference on information and knowledge management
Grid-based subspace clustering over data streams
Proceedings of the sixteenth ACM conference on information and knowledge management
Cluster By: a new sql extension for spatial data aggregation
Proceedings of the 15th annual ACM international symposium on Advances in geographic information systems
Accelerating k-medoid-based algorithms through metric access methods
Journal of Systems and Software
ACOS'07 Proceedings of the 6th Conference on WSEAS International Conference on Applied Computer Science - Volume 6
A density-based cluster validity approach using multi-representatives
Pattern Recognition Letters
Clustering techniques utilized in web usage mining
AIKED'06 Proceedings of the 5th WSEAS International Conference on Artificial Intelligence, Knowledge Engineering and Data Bases
Cluster validity measurement for arbitrary shaped clusters
AIKED'06 Proceedings of the 5th WSEAS International Conference on Artificial Intelligence, Knowledge Engineering and Data Bases
Cluster validity measurement techniques
AIKED'06 Proceedings of the 5th WSEAS International Conference on Artificial Intelligence, Knowledge Engineering and Data Bases
A scalable sampling scheme for clustering in network traffic analysis
Proceedings of the 2nd international conference on Scalable information systems
Fast adaptive clustering for very large datasets
ICCOMP'05 Proceedings of the 9th WSEAS International Conference on Computers
Effective clustering and boundary detection algorithm based on Delaunay triangulation
Pattern Recognition Letters
A general grid-clustering approach
Pattern Recognition Letters
Mining multiple-level fuzzy blocks from multidimensional data
Fuzzy Sets and Systems
Approximation algorithms for clustering uncertain data
Proceedings of the twenty-seventh ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Enhanced correlation search technique for clustering cancer gene expression data
SSIP'06 Proceedings of the 6th WSEAS International Conference on Signal, Speech and Image Processing
Data utility and privacy protection trade-off in k-anonymisation
PAIS '08 Proceedings of the 2008 international workshop on Privacy and anonymity in information society
Website browsing aid: A navigation graph-based recommendation system
Decision Support Systems
Tree-based partition querying: a methodology for computing medoids in large spatial datasets
The VLDB Journal — The International Journal on Very Large Data Bases
SS-ClusterTree: a subspace clustering based indexing algorithm over high-dimensional image features
CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
Mining typical patterns from databases
Information Sciences: an International Journal
Summarizing spatial data streams using ClusterHulls
Journal of Experimental Algorithmics (JEA)
Improved search strategies and extensions to k-medoids-based clustering algorithms
International Journal of Business Intelligence and Data Mining
The 3DVDM Approach: A Case Study with Clickstream Data
Visual Data Mining
An Efficient Algorithm for Clustering Search Engine Results
Computational Intelligence and Security
Clustering Streaming Time Series Using CBC
ICCS '07 Proceedings of the 7th international conference on Computational Science, Part III: ICCS 2007
A Clustering Algorithm Based on Adaptive Subcluster Merging
CAI '07 Proceedings of the 20th conference of the Canadian Society for Computational Studies of Intelligence on Advances in Artificial Intelligence
Varying Density Spatial Clustering Based on a Hierarchical Tree
MLDM '07 Proceedings of the 5th international conference on Machine Learning and Data Mining in Pattern Recognition
A Novel Spatial Clustering Algorithm with Sampling
MDAI '07 Proceedings of the 4th international conference on Modeling Decisions for Artificial Intelligence
A Visual and Interactive Data Exploration Method for Large Data Sets and Clustering
ADMA '07 Proceedings of the 3rd international conference on Advanced Data Mining and Applications
Patch Relational Neural Gas - Clustering of Huge Dissimilarity Datasets
ANNPR '08 Proceedings of the 3rd IAPR workshop on Artificial Neural Networks in Pattern Recognition
Approximate Clustering of Noisy Biomedical Data
ICCS '08 Proceedings of the 8th international conference on Computational Science, Part I
Finding Arbitrary Shaped Clusters for Character Recognition
ICIAR '08 Proceedings of the 5th international conference on Image Analysis and Recognition
Mining Top-n Local Outliers in Constrained Spatial Networks
ADMA '08 Proceedings of the 4th international conference on Advanced Data Mining and Applications
Hierarchical, Parameter-Free Community Discovery
ECML PKDD '08 Proceedings of the European conference on Machine Learning and Knowledge Discovery in Databases - Part II
Clustering Distributed Sensor Data Streams
ECML PKDD '08 Proceedings of the European conference on Machine Learning and Knowledge Discovery in Databases - Part II
Finding groups in data: Cluster analysis with ants
Applied Soft Computing
DIVFRP: An automatic divisive hierarchical clustering method based on the furthest reference points
Pattern Recognition Letters
Accurate localization of low-level radioactive source under noise and measurement errors
Proceedings of the 6th ACM conference on Embedded network sensor systems
Feature-preserved sampling over streaming data
ACM Transactions on Knowledge Discovery from Data (TKDD)
Determining the best K for clustering transactional datasets: A coverage density-based approach
Data & Knowledge Engineering
Development of a mechanism for ontology-based product lifecycle knowledge integration
Expert Systems with Applications: An International Journal
Scalable 2-Pass Data Mining Technique for Large Scale Spatio-temporal Datasets
KES '07 Knowledge-Based Intelligent Information and Engineering Systems and the XVII Italian Workshop on Neural Networks on Proceedings of the 11th International Conference
NPClu: An approach for clustering spatially extended objects
Intelligent Data Analysis
Multifractal-based cluster hierarchy optimisation algorithm
International Journal of Business Intelligence and Data Mining
A multi-prototype clustering algorithm
Pattern Recognition
A search space reduction methodology for data mining in large databases
Engineering Applications of Artificial Intelligence
Efficiently tracing clusters over high-dimensional on-line data streams
Data & Knowledge Engineering
Patch clustering for massive data sets
Neurocomputing
Nonlinear Data Analysis Using a New Hybrid Data Clustering Algorithm
PAKDD '09 Proceedings of the 13th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining
PFHC: A clustering algorithm based on data partitioning for unevenly distributed datasets
Fuzzy Sets and Systems
Effective spatial clustering methods for optimal facility establishment
Intelligent Data Analysis
Novelty detection with application to data streams
Intelligent Data Analysis - Knowledge Discovery from Data Streams
Top-k typicality queries and efficient query answering methods on large databases
The VLDB Journal — The International Journal on Very Large Data Bases
Preface: an overview on learning from data streams
New Generation Computing
Median Topographic Maps for Biomedical Data Sets
Similarity-Based Clustering
Two Stage Knowledge Discovery for Spatio-temporal Radio-emission Data
Proceedings of the 2008 conference on ECAI 2008: 18th European Conference on Artificial Intelligence
An adaptive flocking algorithm for performing approximate clustering
Information Sciences: an International Journal
NPUST: An Efficient Clustering Algorithm Using Partition Space Technique for Large Databases
IEA/AIE '09 Proceedings of the 22nd International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems: Next-Generation Applied Intelligence
GF-DBSCAN: a new efficient and effective data clustering technique for large databases
MUSP'09 Proceedings of the 9th WSEAS international conference on Multimedia systems & signal processing
C-DBSCAN: Density-Based Clustering with Constraints
RSFDGrC '07 Proceedings of the 11th International Conference on Rough Sets, Fuzzy Sets, Data Mining and Granular Computing
Journal of Data and Information Quality (JDIQ)
NIVA: a Robust cluster validity
ICCOM'08 Proceedings of the 12th WSEAS international conference on Communications
Proceedings of the 46th Annual Southeast Regional Conference on XX
Proceedings of the 46th Annual Southeast Regional Conference on XX
AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
GeoSOM Suite: A Tool for Spatial Clustering
ICCSA '09 Proceedings of the International Conference on Computational Science and Its Applications: Part I
CSBIterKmeans: A New Clustering Algorithm Based on Quantitative Assessment of the Clustering Quality
MLDM '09 Proceedings of the 6th International Conference on Machine Learning and Data Mining in Pattern Recognition
Cancer class prediction: Two stage clustering approach to identify informative genes
Intelligent Data Analysis
Computation of initial modes for K-modes clustering algorithm using evidence accumulation
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Clustering heterogeneous data using clustering by compression
ICCOMP'09 Proceedings of the WSEAES 13th international conference on Computers
Clustering in the membership embedding space
International Journal of Knowledge Engineering and Soft Data Paradigms
Rough-DBSCAN: A fast hybrid density based clustering method for large data sets
Pattern Recognition Letters
Extending fuzzy and probabilistic clustering to very large data sets
Computational Statistics & Data Analysis
Indexing 3-D human motion repositories for content-based retrieval
IEEE Transactions on Information Technology in Biomedicine - Special section on computational intelligence in medical systems
On-line discovery of flock patterns in spatio-temporal data
Proceedings of the 17th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems
Robustness of density-based clustering methods with various neighborhood relations
Fuzzy Sets and Systems
K-means clustering versus validation measures: a data-distribution perspective
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
SPARCL: an effective and efficient algorithm for mining arbitrary shape-based clusters
Knowledge and Information Systems
EIDBSCAN: An Extended Improving DBSCAN algorithm with sampling techniques
International Journal of Business Intelligence and Data Mining
Using trees to depict a forest
Proceedings of the VLDB Endowment
Modeling and querying possible repairs in duplicate detection
Proceedings of the VLDB Endowment
Intelligent Data Granulation on Load: Improving Infobright's Knowledge Grid
FGIT '09 Proceedings of the 1st International Conference on Future Generation Information Technology
Improved Visual Clustering through Unsupervised Dimensionality Reduction
RSFDGrC '09 Proceedings of the 12th International Conference on Rough Sets, Fuzzy Sets, Data Mining and Granular Computing
Adherence clustering: an efficient method for mining market-basket clusters
Information Systems
An incremental clustering scheme for data de-duplication
Data Mining and Knowledge Discovery
Clustering large data sets based on data compression technique and weighted quality measures
FUZZ-IEEE'09 Proceedings of the 18th international conference on Fuzzy Systems
eCCV: a new fuzzy cluster validity measure for large relational bioinformatics datasets
FUZZ-IEEE'09 Proceedings of the 18th international conference on Fuzzy Systems
A new method for clustering heterogeneous data: clustering by compression
WSEAS Transactions on Computers
Active constrained clustering with multiple cluster representatives
SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
Reducing metadata complexity for faster table summarization
Proceedings of the 13th International Conference on Extending Database Technology
Clustering of time series data-a survey
Pattern Recognition
Communication-Efficient Privacy-Preserving Clustering
Transactions on Data Privacy
Anomaly intrusion detection by clustering transactional audit streams in a host computer
Information Sciences: an International Journal
Data clustering: 50 years beyond K-means
Pattern Recognition Letters
Connection network and optimization of interest metric for one-to-one marketing
GECCO'03 Proceedings of the 2003 international conference on Genetic and evolutionary computation: PartII
Mining comprehensible clustering rules with an evolutionary algorithm
GECCO'03 Proceedings of the 2003 international conference on Genetic and evolutionary computation: PartII
An applicable hierarchical clustering algorithm for content-based image retrieval
MIRAGE'07 Proceedings of the 3rd international conference on Computer vision/computer graphics collaboration techniques
Coclustering based parcellation of human brain cortex using diffusion tensor MRI
ISBRA'07 Proceedings of the 3rd international conference on Bioinformatics research and applications
Data mining as an automated service
PAKDD'03 Proceedings of the 7th Pacific-Asia conference on Advances in knowledge discovery and data mining
AGRID: an efficient algorithm for clustering large high-dimensional datasets
PAKDD'03 Proceedings of the 7th Pacific-Asia conference on Advances in knowledge discovery and data mining
Multi-level clustering and reasoning about its clusters using region connection calculus
PAKDD'03 Proceedings of the 7th Pacific-Asia conference on Advances in knowledge discovery and data mining
HOT: hypergraph-based outlier test for categorical data
PAKDD'03 Proceedings of the 7th Pacific-Asia conference on Advances in knowledge discovery and data mining
DBRS: a density-based spatial clustering method with random sampling
PAKDD'03 Proceedings of the 7th Pacific-Asia conference on Advances in knowledge discovery and data mining
Incremental clustering in geography and optimization spaces
PAKDD'07 Proceedings of the 11th Pacific-Asia conference on Advances in knowledge discovery and data mining
A clustering algorithm based on mechanics
PAKDD'07 Proceedings of the 11th Pacific-Asia conference on Advances in knowledge discovery and data mining
BRIM: an efficient boundary points detecting algorithm
PAKDD'07 Proceedings of the 11th Pacific-Asia conference on Advances in knowledge discovery and data mining
Applying data mining techniques to analyze alert data
APWeb'03 Proceedings of the 5th Asia-Pacific web conference on Web technologies and applications
A new greedy algorithm for improving b-coloring clustering
GbRPR'07 Proceedings of the 6th IAPR-TC-15 international conference on Graph-based representations in pattern recognition
Speeding up clustering-based k-anonymisation algorithms with pre-partitioning
BNCOD'07 Proceedings of the 24th British national conference on Databases
Outlier detection with streaming dyadic decomposition
ICDM'07 Proceedings of the 7th industrial conference on Advances in data mining: theoretical aspects and applications
A search space reduction methodology for large databases: a case study
ICDM'07 Proceedings of the 7th industrial conference on Advances in data mining: theoretical aspects and applications
Network thinking and network intelligence
WImBI'06 Proceedings of the 1st WICI international conference on Web intelligence meets brain informatics
PReMI'07 Proceedings of the 2nd international conference on Pattern recognition and machine intelligence
Clustering moving objects in spatial networks
DASFAA'07 Proceedings of the 12th international conference on Database systems for advanced applications
Continuous medoid queries over moving objects
SSTD'07 Proceedings of the 10th international conference on Advances in spatial and temporal databases
Contextual adaptive clustering of web and text documents with personalization
MCD'07 Proceedings of the 3rd ECML/PKDD international conference on Mining complex data
A creditable subspace labeling method based on D-S evidence theory
PAKDD'08 Proceedings of the 12th Pacific-Asia conference on Advances in knowledge discovery and data mining
Quantization-based clustering algorithm
Pattern Recognition
High-dimensional indexing: transformational approaches to high-dimensional range and similarity searches
Chinese web comments clustering analysis with a two-phase method
FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 1
Journal of Network and Computer Applications
Integrating induction and deduction for noisy data mining
Information Sciences: an International Journal
DASFAA'08 Proceedings of the 13th international conference on Database systems for advanced applications
Scalable Clustering for Mining Local-Correlated Clusters in High Dimensions and Large Datasets
Fundamenta Informaticae - Intelligent Data Analysis in Granular Computing
On cluster tree for nested and multi-density data clustering
Pattern Recognition
A fast divisive clustering algorithm using an improved discrete particle swarm optimizer
Pattern Recognition Letters
Enhancing principal direction divisive clustering
Pattern Recognition
Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Integrated Computer-Aided Engineering
Expert Systems with Applications: An International Journal
Density-based semi-supervised clustering
Data Mining and Knowledge Discovery
A novel intrusion detection system based on hierarchical clustering and support vector machines
Expert Systems with Applications: An International Journal
Topographic mapping of large dissimilarity data sets
Neural Computation
Approximate pairwise clustering for large data sets via sampling plus extension
Pattern Recognition
Can shared-neighbor distances defeat the curse of dimensionality?
SSDBM'10 Proceedings of the 22nd international conference on Scientific and statistical database management
Stratified reservoir sampling over heterogeneous data streams
SSDBM'10 Proceedings of the 22nd international conference on Scientific and statistical database management
Collective taxonomizing: A collaborative approach to organizing document repositories
Decision Support Systems
A fuzzy trust evaluation method for knowledge sharing in virtual enterprises
Computers and Industrial Engineering
Toward improving re-coloring based clustering with graph b-coloring
PRICAI'10 Proceedings of the 11th Pacific Rim international conference on Trends in artificial intelligence
Coclustering for cross-subject fiber tract analysis through diffusion tensor imaging
IEEE Transactions on Information Technology in Biomedicine - Special section on affective and pervasive computing for healthcare
Parallelization of a hierarchical data clustering algorithm using OpenMP
IWOMP'05/IWOMP'06 Proceedings of the 2005 and 2006 international conference on OpenMP shared memory parallel programming
Proceedings of the 2010 conference on Artificial Intelligence Research and Development: Proceedings of the 13th International Conference of the Catalan Association for Artificial Intelligence
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
Towards improving subspace data analysis
Proceedings of the 48th Annual Southeast Regional Conference
Inter-dimensional fuzzy clustering
Proceedings of the 48th Annual Southeast Regional Conference
Distance-based outlier detection: consolidation and renewed bearing
Proceedings of the VLDB Endowment
The construction of an individual credit risk assessment method: based on the combination algorithms
ICICA'10 Proceedings of the First international conference on Information computing and applications
Expert Systems with Applications: An International Journal
Clustering distributed sensor data streams using local processing and reduced communication
Intelligent Data Analysis - Ubiquitous Knowledge Discovery
Multi-source shared nearest neighbours for multi-modal image clustering
Multimedia Tools and Applications
The discovery of hierarchical cluster structures assisted by a visualization technique
ICONIP'10 Proceedings of the 17th international conference on Neural information processing: theory and algorithms - Volume Part I
A new-fangled FES-k-Means clustering algorithm for disease discovery and visual analytics
EURASIP Journal on Bioinformatics and Systems Biology
Spatial neighborhood clustering based on data field
ADMA'10 Proceedings of the 6th international conference on Advanced data mining and applications: Part I
A top-down approach for hierarchical cluster exploration by visualization
ADMA'10 Proceedings of the 6th international conference on Advanced data mining and applications: Part I
An improved KNN based outlier detection algorithm for large datasets
ADMA'10 Proceedings of the 6th international conference on Advanced data mining and applications: Part I
Simulation of DNA damage clustering after proton irradiation using an adapted DBSCAN algorithm
Computer Methods and Programs in Biomedicine
A comparison of internal and external cluster validation indexes
AMERICAN-MATH'11/CEA'11 Proceedings of the 2011 American conference on applied mathematics and the 5th WSEAS international conference on Computer engineering and applications
Fast outlier detection for very large log data
Expert Systems with Applications: An International Journal
XML data clustering: An overview
ACM Computing Surveys (CSUR)
A survey on clustering in data mining
Proceedings of the International Conference & Workshop on Emerging Trends in Technology
Nemoz: a distributed framework for collaborative media organization
Ubiquitous knowledge discovery
Web document clustering based on web log mining
ICCOMP'06 Proceedings of the 10th WSEAS international conference on Computers
Nemoz: a distributed framework for collaborative media organization
Ubiquitous knowledge discovery
Minimum spanning tree based split-and-merge: A hierarchical clustering method
Information Sciences: an International Journal
Enhancing grid-density based clustering for high dimensional data
Journal of Systems and Software
Behavioural Proximity Discovery: an adaptive approach for root cause analysis
International Journal of Business Intelligence and Data Mining
A unique property of single-link distance and its application in data clustering
Data & Knowledge Engineering
Isolating top-k dense regions with filtration of sparse background
Pattern Recognition Letters
Spatial clustering to uncluttering map visualization in SOLAP
ICCSA'11 Proceedings of the 2011 international conference on Computational science and its applications - Volume Part I
SpectralCAT: Categorical spectral clustering of numerical and nominal data
Pattern Recognition
Partitioning hard clustering algorithms based on multiple dissimilarity matrices
Pattern Recognition
CloudVista: visual cluster exploration for extreme scale data in the cloud
SSDBM'11 Proceedings of the 23rd international conference on Scientific and statistical database management
A graph model for mutual information based clustering
Journal of Intelligent Information Systems
Effective monitoring by efficient fingerprint matching using a forest of NAQ-trees
Journal of Intelligent Information Systems
A survey: hybrid evolutionary algorithms for cluster analysis
Artificial Intelligence Review
EDA-USL: unsupervised clustering algorithm based on estimation of distribution algorithm
International Journal of Wireless and Mobile Computing
Anomaly intrusion detection based on clustering a data stream
ISC'06 Proceedings of the 9th international conference on Information Security
A new clustering approach for symbolic data and its validation: application to the healthcare data
ISMIS'06 Proceedings of the 16th international conference on Foundations of Intelligent Systems
Clustering scientific literature using sparse citation graph analysis
PKDD'06 Proceedings of the 10th European conference on Principle and Practice of Knowledge Discovery in Databases
A voronoi diagram approach to autonomous clustering
DS'06 Proceedings of the 9th international conference on Discovery Science
Localized alternative cluster ensembles for collaborative structuring
ECML'06 Proceedings of the 17th European conference on Machine Learning
A maximum profit coverage algorithm with application to small molecules cluster identification
WEA'06 Proceedings of the 5th international conference on Experimental Algorithms
Swarm-Based distributed clustering in peer-to-peer systems
EA'05 Proceedings of the 7th international conference on Artificial Evolution
Mining outliers in spatial networks
DASFAA'06 Proceedings of the 11th international conference on Database Systems for Advanced Applications
Ranking outliers using symmetric neighborhood relationship
PAKDD'06 Proceedings of the 10th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining
An auto-stopped hierarchical clustering algorithm for analyzing 3d model database
PKDD'05 Proceedings of the 9th European conference on Principles and Practice of Knowledge Discovery in Databases
Hybrid agglomerative clustering for large databases: an efficient interactivity approach
AI'05 Proceedings of the 18th Australian Joint conference on Advances in Artificial Intelligence
An auto-stopped hierarchical clustering algorithm integrating outlier detection algorithm
WAIM'05 Proceedings of the 6th international conference on Advances in Web-Age Information Management
A clustering algorithm based absorbing nearest neighbors
WAIM'05 Proceedings of the 6th international conference on Advances in Web-Age Information Management
HPCC'05 Proceedings of the First international conference on High Performance Computing and Communications
MICCAI'06 Proceedings of the 9th international conference on Medical Image Computing and Computer-Assisted Intervention - Volume Part II
HOV3: an approach to visual cluster analysis
ADMA'06 Proceedings of the Second international conference on Advanced Data Mining and Applications
iDISQUE: tuning high-dimensional similarity queries in DHT networks
DASFAA'10 Proceedings of the 15th international conference on Database Systems for Advanced Applications - Volume Part I
Clustering very large dissimilarity data sets
ANNPR'10 Proceedings of the 4th IAPR TC3 conference on Artificial Neural Networks in Pattern Recognition
Weighted k-means for density-biased clustering
DaWaK'05 Proceedings of the 7th international conference on Data Warehousing and Knowledge Discovery
Association based prefetching algorithm in mobile environments
ICESS'04 Proceedings of the First international conference on Embedded Software and Systems
High-dimensional shared nearest neighbor clustering algorithm
FSKD'05 Proceedings of the Second international conference on Fuzzy Systems and Knowledge Discovery - Volume Part II
A clustering model based on matrix approximation with applications to cluster system log files
ECML'05 Proceedings of the 16th European conference on Machine Learning
Improving k-means by outlier removal
SCIA'05 Proceedings of the 14th Scandinavian conference on Image Analysis
Succinct and informative cluster descriptions for document repositories
WAIM '06 Proceedings of the 7th international conference on Advances in Web-Age Information Management
Scalable clustering using graphics processors
WAIM '06 Proceedings of the 7th international conference on Advances in Web-Age Information Management
A distributed algorithm for outlier detection in a large database
DNIS'05 Proceedings of the 4th international conference on Databases in Networked Information Systems
Fuzzy self-organizing map neural network using kernel PCA and the application
ICNC'05 Proceedings of the First international conference on Advances in Natural Computation - Volume Part I
Dynamic pattern mining: an incremental data clustering approach
Journal on Data Semantics II
Clustering large dynamic datasets using exemplar points
MLDM'05 Proceedings of the 4th international conference on Machine Learning and Data Mining in Pattern Recognition
MLDM'05 Proceedings of the 4th international conference on Machine Learning and Data Mining in Pattern Recognition
On autonomous k-means clustering
ISMIS'05 Proceedings of the 15th international conference on Foundations of Intelligent Systems
Determining the number of clusters using information entropy for mixed data
Pattern Recognition
Medoid queries in large spatial databases
SSTD'05 Proceedings of the 9th international conference on Advances in Spatial and Temporal Databases
On discovering moving clusters in spatio-temporal data
SSTD'05 Proceedings of the 9th international conference on Advances in Spatial and Temporal Databases
Hierarchical clustering algorithm with combined criteria for large and complex similarity data
International Journal of Knowledge Engineering and Soft Data Paradigms
OTM'05 Proceedings of the 2005 OTM Confederated international conference on On the Move to Meaningful Internet Systems: CoopIS, COA, and ODBASE - Volume Part II
On approximation algorithms for data mining applications
Efficient Approximation and Online Algorithms
KIDBSCAN: a new efficient data clustering algorithm
ICAISC'06 Proceedings of the 8th international conference on Artificial Intelligence and Soft Computing
Non parametric local density-based clustering for multimodal overlapping distributions
IDEAL'06 Proceedings of the 7th international conference on Intelligent Data Engineering and Automated Learning
A new clustering algorithm based on k-means using a line segment as prototype
CIARP'11 Proceedings of the 16th Iberoamerican Congress conference on Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications
An approach to find embedded clusters using density based techniques
ICDCIT'05 Proceedings of the Second international conference on Distributed Computing and Internet Technology
Proceedings of the VLDB Endowment
A clustering approach using weighted similarity majority margins
ADMA'11 Proceedings of the 7th international conference on Advanced Data Mining and Applications - Volume Part I
Data clustering using bacterial foraging optimization
Journal of Intelligent Information Systems
Bootstrapping personal gesture shortcuts with the wisdom of the crowd and handwriting recognition
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
A fast and effective partitioning algorithm for document clustering
ICDEM'10 Proceedings of the Second international conference on Data Engineering and Management
Chaotic ant swarm approach for data clustering
Applied Soft Computing
SpaGRID: a spatial grid framework for high dimensional medical databases
HAIS'12 Proceedings of the 7th international conference on Hybrid Artificial Intelligent Systems - Volume Part I
Survey on particle swarm optimization based clustering analysis
SIDE'12 Proceedings of the 2012 international conference on Swarm and Evolutionary Computation
Mining temporal patterns in popularity of web items
Information Sciences: an International Journal
ACM Transactions on Knowledge Discovery from Data (TKDD)
Objective function-based clustering
Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery
Design and evaluation of decentralized online clustering
ACM Transactions on Autonomous and Adaptive Systems (TAAS)
SOStream: self organizing density-based clustering over data stream
MLDM'12 Proceedings of the 8th international conference on Machine Learning and Data Mining in Pattern Recognition
Cluster_KDD: a visual clustering and knowledge discovery platform based on concept lattice
ICSI'12 Proceedings of the Third international conference on Advances in Swarm Intelligence - Volume Part II
MOSAIC: a proximity graph approach for agglomerative clustering
DaWaK'07 Proceedings of the 9th international conference on Data Warehousing and Knowledge Discovery
Proceedings of the Second International Conference on Computational Science, Engineering and Information Technology
An effective approach based on rough set and topic cluster to build peer communities
ISPA'07 Proceedings of the 5th international conference on Parallel and Distributed Processing and Applications
Efficient stochastic algorithms for document clustering
Information Sciences: an International Journal
Hierarchical data organization for effective retrieval of similar shaders
Proceedings of the 2012 ACM Research in Applied Computation Symposium
Credit-Card fraud profiling using a hybrid incremental clustering methodology
SUM'12 Proceedings of the 6th international conference on Scalable Uncertainty Management
Knowledge augmentation via incremental clustering: new technology for effective knowledge management
International Journal of Business Information Systems
Mining neighbor-based patterns in data streams
Information Systems
On the use of consensus clustering for incremental learning of topic hierarchies
SBIA'12 Proceedings of the 21st Brazilian conference on Advances in Artificial Intelligence
ESC: An efficient synchronization-based clustering algorithm
Knowledge-Based Systems
Knowledge kanban system for virtual research and development
Robotics and Computer-Integrated Manufacturing
ASCCN: Arbitrary Shaped Clustering Method with Compatible Nucleoids
International Journal of Data Warehousing and Mining
Data Field for Hierarchical Clustering
International Journal of Data Warehousing and Mining
Spatial Clustering in SOLAP Systems to Enhance Map Visualization
International Journal of Data Warehousing and Mining
Hamming Distance based Clustering Algorithm
International Journal of Information Retrieval Research
Weighted Fuzzy-Possibilistic C-Means Over Large Data Sets
International Journal of Data Warehousing and Mining
A data partitioning approach for hierarchical clustering
Proceedings of the 7th International Conference on Ubiquitous Information Management and Communication
Graphics hardware based efficient and scalable fuzzy c-means clustering
AusDM '08 Proceedings of the 7th Australasian Data Mining Conference - Volume 87
Scalable fine-grained behavioral clustering of HTTP-based malware
Computer Networks: The International Journal of Computer and Telecommunications Networking
Efficient event detection by exploiting crowds
Proceedings of the 7th ACM international conference on Distributed event-based systems
A sample-based hierarchical adaptive K-means clustering method for large-scale video retrieval
Knowledge-Based Systems
TSum: fast, principled table summarization
Proceedings of the Seventh International Workshop on Data Mining for Online Advertising
Clustering based on a near neighbor graph and a grid cell graph
Journal of Intelligent Information Systems
Similarity queries: their conceptual evaluation, transformations, and processing
The VLDB Journal — The International Journal on Very Large Data Bases
Automatic player behavior analysis system using trajectory data in a massive multiplayer online game
Multimedia Tools and Applications
CoBi: Pattern Based Co-Regulated Biclustering of Gene Expression Data
Pattern Recognition Letters
CRUDAW: a novel fuzzy technique for clustering records following user defined attribute weights
AusDM '12 Proceedings of the Tenth Australasian Data Mining Conference - Volume 134
MAR: Maximum Attribute Relative of soft set for clustering attribute selection
Knowledge-Based Systems
An automated search space reduction methodology for large databases
ICDM'13 Proceedings of the 13th international conference on Advances in Data Mining: applications and theoretical aspects
Energy-based function to evaluate data stream clustering
Advances in Data Analysis and Classification
Adaptive stratified reservoir sampling over heterogeneous data streams
Information Systems
Mining stable patterns in multiple correlated databases
Decision Support Systems
Learning motion patterns in unstructured scene based on latent structural information
Journal of Visual Languages and Computing
Survey of Clustering: Algorithms and Applications
International Journal of Information Retrieval Research
A re-coloring approach for graph b-coloring based clustering
International Journal of Knowledge-based and Intelligent Engineering Systems
Subspace clustering of high-dimensional data: an evolutionary approach
Applied Computational Intelligence and Soft Computing
In data mining, clustering is useful for discovering groups and identifying interesting distributions in the underlying data. Traditional clustering algorithms either favor clusters with spherical shapes and similar sizes, or are very fragile in the presence of outliers. We propose a new clustering algorithm called CURE that is more robust to outliers, and identifies clusters having non-spherical shapes and wide variances in size. CURE achieves this by representing each cluster by a certain fixed number of points that are generated by selecting well scattered points from the cluster and then shrinking them toward the center of the cluster by a specified fraction. Having more than one representative point per cluster allows CURE to adjust well to the geometry of non-spherical shapes, and the shrinking helps to dampen the effects of outliers. To handle large databases, CURE employs a combination of random sampling and partitioning. A random sample drawn from the data set is first partitioned and each partition is partially clustered. The partial clusters are then clustered in a second pass to yield the desired clusters. Our experimental results confirm that the quality of clusters produced by CURE is much better than that of clusters found by existing algorithms. Furthermore, they demonstrate that random sampling and partitioning enable CURE not only to outperform existing algorithms but also to scale well for large databases without sacrificing clustering quality.
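The representative-point idea described above can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how the well scattered points are chosen, so the farthest-point heuristic used here is an assumption, as are the function names and the parameter names `c` (number of representatives) and `alpha` (shrink fraction). The cluster center is taken to be the centroid.

```python
import math


def centroid(points):
    """Mean of a non-empty list of equal-length coordinate tuples."""
    n = len(points)
    dim = len(points[0])
    return tuple(sum(p[d] for p in points) / n for d in range(dim))


def scattered_points(points, c):
    """Pick up to c well scattered points (assumed farthest-point heuristic).

    The first pick is the point farthest from the centroid; each later pick
    is the point whose nearest already-chosen point is as far away as possible.
    """
    center = centroid(points)
    remaining = list(range(len(points)))
    chosen = []
    while remaining and len(chosen) < c:
        if not chosen:
            key = lambda i: math.dist(points[i], center)
        else:
            key = lambda i: min(math.dist(points[i], points[j]) for j in chosen)
        best = max(remaining, key=key)
        chosen.append(best)
        remaining.remove(best)
    return [points[i] for i in chosen]


def shrink(points, center, alpha):
    """Move each point toward center by the fraction alpha (0 = no move)."""
    return [tuple(x + alpha * (m - x) for x, m in zip(p, center)) for p in points]


def representatives(points, c=4, alpha=0.5):
    """Scattered points of one cluster, shrunk toward its centroid."""
    return shrink(scattered_points(points, c), centroid(points), alpha)


if __name__ == "__main__":
    cluster = [(0, 0), (4, 0), (0, 4), (4, 4), (2, 2)]
    print(representatives(cluster, c=4, alpha=0.5))
```

With `alpha = 0` the representatives are the scattered points themselves, and with `alpha = 1` they all collapse onto the centroid, so the fraction trades sensitivity to cluster shape against sensitivity to outliers.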