Algorithms for clustering data
Algorithms for clustering data
BIRCH: an efficient data clustering method for very large databases
SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
CURE: an efficient clustering algorithm for large databases
SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
OPTICS: ordering points to identify the clustering structure
SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
Hancock: a language for extracting signatures from data streams
Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining
Mining high-speed data streams
Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining
Scalability for clustering algorithms revisited
ACM SIGKDD Explorations Newsletter
Models and issues in data stream systems
Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Efficient and Effective Clustering Methods for Spatial Data Mining
VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
FOCS '00 Proceedings of the 41st Annual Symposium on Foundations of Computer Science
A framework for diagnosing changes in evolving data streams
Proceedings of the 2003 ACM SIGMOD international conference on Management of data
Streaming-Data Algorithms for High-Quality Clustering
ICDE '02 Proceedings of the 18th International Conference on Data Engineering
Cost-efficient mining techniques for data streams
ACSW Frontiers '04 Proceedings of the second workshop on Australasian information security, Data Mining and Web Intelligence, and Software Internationalisation - Volume 32
Online event-driven subsequence matching over financial data streams
SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
Incremental and effective data summarization for dynamic hierarchical clustering
SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
MAIDS: mining alarming incidents from data streams
SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
On demand classification of data streams
Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Wavelet fuzzy classification for detecting and tracking region outliers in meteorological data
Proceedings of the 12th annual ACM international workshop on Geographic information systems
On Change Diagnosis in Evolving Data Streams
IEEE Transactions on Knowledge and Data Engineering
WWW '05 Proceedings of the 14th international conference on World Wide Web
Agents and Stream Data Mining: A New Perspective
IEEE Intelligent Systems
Combining proactive and reactive predictions for data streams
Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
Streaming pattern discovery in multiple time-series
VLDB '05 Proceedings of the 31st international conference on Very large data bases
ACM SIGMOD Record
Generalized Dimension-Reduction Framework for Recent-Biased Time Series Analysis
IEEE Transactions on Knowledge and Data Engineering
Stream Cube: An Architecture for Multi-Dimensional Analysis of Data Streams
Distributed and Parallel Databases
2005 Special Issue: Efficient streaming text clustering
Neural Networks - 2005 Special issue: IJCNN 2005
A Framework for On-Demand Classification of Evolving Data Streams
IEEE Transactions on Knowledge and Data Engineering
Proceedings of the 2006 ACM symposium on Applied computing
DSM-PLW: single-pass mining of path traversal patterns over streaming web click-sequences
Computer Networks: The International Journal of Computer and Telecommunications Networking - Web dynamics
Adaptive Clustering for Multiple Evolving Streams
IEEE Transactions on Knowledge and Data Engineering
Adaptive non-linear clustering in data streams
CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
Collaborative filtering in dynamic usage environments
CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
Supporting dynamic migration in tightly coupled grid applications
Proceedings of the 2006 ACM/IEEE conference on Supercomputing
Maintaining stream statistics over multiscale sliding windows
ACM Transactions on Database Systems (TODS)
Can exclusive clustering on streaming data be achieved?
ACM SIGKDD Explorations Newsletter
GridRod: a dynamic runtime scheduler for grid workflows
Proceedings of the 21st annual international conference on Supercomputing
Cell trees: An adaptive synopsis structure for clustering multi-dimensional on-line data streams
Data & Knowledge Engineering
Density-based clustering for real-time stream data
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Evolutionary spectral clustering by incorporating temporal smoothness
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
A framework for classification and segmentation of massive audio data streams
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Anomaly detection in a mobile communication network
Computational & Mathematical Organization Theory
Clustering over Multiple Evolving Streams by Events and Correlations
IEEE Transactions on Knowledge and Data Engineering
Detecting change in data streams
VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
A framework for projected clustering of high dimensional data streams
VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Boolean representation based data-adaptive correlation analysis over time series streams
Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Grid-based subspace clustering over data streams
Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Continuous subspace clustering in streaming time series
Information Systems
Intelligent Data Analysis - Knowlegde Discovery from Data Streams
Efficient instance-based learning on data streams
Intelligent Data Analysis
A semi-random multiple decision-tree algorithm for mining data streams
Journal of Computer Science and Technology
Approximate mining of maximal frequent itemsets in data streams with different window models
Expert Systems with Applications: An International Journal
Discovering correlated spatio-temporal changes in evolving graphs
Knowledge and Information Systems
A bayesian mixture model with linear regression mixing proportions
Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Incremental tensor analysis: Theory and applications
ACM Transactions on Knowledge Discovery from Data (TKDD)
Summarizing spatial data streams using ClusterHulls
Journal of Experimental Algorithmics (JEA)
Clustering Streaming Time Series Using CBC
ICCS '07 Proceedings of the 7th international conference on Computational Science, Part III: ICCS 2007
A Coding Hierarchy Computing Based Clustering Algorithm
ADMA '07 Proceedings of the 3rd international conference on Advanced Data Mining and Applications
E-Stream: Evolution-Based Technique for Stream Clustering
ADMA '07 Proceedings of the 3rd international conference on Advanced Data Mining and Applications
Efficiently Discovering Recent Frequent Items in Data Streams
SSDBM '08 Proceedings of the 20th international conference on Scientific and Statistical Database Management
Clustering Distributed Sensor Data Streams
ECML PKDD '08 Proceedings of the European conference on Machine Learning and Knowledge Discovery in Databases - Part II
Data Streaming with Affinity Propagation
ECML PKDD '08 Proceedings of the European conference on Machine Learning and Knowledge Discovery in Databases - Part II
Memory efficient subspace clustering for online data streams
IDEAS '08 Proceedings of the 2008 international symposium on Database engineering & applications
Correlation-based load balancing for network intrusion detection and prevention systems
Proceedings of the 4th international conference on Security and privacy in communication netowrks
Incremental clustering of dynamic data streams using connectivity based representative points
Data & Knowledge Engineering
Mining frequent itemsets over data streams using efficient window sliding techniques
Expert Systems with Applications: An International Journal
CONTOUR: an efficient algorithm for discovering discriminating subsequences
Data Mining and Knowledge Discovery
A Scalable Framework For Segmenting Magnetic Resonance Images
Journal of Signal Processing Systems
ODMCA: An adaptive data mining control algorithm in multicarrier networks
Computer Communications
Adaptive correlation analysis in stream time series with sliding windows
Computers & Mathematics with Applications
Efficiently tracing clusters over high-dimensional on-line data streams
Data & Knowledge Engineering
Algorithms for clustering clickstream data
Information Processing Letters
Frequent items in streaming data: An experimental evaluation of the state-of-the-art
Data & Knowledge Engineering
Neighbor-based pattern detection for windows over streaming data
Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
An EM-Based Algorithm for Clustering Data Streams in Sliding Windows
DASFAA '09 Proceedings of the 14th International Conference on Database Systems for Advanced Applications
Efficiently Clustering Probabilistic Data Streams
APWeb/WAIM '09 Proceedings of the Joint International Conferences on Advances in Data and Web Management
Intervention Events Detection and Prediction in Data Streams
APWeb/WAIM '09 Proceedings of the Joint International Conferences on Advances in Data and Web Management
Clustering with Lower Bound on Similarity
PAKDD '09 Proceedings of the 13th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining
A framework for flexible clustering of multiple evolving data streams
International Journal of Advanced Intelligence Paradigms
On exploiting the power of time in data mining
ACM SIGKDD Explorations Newsletter
Online pairing of VoIP conversations
The VLDB Journal — The International Journal on Very Large Data Bases
Stream data clustering based on grid density and attraction
ACM Transactions on Knowledge Discovery from Data (TKDD)
Density-based clustering of data streams at multiple resolutions
ACM Transactions on Knowledge Discovery from Data (TKDD)
Preface: an overview on learning from data streams
New Generation Computing
A holistic approach for resource-aware adaptive data stream mining
New Generation Computing
Clustering over Evolving Data Streams Based on Online Recent-Biased Approximation
Knowledge Acquisition: Approaches, Algorithms and Applications
Distributed and Incremental Clustering Based on Weighted Affinity Propagation
Proceedings of the 2008 conference on STAIRS 2008: Proceedings of the Fourth Starting AI Researchers' Symposium
Stream Clustering Based on Kernel Density Estimation
Proceedings of the 2006 conference on ECAI 2006: 17th European Conference on Artificial Intelligence August 29 -- September 1, 2006, Riva del Garda, Italy
Journal of Data and Information Quality (JDIQ)
Incremental spectral clustering by efficiently updating the eigen-system
Pattern Recognition
Clustering data stream: A survey of algorithms
International Journal of Knowledge-based and Intelligent Engineering Systems
Harnessing the strengths of anytime algorithms for constant data streams
Data Mining and Knowledge Discovery
On classification and segmentation of massive audio data streams
Knowledge and Information Systems
Detecting Projected Outliers in High-Dimensional Data Streams
DEXA '09 Proceedings of the 20th International Conference on Database and Expert Systems Applications
Incremental and Adaptive Clustering Stream Data over Sliding Window
DEXA '09 Proceedings of the 20th International Conference on Database and Expert Systems Applications
History Guided Low-Cost Change Detection in Streams
DaWaK '09 Proceedings of the 11th International Conference on Data Warehousing and Knowledge Discovery
Detect and track latent factors with online nonnegative matrix factorization
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
On evolutionary spectral clustering
ACM Transactions on Knowledge Discovery from Data (TKDD)
Online Evaluation of Patterns from Evolving Web Data Streams
WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
Mining data streams with periodically changing distributions
Proceedings of the 18th ACM conference on Information and knowledge management
Cluster based rank query over multidimensional data streams
Proceedings of the 18th ACM conference on Information and knowledge management
Incremental Learning and Memory Consolidation of Whole Body Human Motion Primitives
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
A shared execution strategy for multiple pattern mining requests over streaming data
Proceedings of the VLDB Endowment
Identifying spectrum usage by unknown systems using experiments in machine learning
WCNC'09 Proceedings of the 2009 IEEE conference on Wireless Communications & Networking Conference
C-DenStream: Using Domain Knowledge on a Data Stream
DS '09 Proceedings of the 12th International Conference on Discovery Science
Stream Clustering of Growing Objects
DS '09 Proceedings of the 12th International Conference on Discovery Science
Efficient decision tree construction for mining time-varying data streams
CASCON '09 Proceedings of the 2009 Conference of the Center for Advanced Studies on Collaborative Research
Communication-Efficient Privacy-Preserving Clustering
Transactions on Data Privacy
Anomaly intrusion detection by clustering transactional audit streams in a host computer
Information Sciences: an International Journal
Data clustering: 50 years beyond K-means
Pattern Recognition Letters
Approximate trace of grid-based clusters over high dimensional data streams
PAKDD'07 Proceedings of the 11th Pacific-Asia conference on Advances in knowledge discovery and data mining
Flexible selection of wavelet coefficients based on the estimation error of predefined queries
PAKDD'07 Proceedings of the 2007 international conference on Emerging technologies in knowledge discovery and data mining
Connectivity based stream clustering using localised density exemplars
PAKDD'08 Proceedings of the 12th Pacific-Asia conference on Advances in knowledge discovery and data mining
HDG-tree: a structure for clustering high-dimensional data streams
IITA'09 Proceedings of the 3rd international conference on Intelligent information technology application
Clustering high dimensional data streams with representative points
FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 1
Density-based data streams clustering over sliding windows
FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 5
A framework to enforce access control over data streams
ACM Transactions on Information and System Security (TISSEC)
Interactive visual exploration of neighbor-based patterns in data streams
Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
MG-join: detecting phenomena and their correlation in high dimensional data streams
Distributed and Parallel Databases
Detecting outliers on arbitrary data streams using anytime approaches
Proceedings of the First International Workshop on Novel Data Stream Pattern Mining Techniques
Evolutionary clustering using frequent itemsets
Proceedings of the First International Workshop on Novel Data Stream Pattern Mining Techniques
Towards subspace clustering on dynamic data: an incremental version of PreDeCon
Proceedings of the First International Workshop on Novel Data Stream Pattern Mining Techniques
Data Mining and Knowledge Discovery
SKIF: a data imputation framework for concept drifting data streams
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Text stream clustering algorithm based on adaptive feature selection
Expert Systems with Applications: An International Journal
Discrete wavelet transform-based time series analysis and mining
ACM Computing Surveys (CSUR)
An efficient approach for mining segment-wise intervention rules in time-series streams
WAIM'10 Proceedings of the 11th international conference on Web-age information management
Data selection for exact value acquisition to improve uncertain clustering
WAIM'10 Proceedings of the 11th international conference on Web-age information management
A framework for clustering categorical time-evolving data
IEEE Transactions on Fuzzy Systems
Supporting self-adaptation in streaming data mining applications
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
Robust ensemble learning for mining noisy data streams
Decision Support Systems
Entity resolution with evolving rules
Proceedings of the VLDB Endowment
Distributed and Parallel Databases
A clustering algorithm based on matrix over high dimensional data stream
WISM'10 Proceedings of the 2010 international conference on Web information systems and mining
Clustering distributed sensor data streams using local processing and reduced communication
Intelligent Data Analysis - Ubiquitous Knowledge Discovery
Self-adaptive change detection in streaming data with non-stationary distribution
ADMA'10 Proceedings of the 6th international conference on Advanced data mining and applications: Part I
Research of fast SOM clustering for text information
Expert Systems with Applications: An International Journal
Proceedings of the 5th International Conference on Ubiquitous Information Management and Communication
L2GClust: local-to-global clustering of stream sources
Proceedings of the 2011 ACM Symposium on Applied Computing
Efficient decision tree re-alignment for clustering time-changing data streams
From active data management to event-based systems and more
XStreamCluster: an efficient algorithm for streaming XML data clustering
DASFAA'11 Proceedings of the 16th international conference on Database systems for advanced applications - Volume Part I
Precise anytime clustering of noisy sensor data with logarithmic complexity
Proceedings of the Fifth International Workshop on Knowledge Discovery from Sensor Data
INFORMS Journal on Computing
An effective evaluation measure for clustering on evolving data streams
Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Approximate kernel k-means: solution to large scale kernel clustering
Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Tracing evolving clusters by subspace and value similarity
PAKDD'11 Proceedings of the 15th Pacific-Asia conference on Advances in knowledge discovery and data mining - Volume Part II
Summarizing cluster evolution in dynamic environments
ICCSA'11 Proceedings of the 2011 international conference on Computational science and its applications - Volume Part II
Quality-driven resource-adaptive data stream mining?
ACM SIGKDD Explorations Newsletter
Density based subspace clustering over dynamic data
SSDBM'11 Proceedings of the 23rd international conference on Scientific and statistical database management
Hierarchical clustering for real-time stream data with noise
SSDBM'11 Proceedings of the 23rd international conference on Scientific and statistical database management
Concurrent semi-supervised learning of data streams
DaWaK'11 Proceedings of the 13th international conference on Data warehousing and knowledge discovery
Online and offline trend cluster discovery in spatially distributed data streams
MSM'10/MUSE'10 Proceedings of the 2010 international conference on Analysis of social media and ubiquitous data
MINETRAC: mining flows for unsupervised analysis & semi-supervised classification
Proceedings of the 23rd International Teletraffic Congress
A clustering algorithm for multiple data streams based on spectral component similarity
Information Sciences: an International Journal
CLUES: a unified framework supporting interactive exploration of density-based clusters in streams
Proceedings of the 20th ACM international conference on Information and knowledge management
Memory-less unsupervised clustering for data streaming by versatile ellipsoidal function
Proceedings of the 20th ACM international conference on Information and knowledge management
The algorithm APT to classify in concurrence of latency and drift
IDA'11 Proceedings of the 10th international conference on Advances in intelligent data analysis X
Summarization and matching of density-based clusters in streaming environments
Proceedings of the VLDB Endowment
A suspicious behaviour detection using a context space model for smart surveillance systems
Computer Vision and Image Understanding
A scalable distributed stream mining system for highway traffic data
PKDD'06 Proceedings of the 10th European conference on Principle and Practice of Knowledge Discovery in Databases
Knowledge-Conscious data clustering
PKDD'06 Proceedings of the 10th European conference on Principle and Practice of Knowledge Discovery in Databases
Efficient mining of emerging events in a dynamic spatiotemporal environment
PAKDD'06 Proceedings of the 10th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining
Shared execution strategy for neighbor-based pattern mining requests over streaming windows
ACM Transactions on Database Systems (TODS)
GCC'05 Proceedings of the 4th international conference on Grid and Cooperative Computing
Generalized projected clustering in high-dimensional data streams
APWeb'06 Proceedings of the 8th Asia-Pacific Web conference on Frontiers of WWW Research and Development
On futuristic query processing in data streams
EDBT'06 Proceedings of the 10th international conference on Advances in Database Technology
Maintaining gaussian mixture models of data streams under block evolution
ICCS'06 Proceedings of the 6th international conference on Computational Science - Volume Part I
Granularity adaptive density estimation and on demand clustering of concept-drifting data streams
DaWaK'06 Proceedings of the 8th international conference on Data Warehousing and Knowledge Discovery
Tracing Evolving Subspace Clusters in Temporal Climate Data
Data Mining and Knowledge Discovery
Incremental clustering for trajectories
DASFAA'10 Proceedings of the 15th international conference on Database Systems for Advanced Applications - Volume Part II
Attribute outlier detection over data streams
DASFAA'10 Proceedings of the 15th international conference on Database Systems for Advanced Applications - Volume Part II
SIC-means: a semi-fuzzy approach for clustering data streams using c-means
ANNPR'10 Proceedings of the 4th IAPR TC3 conference on Artificial Neural Networks in Pattern Recognition
Density-based hierarchical clustering for streaming data
Pattern Recognition Letters
Scalable clustering using graphics processors
WAIM '06 Proceedings of the 7th international conference on Advances in Web-Age Information Management
A grid-based clustering algorithm for high-dimensional data streams
ADMA'05 Proceedings of the First international conference on Advanced Data Mining and Applications
An incremental data stream clustering algorithm based on dense units detection
PAKDD'05 Proceedings of the 9th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining
Determining the number of clusters using information entropy for mixed data
Pattern Recognition
σ-SCLOPE: clustering categorical streams using attribute selection
KES'05 Proceedings of the 9th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part II
Efficient trade-off between speed processing and accuracy in summarizing data streams
PAKDD'10 Proceedings of the 14th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part II
Discovery and diagnosis of behavioral transitions in patient event streams
ACM Transactions on Management Information Systems (TMIS)
On clustering techniques for change diagnosis in data streams
WebKDD'05 Proceedings of the 7th international conference on Knowledge Discovery on the Web: advances in Web Mining and Web Usage Analysis
Techniques for knowledge acquisition in dynamically changing environments
ACM Transactions on Autonomous and Adaptive Systems (TAAS) - Special section on formal methods in pervasive computing, pervasive adaptation, and self-adaptive systems: Models and algorithms
Clustering distributed data streams in peer-to-peer environments
Information Sciences: an International Journal
A grid-based subspace clustering algorithm for high-dimensional data streams
WISE'06 Proceedings of the 7th international conference on Web Information Systems
Clustering similarity comparison using density profiles
AI'06 Proceedings of the 19th Australian joint conference on Artificial Intelligence: advances in Artificial Intelligence
Clustering transactional data streams
AI'06 Proceedings of the 19th Australian joint conference on Artificial Intelligence: advances in Artificial Intelligence
Expert Systems with Applications: An International Journal
HUE-Stream: evolution-based clustering technique for heterogeneous data streams with uncertainty
ADMA'11 Proceedings of the 7th international conference on Advanced Data Mining and Applications - Volume Part II
Homogeneous and heterogeneous distributed classification for pocket data mining
Transactions on Large-Scale Data- and Knowledge-Centered Systems V
Proceedings of the 27th Annual ACM Symposium on Applied Computing
On pre-processing algorithms for data stream
ICAISC'12 Proceedings of the 11th international conference on Artificial Intelligence and Soft Computing - Volume Part II
On fuzzy clustering of data streams with concept drift
ICAISC'12 Proceedings of the 11th international conference on Artificial Intelligence and Soft Computing - Volume Part II
On resources optimization in fuzzy clustering of data streams
ICAISC'12 Proceedings of the 11th international conference on Artificial Intelligence and Soft Computing - Volume Part II
AnyOut: anytime outlier detection on streaming data
DASFAA'12 Proceedings of the 17th international conference on Database Systems for Advanced Applications - Volume Part I
Objective function-based clustering
Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery
CAMEUD: clustering approach for mining evolving usage data
Proceedings of the Ninth International Workshop on Information Integration on the Web
Clustering categorical data streams
Journal of Computational Methods in Sciences and Engineering
A framework for summarizing and analyzing twitter feeds
Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
A semi-supervised incremental clustering algorithm for streaming data
PAKDD'12 Proceedings of the 16th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part I
Expert Systems with Applications: An International Journal
A density-based clustering structure mining algorithm for data streams
Proceedings of the 1st International Workshop on Big Data, Streams and Heterogeneous Source Mining: Algorithms, Systems, Programming Models and Applications
Stream-dashboard: a framework for mining, tracking and validating clusters in a data stream
Proceedings of the 1st International Workshop on Big Data, Streams and Heterogeneous Source Mining: Algorithms, Systems, Programming Models and Applications
SOStream: self organizing density-based clustering over data stream
MLDM'12 Proceedings of the 8th international conference on Machine Learning and Data Mining in Pattern Recognition
ciForager: Incrementally discovering regions of correlated change in evolving graphs
ACM Transactions on Knowledge Discovery from Data (TKDD)
A weightless neural network-based approach for stream data clustering
IDEAL'12 Proceedings of the 13th international conference on Intelligent Data Engineering and Automated Learning
Socialized ubiquitous personal study: Toward an individualized information portal
Journal of Computer and System Sciences
Exclusive and complete clustering of streams
DEXA'07 Proceedings of the 18th international conference on Database and Expert Systems Applications
Content-based crowd retrieval on the real-time web
Proceedings of the 21st ACM international conference on Information and knowledge management
Maximum margin clustering on evolutionary data
Proceedings of the 21st ACM international conference on Information and knowledge management
A single pass trellis-based algorithm for clustering evolving data streams
DaWaK'12 Proceedings of the 14th international conference on Data Warehousing and Knowledge Discovery
Density-Based projected clustering of data streams
SUM'12 Proceedings of the 6th international conference on Scalable Uncertainty Management
Expressive Query Support for Multidimensional Data in Distributed Hash Tables
UCC '12 Proceedings of the 2012 IEEE/ACM Fifth International Conference on Utility and Cloud Computing
Mining neighbor-based patterns in data streams
Information Systems
Dynamic credit-card fraud profiling
MDAI'12 Proceedings of the 9th international conference on Modeling Decisions for Artificial Intelligence
A single pass algorithm for clustering evolving data streams based on swarm intelligence
Data Mining and Knowledge Discovery
An Internet Framework for Pervasive Sensor Computing
International Journal of Advanced Pervasive and Ubiquitous Computing
ASCCN: Arbitrary Shaped Clustering Method with Compatible Nucleoids
International Journal of Data Warehousing and Mining
FINGERPRINT: Summarizing Cluster Evolution in Dynamic Environments
International Journal of Data Warehousing and Mining
Weighted Fuzzy-Possibilistic C-Means Over Large Data Sets
International Journal of Data Warehousing and Mining
StreamEB: stream edge bundling
GD'12 Proceedings of the 20th international conference on Graph Drawing
Multimedia Tools and Applications
Effectively grouping trajectory streams
NFMCP'12 Proceedings of the First international conference on New Frontiers in Mining Complex Patterns
GCplace: geo-cloud based correlation aware data replica placement
Proceedings of the 28th Annual ACM Symposium on Applied Computing
Novelty detection algorithm for data streams multi-class problems
Proceedings of the 28th Annual ACM Symposium on Applied Computing
Real time processing of data from patient biodevices
HIKM '11 Proceedings of the Fourth Australasian Workshop on Health Informatics and Knowledge Management - Volume 120
Warped K-Means: An algorithm to cluster sequentially-distributed data
Information Sciences: an International Journal
Fast clustering-based anonymization approaches with time constraints for data streams
Knowledge-Based Systems
Sumblr: continuous summarization of evolving tweet streams
Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
Efficient event detection by exploiting crowds
Proceedings of the 7th ACM international conference on Distributed event-based systems
Exploiting online social data in ontology learning for event tracking and emergency response
Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
Clustering navigation sequences to create contexts for guiding code navigation
Journal of Systems and Software
Efficient processing of streaming graphs for evolution-aware clustering
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Online behavior change detection in computer games
Expert Systems with Applications: An International Journal
Journal of Information Science
Clustering cubes with binary dimensions in one pass
Proceedings of the sixteenth international workshop on Data warehousing and OLAP
Data stream clustering: A survey
ACM Computing Surveys (CSUR)
A fast algorithm for clustering with mapreduce
ISNN'13 Proceedings of the 10th international conference on Advances in Neural Networks - Volume Part I
Identifying streaming frequent items in ad hoc time windows
Data & Knowledge Engineering
Clustering spatial data streams for targeted alerting in disaster response
Proceedings of the 4th ACM SIGSPATIAL International Workshop on GeoStreaming
Aggregate profile clustering for telco analytics
Proceedings of the VLDB Endowment
Mining and linking patterns across live data streams and stream archives
Proceedings of the VLDB Endowment
Energy-based function to evaluate data stream clustering
Advances in Data Analysis and Classification
Online fuzzy medoid based clustering algorithms
Neurocomputing
Evolving soft subspace clustering
Applied Soft Computing
Semantic-based QoS management in cloud systems: Current status and future challenges
Future Generation Computer Systems
Incremental entity resolution on rules and data
The VLDB Journal — The International Journal on Very Large Data Bases
Dealing with trajectory streams by clustering and mathematical transforms
Journal of Intelligent Information Systems
On clustering large number of data streams
Intelligent Data Analysis
Data stream dynamic clustering supported by Markov chain isomorphisms
Intelligent Data Analysis
Hi-index | 0.01 |
The clustering problem is a difficult problem for the data stream domain. This is because the large volumes of data arriving in a stream renders most traditional algorithms too inefficient. In recent years, a few one-pass clustering algorithms have been developed for the data stream problem. Although such methods address the scalability issues of the clustering problem, they are generally blind to the evolution of the data and do not address the following issues: (1) The quality of the clusters is poor when the data evolves considerably over time. (2) A data stream clustering algorithm requires much greater functionality in discovering and exploring clusters over different portions of the stream. The widely used practice of viewing data stream clustering algorithms as a class of one-pass clustering algorithms is not very useful from an application point of view. For example, a simple one-pass clustering algorithm over an entire data stream of a few years is dominated by the outdated history of the stream. The exploration of the stream over different time windows can provide the users with a much deeper understanding of the evolving behavior of the clusters. At the same time, it is not possible to simultaneously perform dynamic clustering over all possible time horizons for a data stream of even moderately large volume. This paper discusses a fundamentally different philosophy for data stream clustering which is guided by application-centered requirements. The idea is divide the clustering process into an online component which periodically stores detailed summary statistics and an offine component which uses only this summary statistics. The offine component is utilized by the analyst who can use a wide variety of inputs (such as time horizon or number of clusters) in order to provide a quick understanding of the broad clusters in the data stream. The problems of efficient choice, storage, and use of this statistical data for a fast data stream turns out to be quite tricky. For this purpose, we use the concepts of a pyramidal time frame in conjunction with a microclustering approach. Our performance experiments over a number of real and synthetic data sets illustrate the effectiveness, efficiency, and insights provided by our approach.