A framework for clustering evolving data streams

Authors:
Charu C. Aggarwal;Jiawei Han;Jianyong Wang;Philip S. Yu
Affiliations:
T. J. Watson Resch. Ctr.;UIUC;UIUC;T. J. Watson Resch. Ctr.
Venue:
VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
Year:
2003

Citing 12
Cited 241

Algorithms for clustering data

Algorithms for clustering data
BIRCH: an efficient data clustering method for very large databases

SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
CURE: an efficient clustering algorithm for large databases

SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
OPTICS: ordering points to identify the clustering structure

SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
Hancock: a language for extracting signatures from data streams

Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining
Mining high-speed data streams

Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining
Scalability for clustering algorithms revisited

ACM SIGKDD Explorations Newsletter
Models and issues in data stream systems

Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Efficient and Effective Clustering Methods for Spatial Data Mining

VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
Clustering data streams

FOCS '00 Proceedings of the 41st Annual Symposium on Foundations of Computer Science
A framework for diagnosing changes in evolving data streams

Proceedings of the 2003 ACM SIGMOD international conference on Management of data
Streaming-Data Algorithms for High-Quality Clustering

ICDE '02 Proceedings of the 18th International Conference on Data Engineering

Cost-efficient mining techniques for data streams

ACSW Frontiers '04 Proceedings of the second workshop on Australasian information security, Data Mining and Web Intelligence, and Software Internationalisation - Volume 32
Online event-driven subsequence matching over financial data streams

SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
Incremental and effective data summarization for dynamic hierarchical clustering

SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
MAIDS: mining alarming incidents from data streams

SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
On demand classification of data streams

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Wavelet fuzzy classification for detecting and tracking region outliers in meteorological data

Proceedings of the 12th annual ACM international workshop on Geographic information systems
On Change Diagnosis in Evolving Data Streams

IEEE Transactions on Knowledge and Data Engineering
Ranking a stream of news

WWW '05 Proceedings of the 14th international conference on World Wide Web
Agents and Stream Data Mining: A New Perspective

IEEE Intelligent Systems
Combining proactive and reactive predictions for data streams

Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
Streaming pattern discovery in multiple time-series

VLDB '05 Proceedings of the 31st international conference on Very large data bases
Mining data streams: a review

ACM SIGMOD Record
Generalized Dimension-Reduction Framework for Recent-Biased Time Series Analysis

IEEE Transactions on Knowledge and Data Engineering
Stream Cube: An Architecture for Multi-Dimensional Analysis of Data Streams

Distributed and Parallel Databases
2005 Special Issue: Efficient streaming text clustering

Neural Networks - 2005 Special issue: IJCNN 2005
A Framework for On-Demand Classification of Evolving Data Streams

IEEE Transactions on Knowledge and Data Engineering
A framework for resource-aware knowledge discovery in data streams: a holistic approach with its application to clustering

Proceedings of the 2006 ACM symposium on Applied computing
DSM-PLW: single-pass mining of path traversal patterns over streaming web click-sequences

Computer Networks: The International Journal of Computer and Telecommunications Networking - Web dynamics
Adaptive Clustering for Multiple Evolving Streams

IEEE Transactions on Knowledge and Data Engineering
Adaptive non-linear clustering in data streams

CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
Collaborative filtering in dynamic usage environments

CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
Supporting dynamic migration in tightly coupled grid applications

Proceedings of the 2006 ACM/IEEE conference on Supercomputing
Maintaining stream statistics over multiscale sliding windows

ACM Transactions on Database Systems (TODS)
Can exclusive clustering on streaming data be achieved?

ACM SIGKDD Explorations Newsletter
GridRod: a dynamic runtime scheduler for grid workflows

Proceedings of the 21st annual international conference on Supercomputing
Cell trees: An adaptive synopsis structure for clustering multi-dimensional on-line data streams

Data & Knowledge Engineering
Density-based clustering for real-time stream data

Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Evolutionary spectral clustering by incorporating temporal smoothness

Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
A framework for classification and segmentation of massive audio data streams

Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Anomaly detection in a mobile communication network

Computational & Mathematical Organization Theory
Clustering over Multiple Evolving Streams by Events and Correlations

IEEE Transactions on Knowledge and Data Engineering
Detecting change in data streams

VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
A framework for projected clustering of high dimensional data streams

VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Boolean representation based data-adaptive correlation analysis over time series streams

Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Grid-based subspace clustering over data streams

Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Continuous subspace clustering in streaming time series

Information Systems
A fuzzy approach for interpretation of ubiquitous data stream clustering and its application in road safety

Intelligent Data Analysis - Knowlegde Discovery from Data Streams
Efficient instance-based learning on data streams

Intelligent Data Analysis
A semi-random multiple decision-tree algorithm for mining data streams

Journal of Computer Science and Technology
Approximate mining of maximal frequent itemsets in data streams with different window models

Expert Systems with Applications: An International Journal
Discovering correlated spatio-temporal changes in evolving graphs

Knowledge and Information Systems
A bayesian mixture model with linear regression mixing proportions

Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Incremental tensor analysis: Theory and applications

ACM Transactions on Knowledge Discovery from Data (TKDD)
Summarizing spatial data streams using ClusterHulls

Journal of Experimental Algorithmics (JEA)
Clustering Streaming Time Series Using CBC

ICCS '07 Proceedings of the 7th international conference on Computational Science, Part III: ICCS 2007
A Coding Hierarchy Computing Based Clustering Algorithm

ADMA '07 Proceedings of the 3rd international conference on Advanced Data Mining and Applications
E-Stream: Evolution-Based Technique for Stream Clustering

ADMA '07 Proceedings of the 3rd international conference on Advanced Data Mining and Applications
Efficiently Discovering Recent Frequent Items in Data Streams

SSDBM '08 Proceedings of the 20th international conference on Scientific and Statistical Database Management
Clustering Distributed Sensor Data Streams

ECML PKDD '08 Proceedings of the European conference on Machine Learning and Knowledge Discovery in Databases - Part II
Data Streaming with Affinity Propagation

ECML PKDD '08 Proceedings of the European conference on Machine Learning and Knowledge Discovery in Databases - Part II
Memory efficient subspace clustering for online data streams

IDEAS '08 Proceedings of the 2008 international symposium on Database engineering & applications
Correlation-based load balancing for network intrusion detection and prevention systems

Proceedings of the 4th international conference on Security and privacy in communication netowrks
Incremental clustering of dynamic data streams using connectivity based representative points

Data & Knowledge Engineering
Mining frequent itemsets over data streams using efficient window sliding techniques

Expert Systems with Applications: An International Journal
CONTOUR: an efficient algorithm for discovering discriminating subsequences

Data Mining and Knowledge Discovery
A Scalable Framework For Segmenting Magnetic Resonance Images

Journal of Signal Processing Systems
ODMCA: An adaptive data mining control algorithm in multicarrier networks

Computer Communications
Adaptive correlation analysis in stream time series with sliding windows

Computers & Mathematics with Applications
Efficiently tracing clusters over high-dimensional on-line data streams

Data & Knowledge Engineering
Algorithms for clustering clickstream data

Information Processing Letters
Frequent items in streaming data: An experimental evaluation of the state-of-the-art

Data & Knowledge Engineering
Neighbor-based pattern detection for windows over streaming data

Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
An EM-Based Algorithm for Clustering Data Streams in Sliding Windows

DASFAA '09 Proceedings of the 14th International Conference on Database Systems for Advanced Applications
Efficiently Clustering Probabilistic Data Streams

APWeb/WAIM '09 Proceedings of the Joint International Conferences on Advances in Data and Web Management
Intervention Events Detection and Prediction in Data Streams

APWeb/WAIM '09 Proceedings of the Joint International Conferences on Advances in Data and Web Management
Clustering with Lower Bound on Similarity

PAKDD '09 Proceedings of the 13th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining
A framework for flexible clustering of multiple evolving data streams

International Journal of Advanced Intelligence Paradigms
On exploiting the power of time in data mining

ACM SIGKDD Explorations Newsletter
Online pairing of VoIP conversations

The VLDB Journal — The International Journal on Very Large Data Bases
Stream data clustering based on grid density and attraction

ACM Transactions on Knowledge Discovery from Data (TKDD)
Density-based clustering of data streams at multiple resolutions

ACM Transactions on Knowledge Discovery from Data (TKDD)
Preface: an overview on learning from data streams

New Generation Computing
A holistic approach for resource-aware adaptive data stream mining

New Generation Computing
Clustering over Evolving Data Streams Based on Online Recent-Biased Approximation

Knowledge Acquisition: Approaches, Algorithms and Applications
Distributed and Incremental Clustering Based on Weighted Affinity Propagation

Proceedings of the 2008 conference on STAIRS 2008: Proceedings of the Fourth Starting AI Researchers' Symposium
Stream Clustering Based on Kernel Density Estimation

Proceedings of the 2006 conference on ECAI 2006: 17th European Conference on Artificial Intelligence August 29 -- September 1, 2006, Riva del Garda, Italy
Mining in Large Noisy Domains

Journal of Data and Information Quality (JDIQ)
Incremental spectral clustering by efficiently updating the eigen-system

Pattern Recognition
Clustering data stream: A survey of algorithms

International Journal of Knowledge-based and Intelligent Engineering Systems
Harnessing the strengths of anytime algorithms for constant data streams

Data Mining and Knowledge Discovery
On classification and segmentation of massive audio data streams

Knowledge and Information Systems
Detecting Projected Outliers in High-Dimensional Data Streams

DEXA '09 Proceedings of the 20th International Conference on Database and Expert Systems Applications
Incremental and Adaptive Clustering Stream Data over Sliding Window

DEXA '09 Proceedings of the 20th International Conference on Database and Expert Systems Applications
History Guided Low-Cost Change Detection in Streams

DaWaK '09 Proceedings of the 11th International Conference on Data Warehousing and Knowledge Discovery
Detect and track latent factors with online nonnegative matrix factorization

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
On evolutionary spectral clustering

ACM Transactions on Knowledge Discovery from Data (TKDD)
Online Evaluation of Patterns from Evolving Web Data Streams

WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
Mining data streams with periodically changing distributions

Proceedings of the 18th ACM conference on Information and knowledge management
Cluster based rank query over multidimensional data streams

Proceedings of the 18th ACM conference on Information and knowledge management
Incremental Learning and Memory Consolidation of Whole Body Human Motion Primitives

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
A shared execution strategy for multiple pattern mining requests over streaming data

Proceedings of the VLDB Endowment
Identifying spectrum usage by unknown systems using experiments in machine learning

WCNC'09 Proceedings of the 2009 IEEE conference on Wireless Communications & Networking Conference
C-DenStream: Using Domain Knowledge on a Data Stream

DS '09 Proceedings of the 12th International Conference on Discovery Science
Stream Clustering of Growing Objects

DS '09 Proceedings of the 12th International Conference on Discovery Science
Efficient decision tree construction for mining time-varying data streams

CASCON '09 Proceedings of the 2009 Conference of the Center for Advanced Studies on Collaborative Research
A spike sorting framework using nonparametric detection and incremental clustering

Neurocomputing
Communication-Efficient Privacy-Preserving Clustering

Transactions on Data Privacy
Anomaly intrusion detection by clustering transactional audit streams in a host computer

Information Sciences: an International Journal
Data clustering: 50 years beyond K-means

Pattern Recognition Letters
Approximate trace of grid-based clusters over high dimensional data streams

PAKDD'07 Proceedings of the 11th Pacific-Asia conference on Advances in knowledge discovery and data mining
Flexible selection of wavelet coefficients based on the estimation error of predefined queries

PAKDD'07 Proceedings of the 2007 international conference on Emerging technologies in knowledge discovery and data mining
Connectivity based stream clustering using localised density exemplars

PAKDD'08 Proceedings of the 12th Pacific-Asia conference on Advances in knowledge discovery and data mining
HDG-tree: a structure for clustering high-dimensional data streams

IITA'09 Proceedings of the 3rd international conference on Intelligent information technology application
Clustering high dimensional data streams with representative points

FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 1
Density-based data streams clustering over sliding windows

FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 5
A framework to enforce access control over data streams

ACM Transactions on Information and System Security (TISSEC)
Interactive visual exploration of neighbor-based patterns in data streams

Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
MG-join: detecting phenomena and their correlation in high dimensional data streams

Distributed and Parallel Databases
Detecting outliers on arbitrary data streams using anytime approaches

Proceedings of the First International Workshop on Novel Data Stream Pattern Mining Techniques
Evolutionary clustering using frequent itemsets

Proceedings of the First International Workshop on Novel Data Stream Pattern Mining Techniques
Towards subspace clustering on dynamic data: an incremental version of PreDeCon

Proceedings of the First International Workshop on Novel Data Stream Pattern Mining Techniques
A clustering comparison measure using density profiles and its application to the discovery of alternate clusterings

Data Mining and Knowledge Discovery
SKIF: a data imputation framework for concept drifting data streams

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Text stream clustering algorithm based on adaptive feature selection

Expert Systems with Applications: An International Journal
Discrete wavelet transform-based time series analysis and mining

ACM Computing Surveys (CSUR)
An efficient approach for mining segment-wise intervention rules in time-series streams

WAIM'10 Proceedings of the 11th international conference on Web-age information management
Data selection for exact value acquisition to improve uncertain clustering

WAIM'10 Proceedings of the 11th international conference on Web-age information management
A framework for clustering categorical time-evolving data

IEEE Transactions on Fuzzy Systems
Supporting self-adaptation in streaming data mining applications

IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
Robust ensemble learning for mining noisy data streams

Decision Support Systems
Entity resolution with evolving rules

Proceedings of the VLDB Endowment
Stream engines meet wireless sensor networks: cost-based planning and processing of complex queries in AnduIN

Distributed and Parallel Databases
A clustering algorithm based on matrix over high dimensional data stream

WISM'10 Proceedings of the 2010 international conference on Web information systems and mining
Clustering distributed sensor data streams using local processing and reduced communication

Intelligent Data Analysis - Ubiquitous Knowledge Discovery
Self-adaptive change detection in streaming data with non-stationary distribution

ADMA'10 Proceedings of the 6th international conference on Advanced data mining and applications: Part I
Research of fast SOM clustering for text information

Expert Systems with Applications: An International Journal
Generating associative ripples of relevant information from a variety of data streams by throwing a heuristic stone

Proceedings of the 5th International Conference on Ubiquitous Information Management and Communication
L2GClust: local-to-global clustering of stream sources

Proceedings of the 2011 ACM Symposium on Applied Computing
Efficient decision tree re-alignment for clustering time-changing data streams

From active data management to event-based systems and more
XStreamCluster: an efficient algorithm for streaming XML data clustering

DASFAA'11 Proceedings of the 16th international conference on Database systems for advanced applications - Volume Part I
Precise anytime clustering of noisy sensor data with logarithmic complexity

Proceedings of the Fifth International Workshop on Knowledge Discovery from Sensor Data
A Cluster-Based Context-Tree Model for Multivariate Data Streams with Applications to Anomaly Detection

INFORMS Journal on Computing
An effective evaluation measure for clustering on evolving data streams

Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Approximate kernel k-means: solution to large scale kernel clustering

Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Tracing evolving clusters by subspace and value similarity

PAKDD'11 Proceedings of the 15th Pacific-Asia conference on Advances in knowledge discovery and data mining - Volume Part II
Summarizing cluster evolution in dynamic environments

ICCSA'11 Proceedings of the 2011 international conference on Computational science and its applications - Volume Part II
Quality-driven resource-adaptive data stream mining?

ACM SIGKDD Explorations Newsletter
Density based subspace clustering over dynamic data

SSDBM'11 Proceedings of the 23rd international conference on Scientific and statistical database management
Hierarchical clustering for real-time stream data with noise

SSDBM'11 Proceedings of the 23rd international conference on Scientific and statistical database management
Concurrent semi-supervised learning of data streams

DaWaK'11 Proceedings of the 13th international conference on Data warehousing and knowledge discovery
Online and offline trend cluster discovery in spatially distributed data streams

MSM'10/MUSE'10 Proceedings of the 2010 international conference on Analysis of social media and ubiquitous data
MINETRAC: mining flows for unsupervised analysis & semi-supervised classification

Proceedings of the 23rd International Teletraffic Congress
A clustering algorithm for multiple data streams based on spectral component similarity

Information Sciences: an International Journal
CLUES: a unified framework supporting interactive exploration of density-based clusters in streams

Proceedings of the 20th ACM international conference on Information and knowledge management
Memory-less unsupervised clustering for data streaming by versatile ellipsoidal function

Proceedings of the 20th ACM international conference on Information and knowledge management
The algorithm APT to classify in concurrence of latency and drift

IDA'11 Proceedings of the 10th international conference on Advances in intelligent data analysis X
Summarization and matching of density-based clusters in streaming environments

Proceedings of the VLDB Endowment
A suspicious behaviour detection using a context space model for smart surveillance systems

Computer Vision and Image Understanding
A scalable distributed stream mining system for highway traffic data

PKDD'06 Proceedings of the 10th European conference on Principle and Practice of Knowledge Discovery in Databases
Knowledge-Conscious data clustering

PKDD'06 Proceedings of the 10th European conference on Principle and Practice of Knowledge Discovery in Databases
Efficient mining of emerging events in a dynamic spatiotemporal environment

PAKDD'06 Proceedings of the 10th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining
Shared execution strategy for neighbor-based pattern mining requests over streaming windows

ACM Transactions on Database Systems (TODS)
A single-pass online data mining algorithm combined with control theory with limited memory in dynamic data streams

GCC'05 Proceedings of the 4th international conference on Grid and Cooperative Computing
Generalized projected clustering in high-dimensional data streams

APWeb'06 Proceedings of the 8th Asia-Pacific Web conference on Frontiers of WWW Research and Development
On futuristic query processing in data streams

EDBT'06 Proceedings of the 10th international conference on Advances in Database Technology
Maintaining gaussian mixture models of data streams under block evolution

ICCS'06 Proceedings of the 6th international conference on Computational Science - Volume Part I
Granularity adaptive density estimation and on demand clustering of concept-drifting data streams

DaWaK'06 Proceedings of the 8th international conference on Data Warehousing and Knowledge Discovery
Tracing Evolving Subspace Clusters in Temporal Climate Data

Data Mining and Knowledge Discovery
Incremental clustering for trajectories

DASFAA'10 Proceedings of the 15th international conference on Database Systems for Advanced Applications - Volume Part II
Attribute outlier detection over data streams

DASFAA'10 Proceedings of the 15th international conference on Database Systems for Advanced Applications - Volume Part II
SIC-means: a semi-fuzzy approach for clustering data streams using c-means

ANNPR'10 Proceedings of the 4th IAPR TC3 conference on Artificial Neural Networks in Pattern Recognition
Density-based hierarchical clustering for streaming data

Pattern Recognition Letters
Scalable clustering using graphics processors

WAIM '06 Proceedings of the 7th international conference on Advances in Web-Age Information Management
A grid-based clustering algorithm for high-dimensional data streams

ADMA'05 Proceedings of the First international conference on Advanced Data Mining and Applications
An incremental data stream clustering algorithm based on dense units detection

PAKDD'05 Proceedings of the 9th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining
Determining the number of clusters using information entropy for mixed data

Pattern Recognition
σ-SCLOPE: clustering categorical streams using attribute selection

KES'05 Proceedings of the 9th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part II
Efficient trade-off between speed processing and accuracy in summarizing data streams

PAKDD'10 Proceedings of the 14th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part II
Discovery and diagnosis of behavioral transitions in patient event streams

ACM Transactions on Management Information Systems (TMIS)
On clustering techniques for change diagnosis in data streams

WebKDD'05 Proceedings of the 7th international conference on Knowledge Discovery on the Web: advances in Web Mining and Web Usage Analysis
Techniques for knowledge acquisition in dynamically changing environments

ACM Transactions on Autonomous and Adaptive Systems (TAAS) - Special section on formal methods in pervasive computing, pervasive adaptation, and self-adaptive systems: Models and algorithms
Clustering distributed data streams in peer-to-peer environments

Information Sciences: an International Journal
A grid-based subspace clustering algorithm for high-dimensional data streams

WISE'06 Proceedings of the 7th international conference on Web Information Systems
Clustering similarity comparison using density profiles

AI'06 Proceedings of the 19th Australian joint conference on Artificial Intelligence: advances in Artificial Intelligence
Clustering transactional data streams

AI'06 Proceedings of the 19th Australian joint conference on Artificial Intelligence: advances in Artificial Intelligence
Mining spatio-temporal information on microblogging streams using a density-based online clustering method

Expert Systems with Applications: An International Journal
HUE-Stream: evolution-based clustering technique for heterogeneous data streams with uncertainty

ADMA'11 Proceedings of the 7th international conference on Advanced Data Mining and Applications - Volume Part II
Homogeneous and heterogeneous distributed classification for pocket data mining

Transactions on Large-Scale Data- and Knowledge-Centered Systems V
Learning from concept drifting data streams with unlabeled data

Neurocomputing
A framework for application-driven classification of data streams

Neurocomputing
2012 Special Issue: Enriched topological learning for cluster detection and visualization

Neural Networks
Improving the offline clustering stage of data stream algorithms in scenarios with variable number of clusters

Proceedings of the 27th Annual ACM Symposium on Applied Computing
On pre-processing algorithms for data stream

ICAISC'12 Proceedings of the 11th international conference on Artificial Intelligence and Soft Computing - Volume Part II
On fuzzy clustering of data streams with concept drift

ICAISC'12 Proceedings of the 11th international conference on Artificial Intelligence and Soft Computing - Volume Part II
On resources optimization in fuzzy clustering of data streams

ICAISC'12 Proceedings of the 11th international conference on Artificial Intelligence and Soft Computing - Volume Part II
AnyOut: anytime outlier detection on streaming data

DASFAA'12 Proceedings of the 17th international conference on Database Systems for Advanced Applications - Volume Part I
Objective function-based clustering

Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery
CAMEUD: clustering approach for mining evolving usage data

Proceedings of the Ninth International Workshop on Information Integration on the Web
Clustering categorical data streams

Journal of Computational Methods in Sciences and Engineering
A framework for summarizing and analyzing twitter feeds

Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
A semi-supervised incremental clustering algorithm for streaming data

PAKDD'12 Proceedings of the 16th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part I
Unsupervised and supervised learning to evaluate event relatedness based on content mining from social-media streams

Expert Systems with Applications: An International Journal
A density-based clustering structure mining algorithm for data streams

Proceedings of the 1st International Workshop on Big Data, Streams and Heterogeneous Source Mining: Algorithms, Systems, Programming Models and Applications
Stream-dashboard: a framework for mining, tracking and validating clusters in a data stream

Proceedings of the 1st International Workshop on Big Data, Streams and Heterogeneous Source Mining: Algorithms, Systems, Programming Models and Applications
SOStream: self organizing density-based clustering over data stream

MLDM'12 Proceedings of the 8th international conference on Machine Learning and Data Mining in Pattern Recognition
ciForager: Incrementally discovering regions of correlated change in evolving graphs

ACM Transactions on Knowledge Discovery from Data (TKDD)
A weightless neural network-based approach for stream data clustering

IDEAL'12 Proceedings of the 13th international conference on Intelligent Data Engineering and Automated Learning
Socialized ubiquitous personal study: Toward an individualized information portal

Journal of Computer and System Sciences
Exclusive and complete clustering of streams

DEXA'07 Proceedings of the 18th international conference on Database and Expert Systems Applications
Content-based crowd retrieval on the real-time web

Proceedings of the 21st ACM international conference on Information and knowledge management
Maximum margin clustering on evolutionary data

Proceedings of the 21st ACM international conference on Information and knowledge management
A single pass trellis-based algorithm for clustering evolving data streams

DaWaK'12 Proceedings of the 14th international conference on Data Warehousing and Knowledge Discovery
Density-Based projected clustering of data streams

SUM'12 Proceedings of the 6th international conference on Scalable Uncertainty Management
Expressive Query Support for Multidimensional Data in Distributed Hash Tables

UCC '12 Proceedings of the 2012 IEEE/ACM Fifth International Conference on Utility and Cloud Computing
Mining neighbor-based patterns in data streams

Information Systems
Dynamic credit-card fraud profiling

MDAI'12 Proceedings of the 9th international conference on Modeling Decisions for Artificial Intelligence
A single pass algorithm for clustering evolving data streams based on swarm intelligence

Data Mining and Knowledge Discovery
An Internet Framework for Pervasive Sensor Computing

International Journal of Advanced Pervasive and Ubiquitous Computing
ASCCN: Arbitrary Shaped Clustering Method with Compatible Nucleoids

International Journal of Data Warehousing and Mining
FINGERPRINT: Summarizing Cluster Evolution in Dynamic Environments

International Journal of Data Warehousing and Mining
Weighted Fuzzy-Possibilistic C-Means Over Large Data Sets

International Journal of Data Warehousing and Mining
StreamEB: stream edge bundling

GD'12 Proceedings of the 20th international conference on Graph Drawing
Enriching user search experience by mining social streams with heuristic stones and associative ripples

Multimedia Tools and Applications
Effectively grouping trajectory streams

NFMCP'12 Proceedings of the First international conference on New Frontiers in Mining Complex Patterns
GCplace: geo-cloud based correlation aware data replica placement

Proceedings of the 28th Annual ACM Symposium on Applied Computing
Novelty detection algorithm for data streams multi-class problems

Proceedings of the 28th Annual ACM Symposium on Applied Computing
Real time processing of data from patient biodevices

HIKM '11 Proceedings of the Fourth Australasian Workshop on Health Informatics and Knowledge Management - Volume 120
Warped K-Means: An algorithm to cluster sequentially-distributed data

Information Sciences: an International Journal
Fast clustering-based anonymization approaches with time constraints for data streams

Knowledge-Based Systems
Sumblr: continuous summarization of evolving tweet streams

Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
Efficient event detection by exploiting crowds

Proceedings of the 7th ACM international conference on Distributed event-based systems
Exploiting online social data in ontology learning for event tracking and emergency response

Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
Clustering navigation sequences to create contexts for guiding code navigation

Journal of Systems and Software
Efficient processing of streaming graphs for evolution-aware clustering

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Online behavior change detection in computer games

Expert Systems with Applications: An International Journal
Leveraging microblogging big data with a modified density-based clustering approach for event awareness and topic ranking

Journal of Information Science
Clustering cubes with binary dimensions in one pass

Proceedings of the sixteenth international workshop on Data warehousing and OLAP
Data stream clustering: A survey

ACM Computing Surveys (CSUR)
A fast algorithm for clustering with mapreduce

ISNN'13 Proceedings of the 10th international conference on Advances in Neural Networks - Volume Part I
Identifying streaming frequent items in ad hoc time windows

Data & Knowledge Engineering
Clustering spatial data streams for targeted alerting in disaster response

Proceedings of the 4th ACM SIGSPATIAL International Workshop on GeoStreaming
Aggregate profile clustering for telco analytics

Proceedings of the VLDB Endowment
Mining and linking patterns across live data streams and stream archives

Proceedings of the VLDB Endowment
Energy-based function to evaluate data stream clustering

Advances in Data Analysis and Classification
Online fuzzy medoid based clustering algorithms

Neurocomputing
Evolving soft subspace clustering

Applied Soft Computing
Semantic-based QoS management in cloud systems: Current status and future challenges

Future Generation Computer Systems
Incremental entity resolution on rules and data

The VLDB Journal — The International Journal on Very Large Data Bases
Dealing with trajectory streams by clustering and mathematical transforms

Journal of Intelligent Information Systems
On clustering large number of data streams

Intelligent Data Analysis
Data stream dynamic clustering supported by Markov chain isomorphisms

Intelligent Data Analysis

Quantified Score

Hi-index	0.01

Visualization

Abstract

The clustering problem is a difficult problem for the data stream domain. This is because the large volumes of data arriving in a stream renders most traditional algorithms too inefficient. In recent years, a few one-pass clustering algorithms have been developed for the data stream problem. Although such methods address the scalability issues of the clustering problem, they are generally blind to the evolution of the data and do not address the following issues: (1) The quality of the clusters is poor when the data evolves considerably over time. (2) A data stream clustering algorithm requires much greater functionality in discovering and exploring clusters over different portions of the stream. The widely used practice of viewing data stream clustering algorithms as a class of one-pass clustering algorithms is not very useful from an application point of view. For example, a simple one-pass clustering algorithm over an entire data stream of a few years is dominated by the outdated history of the stream. The exploration of the stream over different time windows can provide the users with a much deeper understanding of the evolving behavior of the clusters. At the same time, it is not possible to simultaneously perform dynamic clustering over all possible time horizons for a data stream of even moderately large volume. This paper discusses a fundamentally different philosophy for data stream clustering which is guided by application-centered requirements. The idea is divide the clustering process into an online component which periodically stores detailed summary statistics and an offine component which uses only this summary statistics. The offine component is utilized by the analyst who can use a wide variety of inputs (such as time horizon or number of clusters) in order to provide a quick understanding of the broad clusters in the data stream. The problems of efficient choice, storage, and use of this statistical data for a fast data stream turns out to be quite tricky. For this purpose, we use the concepts of a pyramidal time frame in conjunction with a microclustering approach. Our performance experiments over a number of real and synthetic data sets illustrate the effectiveness, efficiency, and insights provided by our approach.