Probabilistic counting algorithms for data base applications
Journal of Computer and System Sciences
Algorithms for clustering data
Algorithms for clustering data
Optimal algorithms for approximate clustering
STOC '88 Proceedings of the twentieth annual ACM symposium on Theory of computing
e-approximations with minimum packing constraint violation (extended abstract)
STOC '92 Proceedings of the twenty-fourth annual ACM symposium on Theory of computing
Approximation algorithms for geometric median problems
Information Processing Letters
BIRCH: an efficient data clustering method for very large databases
SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
The space complexity of approximating the frequency moments
STOC '96 Proceedings of the twenty-eighth annual ACM symposium on Theory of computing
Approximation algorithms for facility location problems (extended abstract)
STOC '97 Proceedings of the twenty-ninth annual ACM symposium on Theory of computing
Incremental clustering and dynamic information retrieval
STOC '97 Proceedings of the twenty-ninth annual ACM symposium on Theory of computing
CURE: an efficient clustering algorithm for large databases
SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Automatic subspace clustering of high dimensional data for data mining applications
SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Approximate medians and other quantiles in one pass and with limited memory
SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Approximation schemes for Euclidean k-medians and related problems
STOC '98 Proceedings of the thirtieth annual ACM symposium on Theory of computing
Randomized query processing in robot path planning
Journal of Computer and System Sciences
A constant-factor approximation algorithm for the k-median problem (extended abstract)
STOC '99 Proceedings of the thirty-first annual ACM symposium on Theory of computing
Sublinear time algorithms for metric space problems
STOC '99 Proceedings of the thirty-first annual ACM symposium on Theory of computing
Subquadratic approximation algorithms for clustering problems in high dimensional spaces
STOC '99 Proceedings of the thirty-first annual ACM symposium on Theory of computing
OPTICS: ordering points to identify the clustering structure
SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
Clustering in large graphs and matrices
Proceedings of the tenth annual ACM-SIAM symposium on Discrete algorithms
Greedy strikes back: improved facility location algorithms
Proceedings of the ninth annual ACM-SIAM symposium on Discrete algorithms
Synopsis data structures for massive data sets
Proceedings of the tenth annual ACM-SIAM symposium on Discrete algorithms
Towards estimation error guarantees for distinct values
PODS '00 Proceedings of the nineteenth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Approximation algorithms for projective clustering
SODA '00 Proceedings of the eleventh annual ACM-SIAM symposium on Discrete algorithms
Data mining: concepts and techniques
Data mining: concepts and techniques
Scalability for clustering algorithms revisited
ACM SIGKDD Explorations Newsletter
Sublinear time approximate clustering
SODA '01 Proceedings of the twelfth annual ACM-SIAM symposium on Discrete algorithms
Space-efficient online computation of quantile summaries
SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Approximating min-sum k-clustering in metric spaces
STOC '01 Proceedings of the thirty-third annual ACM symposium on Theory of computing
Local search heuristic for k-median and facility location problems
STOC '01 Proceedings of the thirty-third annual ACM symposium on Theory of computing
STOC '01 Proceedings of the thirty-third annual ACM symposium on Theory of computing
Fast computation of low rank matrix approximations
STOC '01 Proceedings of the thirty-third annual ACM symposium on Theory of computing
Approximation algorithms
Near-optimal sparse fourier representations via sampling
STOC '02 Proceedings of the thiry-fourth annual ACM symposium on Theory of computing
Fast, small-space algorithms for approximate histogram maintenance
STOC '02 Proceedings of the thiry-fourth annual ACM symposium on Theory of computing
A new greedy approach for facility location problems
STOC '02 Proceedings of the thiry-fourth annual ACM symposium on Theory of computing
Sampling from a moving window over streaming data
SODA '02 Proceedings of the thirteenth annual ACM-SIAM symposium on Discrete algorithms
Maintaining stream statistics over sliding windows: (extended abstract)
SODA '02 Proceedings of the thirteenth annual ACM-SIAM symposium on Discrete algorithms
A Monte Carlo algorithm for fast projective clustering
Proceedings of the 2002 ACM SIGMOD international conference on Management of data
Dynamic multidimensional histograms
Proceedings of the 2002 ACM SIGMOD international conference on Management of data
Mathematical Programming in Data Mining
Data Mining and Knowledge Discovery
DEMON: Mining and Monitoring Evolving Data
IEEE Transactions on Knowledge and Data Engineering
WaveCluster: A Multi-Resolution Clustering Approach for Very Large Spatial Databases
VLDB '98 Proceedings of the 24rd International Conference on Very Large Data Bases
Optimal Grid-Clustering: Towards Breaking the Curse of Dimensionality in High-Dimensional Clustering
VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
Efficient and Effective Clustering Methods for Spatial Data Mining
VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
Sampling-Based Estimation of the Number of Distinct Values of an Attribute
VLDB '95 Proceedings of the 21th International Conference on Very Large Data Bases
A Statistical Method for Profiling Network Traffic
Proceedings of the Workshop on Intrusion Detection and Network Monitoring
STING: A Statistical Information Grid Approach to Spatial Data Mining
VLDB '97 Proceedings of the 23rd International Conference on Very Large Data Bases
A Nearly Linear-Time Approximation Scheme for the Euclidean kappa-median Problem
ESA '99 Proceedings of the 7th Annual European Symposium on Algorithms
Fast Monte-Carlo Algorithms for finding low-rank approximations
FOCS '98 Proceedings of the 39th Annual Symposium on Foundations of Computer Science
Improved Combinatorial Algorithms for the Facility Location and k-Median Problems
FOCS '99 Proceedings of the 40th Annual Symposium on Foundations of Computer Science
Primal-Dual Approximation Algorithms for Metric Facility Location and k-Median Problems
FOCS '99 Proceedings of the 40th Annual Symposium on Foundations of Computer Science
An Approximate L1-Difference Algorithm for Massive Data Streams
FOCS '99 Proceedings of the 40th Annual Symposium on Foundations of Computer Science
A Sublinear Time Approximation Scheme for Clustering in Metric Spaces
FOCS '99 Proceedings of the 40th Annual Symposium on Foundations of Computer Science
On clusterings-good, bad and spectral
FOCS '00 Proceedings of the 41st Annual Symposium on Foundations of Computer Science
FOCS '00 Proceedings of the 41st Annual Symposium on Foundations of Computer Science
Polynomial time approximation schemes for geometric k-clustering
FOCS '00 Proceedings of the 41st Annual Symposium on Foundations of Computer Science
FOCS '00 Proceedings of the 41st Annual Symposium on Foundations of Computer Science
Stable distributions, pseudorandom generators, embeddings and data stream computation
FOCS '00 Proceedings of the 41st Annual Symposium on Foundations of Computer Science
FOCS '01 Proceedings of the 42nd IEEE symposium on Foundations of Computer Science
Streaming-Data Algorithms for High-Quality Clustering
ICDE '02 Proceedings of the 18th International Conference on Data Engineering
Approximating a Data Stream for Querying and Estimation: Algorithms and Performance Evaluation
ICDE '02 Proceedings of the 18th International Conference on Data Engineering
How to summarize the universe: dynamic maintenance of quantiles
VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
Optimal time bounds for approximate clustering
UAI'02 Proceedings of the Eighteenth conference on Uncertainty in artificial intelligence
Modeling and clustering of photo capture streams
MIR '03 Proceedings of the 5th ACM SIGMM international workshop on Multimedia information retrieval
A New Conceptual Clustering Framework
Machine Learning
Finding hot query patterns over an XQuery stream
The VLDB Journal — The International Journal on Very Large Data Bases
AutoLag: Automatic Discovery of Lag Correlations in Stream Data
ICDE '05 Proceedings of the 21st International Conference on Data Engineering
WWW '05 Proceedings of the 14th international conference on World Wide Web
BRAID: stream mining through group lag correlations
Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Streaming pattern discovery in multiple time-series
VLDB '05 Proceedings of the 31st international conference on Very large data bases
ACM SIGMOD Record
Applications of knowledge discovery
IEA/AIE'2005 Proceedings of the 18th international conference on Innovations in Applied Artificial Intelligence
2005 Special Issue: Efficient streaming text clustering
Neural Networks - 2005 Special issue: IJCNN 2005
Evaluating the intrinsic dimension of evolving data streams
Proceedings of the 2006 ACM symposium on Applied computing
Detecting and tracking regional outliers in meteorological data
Information Sciences: an International Journal
Supervised clustering of streaming data for email batch detection
Proceedings of the 24th international conference on Machine learning
Adaptive similarity search in streaming time series with sliding windows
Data & Knowledge Engineering
Density-based clustering for real-time stream data
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
k-means++: the advantages of careful seeding
SODA '07 Proceedings of the eighteenth annual ACM-SIAM symposium on Discrete algorithms
Anomaly detection in a mobile communication network
Computational & Mathematical Organization Theory
Compressing large boolean matrices using reordering techniques
VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Boolean representation based data-adaptive correlation analysis over time series streams
Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Continuous subspace clustering in streaming time series
Information Systems
Intelligent Data Analysis - Knowlegde Discovery from Data Streams
Discovering frequent sets from data streams with CPU constraint
AusDM '07 Proceedings of the sixth Australasian conference on Data mining and analytics - Volume 70
Incremental tensor analysis: Theory and applications
ACM Transactions on Knowledge Discovery from Data (TKDD)
Summarizing spatial data streams using ClusterHulls
Journal of Experimental Algorithmics (JEA)
ACM SIGKDD Explorations Newsletter
Clustering Streaming Time Series Using CBC
ICCS '07 Proceedings of the 7th international conference on Computational Science, Part III: ICCS 2007
E-Stream: Evolution-Based Technique for Stream Clustering
ADMA '07 Proceedings of the 3rd international conference on Advanced Data Mining and Applications
Online Outlier Detection Based on Relative Neighbourhood Dissimilarity
WISE '08 Proceedings of the 9th international conference on Web Information Systems Engineering
Continuous Trend-Based Clustering in Data Streams
DaWaK '08 Proceedings of the 10th international conference on Data Warehousing and Knowledge Discovery
Data Streaming with Affinity Propagation
ECML PKDD '08 Proceedings of the European conference on Machine Learning and Knowledge Discovery in Databases - Part II
A dynamic data granulation through adjustable fuzzy clustering
Pattern Recognition Letters
SNIF TOOL: sniffing for patterns in continuous streams
Proceedings of the 17th ACM conference on Information and knowledge management
Incremental clustering of dynamic data streams using connectivity based representative points
Data & Knowledge Engineering
Finding cohesive clusters for analyzing knowledge communities
Knowledge and Information Systems
A Scalable Framework For Segmenting Magnetic Resonance Images
Journal of Signal Processing Systems
Adaptive correlation analysis in stream time series with sliding windows
Computers & Mathematics with Applications
Semantics and implementation of continuous sliding window queries over data streams
ACM Transactions on Database Systems (TODS)
Efficiently tracing clusters over high-dimensional on-line data streams
Data & Knowledge Engineering
Tight results for clustering and summarizing data streams
Proceedings of the 12th International Conference on Database Theory
Neighbor-based pattern detection for windows over streaming data
Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
A method for clustering transient data streams
Proceedings of the 2009 ACM symposium on Applied Computing
Online pairing of VoIP conversations
The VLDB Journal — The International Journal on Very Large Data Bases
Stream data clustering based on grid density and attraction
ACM Transactions on Knowledge Discovery from Data (TKDD)
Density-based clustering of data streams at multiple resolutions
ACM Transactions on Knowledge Discovery from Data (TKDD)
Measuring evolving data streams' behavior through their intrinsic dimension
New Generation Computing
BBM: bayesian browsing model from petabyte-scale data
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Toward autonomic grids: analyzing the job flow with affinity streaming
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Optimal sampling from sliding windows
Proceedings of the twenty-eighth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Combining Multiple Interrelated Streams for Incremental Clustering
SSDBM 2009 Proceedings of the 21st International Conference on Scientific and Statistical Database Management
Efficient Clustering of Web-Derived Data Sets
MLDM '09 Proceedings of the 6th International Conference on Machine Learning and Data Mining in Pattern Recognition
An Approach to Web-Scale Named-Entity Disambiguation
MLDM '09 Proceedings of the 6th International Conference on Machine Learning and Data Mining in Pattern Recognition
Clustering data stream: A survey of algorithms
International Journal of Knowledge-based and Intelligent Engineering Systems
Adaptive Sampling for k-Means Clustering
APPROX '09 / RANDOM '09 Proceedings of the 12th International Workshop and 13th International Workshop on Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques
Incremental and Adaptive Clustering Stream Data over Sliding Window
DEXA '09 Proceedings of the 20th International Conference on Database and Expert Systems Applications
Efficient Pruning Schemes for Distance-Based Outlier Detection
ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part II
Cluster-Swap: A Distributed K-median Algorithm for Sensor Networks
WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 02
Streamed learning: one-pass SVMs
IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
HE-Tree: a framework for detecting changes in clustering structure for categorical data streams
The VLDB Journal — The International Journal on Very Large Data Bases
C-DenStream: Using Domain Knowledge on a Data Stream
DS '09 Proceedings of the 12th International Conference on Discovery Science
Stream Clustering of Growing Objects
DS '09 Proceedings of the 12th International Conference on Discovery Science
Efficient decision tree construction for mining time-varying data streams
CASCON '09 Proceedings of the 2009 Conference of the Center for Advanced Studies on Collaborative Research
Mining fuzzy frequent itemsets for hierarchical document clustering
Information Processing and Management: an International Journal
Communication-Efficient Privacy-Preserving Clustering
Transactions on Data Privacy
Analyzing knowledge communities using foreground and background clusters
ACM Transactions on Knowledge Discovery from Data (TKDD)
Anomaly intrusion detection by clustering transactional audit streams in a host computer
Information Sciences: an International Journal
Data clustering: 50 years beyond K-means
Pattern Recognition Letters
On the complexity of approximation streaming algorithms for the k-center problem
FAW'07 Proceedings of the 1st annual international conference on Frontiers in algorithmics
Continuous medoid queries over moving objects
SSTD'07 Proceedings of the 10th international conference on Advances in spatial and temporal databases
Event-based lossy compression for effective and efficient OLAP over data streams
Data & Knowledge Engineering
A new algorithm for mining global frequent itemsets in a stream
FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 5
Density-based data streams clustering over sliding windows
FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 5
Scalable Clustering for Mining Local-Correlated Clusters in High Dimensions and Large Datasets
Fundamenta Informaticae - Intelligent Data Analysis in Granular Computing
MG-join: detecting phenomena and their correlation in high dimensional data streams
Distributed and Parallel Databases
Towards subspace clustering on dynamic data: an incremental version of PreDeCon
Proceedings of the First International Workshop on Novel Data Stream Pattern Mining Techniques
Discovery of significant emerging trends
Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
A data placement strategy in scientific cloud workflows
Future Generation Computer Systems
Texture analysis in quantitative osteoporosis assessment: characterizing microarchitecture
ISBI'10 Proceedings of the 2010 IEEE international conference on Biomedical imaging: from nano to Macro
Bayesian Browsing Model: Exact Inference of Document Relevance from Petabyte-Scale Data
ACM Transactions on Knowledge Discovery from Data (TKDD)
Data & Knowledge Engineering
Fast Discovery of Group Lag Correlations in Streams
ACM Transactions on Knowledge Discovery from Data (TKDD)
Describing data with the support vector shell in distributed environments
ICDM'10 Proceedings of the 10th industrial conference on Advances in data mining: applications and theoretical aspects
Discrete wavelet transform-based time series analysis and mining
ACM Computing Surveys (CSUR)
A time-efficient pattern reduction algorithm for k-means clustering
Information Sciences: an International Journal
Robust ensemble learning for mining noisy data streams
Decision Support Systems
Evolutionary FCMAC-BYY applied to stream data analysis
SEAL'10 Proceedings of the 8th international conference on Simulated evolution and learning
Online and incremental algorithms for facility location
ACM SIGACT News
Proceedings of the 5th International Conference on Ubiquitous Information Management and Communication
Efficient decision tree re-alignment for clustering time-changing data streams
From active data management to event-based systems and more
Memoryless facility location in one pass
ACM Transactions on Algorithms (TALG)
Dynamic hierarchical triangulation of a clustered data stream
Computers & Geosciences
Fast clustering using MapReduce
Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Approximate kernel k-means: solution to large scale kernel clustering
Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Density based subspace clustering over dynamic data
SSDBM'11 Proceedings of the 23rd international conference on Scientific and statistical database management
A clustering algorithm for multiple data streams based on spectral component similarity
Information Sciences: an International Journal
WSEAS TRANSACTIONS on COMMUNICATIONS
Optimal sampling from sliding windows
Journal of Computer and System Sciences
Anomaly intrusion detection based on clustering a data stream
ISC'06 Proceedings of the 9th international conference on Information Security
Memoryless facility location in one pass
STACS'06 Proceedings of the 23rd Annual conference on Theoretical Aspects of Computer Science
TWStream: finding correlated data streams under time warping
APWeb'06 Proceedings of the 8th Asia-Pacific Web conference on Frontiers of WWW Research and Development
Maintaining gaussian mixture models of data streams under block evolution
ICCS'06 Proceedings of the 6th international conference on Computational Science - Volume Part I
Attribute outlier detection over data streams
DASFAA'10 Proceedings of the 15th international conference on Database Systems for Advanced Applications - Volume Part II
SIC-means: a semi-fuzzy approach for clustering data streams using c-means
ANNPR'10 Proceedings of the 4th IAPR TC3 conference on Artificial Neural Networks in Pattern Recognition
Parallelized kernel patch clustering
ANNPR'10 Proceedings of the 4th IAPR TC3 conference on Artificial Neural Networks in Pattern Recognition
Streaming k-means on well-clusterable data
Proceedings of the twenty-second annual ACM-SIAM symposium on Discrete Algorithms
Scalable clustering using graphics processors
WAIM '06 Proceedings of the 7th international conference on Advances in Web-Age Information Management
A grid-based clustering algorithm for high-dimensional data streams
ADMA'05 Proceedings of the First international conference on Advanced Data Mining and Applications
A brief observation-centric analysis on anomaly-based intrusion detection
ISPEC'05 Proceedings of the First international conference on Information Security Practice and Experience
Continuous trend-based classification of streaming time series
ADBIS'05 Proceedings of the 9th East European conference on Advances in Databases and Information Systems
Dense subgraph maintenance under streaming edge weight updates for real-time story identification
Proceedings of the VLDB Endowment
Clustering transactional data streams
AI'06 Proceedings of the 19th Australian joint conference on Artificial Intelligence: advances in Artificial Intelligence
Proceedings of the VLDB Endowment
Expert Systems with Applications: An International Journal
Mining databases and data streams with query languages and rules
KDID'05 Proceedings of the 4th international conference on Knowledge Discovery in Inductive Databases
StreamKM++: A clustering algorithm for data streams
Journal of Experimental Algorithmics (JEA)
Modeling the flow and change of information on the web
Proceedings of the 21st international conference companion on World Wide Web
Non-linear data stream compression: foundations and theoretical results
HAIS'12 Proceedings of the 7th international conference on Hybrid Artificial Intelligent Systems - Volume Part I
On resources optimization in fuzzy clustering of data streams
ICAISC'12 Proceedings of the 11th international conference on Artificial Intelligence and Soft Computing - Volume Part II
Property preserving symmetric encryption
EUROCRYPT'12 Proceedings of the 31st Annual international conference on Theory and Applications of Cryptographic Techniques
Clustering categorical data streams
Journal of Computational Methods in Sciences and Engineering
Expert Systems with Applications: An International Journal
A density-based clustering structure mining algorithm for data streams
Proceedings of the 1st International Workshop on Big Data, Streams and Heterogeneous Source Mining: Algorithms, Systems, Programming Models and Applications
Clustering data stream by a sub-window approach using DCA
MLDM'12 Proceedings of the 8th international conference on Machine Learning and Data Mining in Pattern Recognition
Socialized ubiquitous personal study: Toward an individualized information portal
Journal of Computer and System Sciences
SC '12 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
A single pass trellis-based algorithm for clustering evolving data streams
DaWaK'12 Proceedings of the 14th international conference on Data Warehousing and Knowledge Discovery
DS-means: distributed data stream clustering
Euro-Par'12 Proceedings of the 18th international conference on Parallel Processing
Data stability in clustering: a closer look
ALT'12 Proceedings of the 23rd international conference on Algorithmic Learning Theory
Mining neighbor-based patterns in data streams
Information Systems
A single pass algorithm for clustering evolving data streams based on swarm intelligence
Data Mining and Knowledge Discovery
Deterministic sublinear-time approximations for metric 1-median selection
Information Processing Letters
Multimedia Tools and Applications
When Is the Right Time to Refresh Knowledge Discovered from Data?
Operations Research
Real time processing of data from patient biodevices
HIKM '11 Proceedings of the Fourth Australasian Workshop on Health Informatics and Knowledge Management - Volume 120
Warped K-Means: An algorithm to cluster sequentially-distributed data
Information Sciences: an International Journal
Journal of Information Science
Clustering cubes with binary dimensions in one pass
Proceedings of the sixteenth international workshop on Data warehousing and OLAP
Data stream clustering: A survey
ACM Computing Surveys (CSUR)
Intent capturing through multimodal inputs
HCI'13 Proceedings of the 15th international conference on Human-Computer Interaction: interaction modalities and techniques - Volume Part IV
Energy-based function to evaluate data stream clustering
Advances in Data Analysis and Classification
Online fuzzy medoid based clustering algorithms
Neurocomputing
Evolving soft subspace clustering
Applied Soft Computing
Semi-supervised clustering of large data sets with kernel methods
Pattern Recognition Letters
On approximating metric 1-median in sublinear time
Information Processing Letters
On clustering large number of data streams
Intelligent Data Analysis
Hi-index | 0.00 |
The data stream model has recently attracted attention for its applicability to numerous types of data, including telephone records, Web documents, and clickstreams. For analysis of such data, the ability to process the data in a single pass, or a small number of passes, while using little memory, is crucial. We describe such a streaming algorithm that effectively clusters large data streams. We also provide empirical evidence of the algorithm's performance on synthetic and real data streams.