Random sampling with a reservoir
ACM Transactions on Mathematical Software (TOMS)
Probabilistic counting algorithms for data base applications
Journal of Computer and System Sciences
Term-weighting approaches in automatic text retrieval
Information Processing and Management: an International Journal
A linear-time probabilistic counting algorithm for database applications
ACM Transactions on Database Systems (TODS)
Randomized algorithms
An effective hash-based algorithm for mining association rules
SIGMOD '95 Proceedings of the 1995 ACM SIGMOD international conference on Management of data
The space complexity of approximating the frequency moments
STOC '96 Proceedings of the twenty-eighth annual ACM symposium on Theory of computing
New sampling-based summary statistics for improving approximate query answers
SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Online association rule mining
SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
Bottom-up computation of sparse and Iceberg CUBE
SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
Mining frequent patterns without candidate generation
SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
WebBase: a repository of Web pages
Proceedings of the 9th international World Wide Web conference on Computer networks : the international journal of computer and telecommunications networking
Efficient computation of Iceberg cubes with complex measures
SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
On computing correlated aggregates over continual data streams
SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Space-efficient online computation of quantile summaries
SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
STOC '01 Proceedings of the thirty-third annual ACM symposium on Theory of computing
New directions in traffic measurement and accounting
IMW '01 Proceedings of the 1st ACM SIGCOMM Workshop on Internet Measurement
Maintaining stream statistics over sliding windows: (extended abstract)
SODA '02 Proceedings of the thirteenth annual ACM-SIAM symposium on Discrete algorithms
Computing Iceberg Queries Efficiently
VLDB '98 Proceedings of the 24th International Conference on Very Large Data Bases
Aqua: A Fast Decision Support Systems Using Approximate Query Answers
VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
Dynamic Maintenance of Wavelet-Based Histograms
VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Surfing Wavelets on Streams: One-Pass Summaries for Approximate Aggregate Queries
Proceedings of the 27th International Conference on Very Large Data Bases
Fast Algorithms for Mining Association Rules in Large Databases
VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
An Efficient Algorithm for Mining Association Rules in Large Databases
VLDB '95 Proceedings of the 21st International Conference on Very Large Data Bases
Sampling Large Databases for Association Rules
VLDB '96 Proceedings of the 22nd International Conference on Very Large Data Bases
Finding Frequent Items in Data Streams
ICALP '02 Proceedings of the 29th International Colloquium on Automata, Languages and Programming
An Approximate L1-Difference Algorithm for Massive Data Streams
FOCS '99 Proceedings of the 40th Annual Symposium on Foundations of Computer Science
Correlating XML data streams using tree-edit distance embeddings
Proceedings of the twenty-second ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
What's hot and what's not: tracking most frequent items dynamically
Proceedings of the twenty-second ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Super-peer-based routing and clustering strategies for RDF-based peer-to-peer networks
WWW '03 Proceedings of the 12th international conference on World Wide Web
Issues in data stream management
ACM SIGMOD Record
Proceedings of the 2003 ACM SIGMOD international conference on Management of data
Proceedings of the 2003 ACM SIGMOD international conference on Management of data
Identifying frequent items in sliding windows over on-line packet streams
Proceedings of the 3rd ACM SIGCOMM conference on Internet measurement
Association Rule Mining in Peer-to-Peer Systems
ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
Efficient data reduction with EASE
Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
Finding recent frequent itemsets adaptively over online data streams
Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
Towards NIC-based intrusion detection
Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
Dynamically maintaining frequent items over a data stream
CIKM '03 Proceedings of the twelfth international conference on Information and knowledge management
estWin: adaptively monitoring the recent change of frequent itemsets over online data streams
CIKM '03 Proceedings of the twelfth international conference on Information and knowledge management
Taming the underlying challenges of reliable multihop routing in sensor networks
Proceedings of the 1st international conference on Embedded networked sensor systems
Understanding the semantics of sensor data
ACM SIGMOD Record
Statistical grid-based clustering over data streams
ACM SIGMOD Record
Cost-efficient mining techniques for data streams
ACSW Frontiers '04 Proceedings of the second workshop on Australasian information security, Data Mining and Web Intelligence, and Software Internationalisation - Volume 32
Continuously Maintaining Quantile Summaries of the Most Recent N Elements over a Data Stream
ICDE '04 Proceedings of the 20th International Conference on Data Engineering
Finding frequent items in data streams
Theoretical Computer Science - Special issue on automata, languages and programming
Detection of complex temporal patterns over data streams
Information Systems - Special issue: ADBIS 2002: Advances in databases and information systems
Deterministic sampling and range counting in geometric data streams
SCG '04 Proceedings of the twentieth annual symposium on Computational geometry
Holistic UDAFs at streaming speeds
SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
Diamond in the rough: finding Hierarchical Heavy Hitters in multi-dimensional data
SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
Spatially-decaying aggregation over a network: model and algorithms
SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
Online identification of hierarchical heavy hitters: algorithms, evaluation, and applications
Proceedings of the 4th ACM SIGCOMM conference on Internet measurement
Reversible sketches for efficient and accurate change detection over network data streams
Proceedings of the 4th ACM SIGCOMM conference on Internet measurement
Medians and beyond: new aggregation techniques for sensor networks
SenSys '04 Proceedings of the 2nd international conference on Embedded networked sensor systems
Finding hot query patterns over an XQuery stream
The VLDB Journal — The International Journal on Very Large Data Bases
Tracking set-expression cardinalities over continuous update streams
The VLDB Journal — The International Journal on Very Large Data Bases
Maintaining Implicated Statistics in Constrained Environments
ICDE '05 Proceedings of the 21st International Conference on Data Engineering
Finding (Recently) Frequent Items in Distributed Data Streams
ICDE '05 Proceedings of the 21st International Conference on Data Engineering
Approximate counts and quantiles over sliding windows
PODS '04 Proceedings of the twenty-third ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
estWin: Online data stream mining of recent frequent itemsets by sliding window method
Journal of Information Science
Duplicate detection in click streams
WWW '05 Proceedings of the 14th international conference on World Wide Web
What's hot and what's not: tracking most frequent items dynamically
ACM Transactions on Database Systems (TODS) - Special Issue: SIGMOD/PODS 2003
XML stream processing using tree-edit distance embeddings
ACM Transactions on Database Systems (TODS) - Special Issue: SIGMOD/PODS 2003
Sampling in dynamic data streams and applications
SCG '05 Proceedings of the twenty-first annual symposium on Computational geometry
Space efficient mining of multigraph streams
Proceedings of the twenty-fourth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Space complexity of hierarchical heavy hitters in multi-dimensional data streams
Proceedings of the twenty-fourth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Sampling algorithms in a stream operator
Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Holistic aggregates in a networked world: distributed tracking of approximate quantiles
Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Tributaries and deltas: efficient and robust aggregation in sensor network streams
Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Fast and approximate stream mining of quantiles and frequencies using graphics processors
Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Agents and Stream Data Mining: A New Perspective
IEEE Intelligent Systems
An improved data stream summary: the count-min sketch and its applications
Journal of Algorithms
Detecting malicious network traffic using inverse distributions of packet contents
Proceedings of the 2005 ACM SIGCOMM workshop on Mining network data
Efficient mining method for retrieving sequential patterns over online data streams
Journal of Information Science
Sketching streams through the net: distributed approximate query tracking
VLDB '05 Proceedings of the 31st international conference on Very large data bases
Summarizing and mining inverse distributions on data streams via dynamic inverse sampling
VLDB '05 Proceedings of the 31st international conference on Very large data bases
Using association rules for fraud detection in web advertising networks
VLDB '05 Proceedings of the 31st international conference on Very large data bases
ACM SIGMOD Record
ICDM '05 Proceedings of the Fifth IEEE International Conference on Data Mining
An Algorithm for In-Core Frequent Itemset Mining on Streaming Data
ICDM '05 Proceedings of the Fifth IEEE International Conference on Data Mining
Finding Maximal Frequent Itemsets over Online Data Streams Adaptively
ICDM '05 Proceedings of the Fifth IEEE International Conference on Data Mining
On Reducing Classifier Granularity in Mining Concept-Drifting Data Streams
ICDM '05 Proceedings of the Fifth IEEE International Conference on Data Mining
Stream Cube: An Architecture for Multi-Dimensional Analysis of Data Streams
Distributed and Parallel Databases
What's new: finding significant differences in network data streams
IEEE/ACM Transactions on Networking (TON)
Research issues in data stream association rule mining
ACM SIGMOD Record
ACM SIGMOD Record
Approximate Processing of Massive Continuous Quantile Queries over High-Speed Data Streams
IEEE Transactions on Knowledge and Data Engineering
Online mining of frequent query trees over XML data streams
Proceedings of the 15th international conference on World Wide Web
Finding global icebergs over distributed data sets
Proceedings of the twenty-fifth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Approximately detecting duplicates for streaming data using stable bloom filters
Proceedings of the 2006 ACM SIGMOD international conference on Management of data
A geometric approach to monitoring threshold functions over distributed data streams
Proceedings of the 2006 ACM SIGMOD international conference on Management of data
CFI-Stream: mining closed frequent itemsets in data streams
Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Data mining middleware for wide-area high-performance networks
Future Generation Computer Systems - IGrid 2005: The global lambda integrated facility
Polymorphic worm detection and defense: system design, experimental methodology, and data resources
Proceedings of the 2006 SIGCOMM workshop on Large-scale attack defense
On biased reservoir sampling in the presence of stream evolution
VLDB '06 Proceedings of the 32nd international conference on Very large data bases
An integrated efficient solution for computing frequent and top-k elements in data streams
ACM Transactions on Database Systems (TODS)
Data streams: algorithms and applications
Foundations and Trends® in Theoretical Computer Science
Proceedings of the 6th ACM SIGCOMM conference on Internet measurement
Mining evolving data streams for frequent patterns
Pattern Recognition
Spatially-decaying aggregation over a network
Journal of Computer and System Sciences
Towards a new approach for mining frequent itemsets on data stream
Journal of Intelligent Information Systems
Deterministic sampling and range counting in geometric data streams
ACM Transactions on Algorithms (TALG)
A priority random sampling algorithm for time-based sliding windows over weighted streaming data
Proceedings of the 2007 ACM symposium on Applied computing
Sketching probabilistic data streams
Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Mining maximal frequent itemsets from data streams
Journal of Information Science
Cell trees: An adaptive synopsis structure for clustering multi-dimensional on-line data streams
Data & Knowledge Engineering
A new deterministic data aggregation method for wireless sensor networks
Signal Processing
Answering ad hoc aggregate queries from data streams using prefix aggregate trees
Knowledge and Information Systems
A geometric approach to monitoring threshold functions over distributed data streams
ACM Transactions on Database Systems (TODS)
Sampling streaming data with replacement
Computational Statistics & Data Analysis
A data streaming algorithm for estimating entropies of OD flows
Proceedings of the 7th ACM SIGCOMM conference on Internet measurement
The history of histograms (abridged)
VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
A regression-based temporal pattern mining scheme for data streams
VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
Finding hierarchical heavy hitters in data streams
VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Remembrance of streams past: overload-sensitive management of archived streams
VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
An on-line interactive method for finding association rules data streams
Proceedings of the sixteenth ACM conference on information and knowledge management
Reversible sketches: enabling monitoring and analysis over high-speed data streams
IEEE/ACM Transactions on Networking (TON)
High-speed detection of unsolicited bulk emails
Proceedings of the 3rd ACM/IEEE Symposium on Architecture for networking and communications systems
Finding hierarchical heavy hitters in streaming data
ACM Transactions on Knowledge Discovery from Data (TKDD)
VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Deterministic algorithms for sampling count data
Data & Knowledge Engineering
Probabilistic lossy counting: an efficient algorithm for finding heavy hitters
ACM SIGCOMM Computer Communication Review
A scalable pattern mining approach to web graph compression with communities
WSDM '08 Proceedings of the 2008 International Conference on Web Search and Data Mining
Incremental maintenance of generalized association rules under taxonomy evolution
Journal of Information Science
A stratified approach to progressive approximate joins
EDBT '08 Proceedings of the 11th international conference on Extending database technology: Advances in database technology
On-line generation association rules over data streams
Information and Software Technology
Approximate continuous querying over distributed streams
ACM Transactions on Database Systems (TODS)
Intelligent Data Analysis - Knowledge Discovery from Data Streams
Approximate mining of frequent patterns on streams
Intelligent Data Analysis - Knowledge Discovery from Data Streams
Processing top k queries from samples
CoNEXT '06 Proceedings of the 2006 ACM CoNEXT conference
An efficient algorithm for mining temporal high utility itemsets from data streams
Journal of Systems and Software
Finding frequent items in probabilistic data
Proceedings of the 2008 ACM SIGMOD international conference on Management of data
Shape sensitive geometric monitoring
Proceedings of the twenty-seventh ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Dynamic adaptive data structures for monitoring data streams
Data & Knowledge Engineering
Discovering frequent sets from data streams with CPU constraint
AusDM '07 Proceedings of the sixth Australasian conference on Data mining and analytics - Volume 70
Interactive mining of frequent itemsets over arbitrary time intervals in a data stream
ADC '08 Proceedings of the nineteenth conference on Australasian database - Volume 75
Mining sequential patterns across time sequences
New Generation Computing
Approximate mining of maximal frequent itemsets in data streams with different window models
Expert Systems with Applications: An International Journal
Mining top-k frequent patterns in the presence of the memory constraint
The VLDB Journal — The International Journal on Very Large Data Bases
The VLDB Journal — The International Journal on Very Large Data Bases
A survey on algorithms for mining frequent itemsets over data streams
Knowledge and Information Systems
Short communication: TOPSIS: Finding Top-K significant N-itemsets in sliding windows adaptively
Knowledge-Based Systems
Online mining of frequent sets in data streams with error guarantee
Knowledge and Information Systems
Processing top-k queries from samples
Computer Networks: The International Journal of Computer and Telecommunications Networking
Summarizing spatial data streams using ClusterHulls
Journal of Experimental Algorithmics (JEA)
FIDS: Monitoring Frequent Items over Distributed Data Streams
MLDM '07 Proceedings of the 5th international conference on Machine Learning and Data Mining in Pattern Recognition
Memory Efficient Algorithm for Mining Recent Frequent Items in a Stream
RSEISP '07 Proceedings of the international conference on Rough Sets and Intelligent Systems Paradigms
DELAY: A Lazy Approach for Mining Frequent Patterns over High Speed Data Streams
ADMA '07 Proceedings of the 3rd international conference on Advanced Data Mining and Applications
Separator: Sifting Hierarchical Heavy Hitters Accurately from Data Streams
ADMA '07 Proceedings of the 3rd international conference on Advanced Data Mining and Applications
Computing Frequent Elements Using Gossip
SIROCCO '08 Proceedings of the 15th international colloquium on Structural Information and Communication Complexity
Finding Frequent Items in a Turnstile Data Stream
COCOON '08 Proceedings of the 14th annual international conference on Computing and Combinatorics
Maintaining the Maximum Normalized Mean and Applications in Data Stream Mining
ADMA '08 Proceedings of the 4th international conference on Advanced Data Mining and Applications
Event-Based Compression and Mining of Data Streams
KES '08 Proceedings of the 12th international conference on Knowledge-Based Intelligent Information and Engineering Systems, Part II
Mining Serial Episode Rules with Time Lags over Multiple Data Streams
DaWaK '08 Proceedings of the 10th international conference on Data Warehousing and Knowledge Discovery
Efficient Approximate Mining of Frequent Patterns over Transactional Data Streams
DaWaK '08 Proceedings of the 10th international conference on Data Warehousing and Knowledge Discovery
Mining Multidimensional Sequential Patterns over Data Streams
DaWaK '08 Proceedings of the 10th international conference on Data Warehousing and Knowledge Discovery
Clustering Distributed Sensor Data Streams
ECML PKDD '08 Proceedings of the European conference on Machine Learning and Knowledge Discovery in Databases - Part II
Adaptive shared-state sampling
Proceedings of the 8th ACM SIGCOMM conference on Internet measurement
DSM-FI: an efficient algorithm for mining frequent itemsets in data streams
Knowledge and Information Systems
Multidimensional content eXploration
Proceedings of the VLDB Endowment
SLEUTH: Single-pubLisher attack dEtection Using correlaTion Hunting
Proceedings of the VLDB Endowment
Finding frequent items in data streams
Proceedings of the VLDB Endowment
Conceptual modeling rules extracting for data streams
Knowledge-Based Systems
CAM conscious integrated answering of frequent elements and top-k queries over data streams
Proceedings of the 4th international workshop on Data management on new hardware
Maintaining frequent closed itemsets over a sliding window
Journal of Intelligent Information Systems
Feature-preserved sampling over streaming data
ACM Transactions on Knowledge Discovery from Data (TKDD)
Mining frequent itemsets over data streams using efficient window sliding techniques
Expert Systems with Applications: An International Journal
Incremental updates of closed frequent itemsets over continuous data streams
Expert Systems with Applications: An International Journal
Multi-query optimization for sketch-based estimation
Information Systems
An Efficient Approach for Analyzing Multidimensional Network Traffic
APNOMS '08 Proceedings of the 11th Asia-Pacific Symposium on Network Operations and Management: Challenges for Next Generation Network Operations and Service Management
Incrementally Mining Recently Repeating Patterns over Data Streams
New Frontiers in Applied Data Mining
Mining frequent closed itemsets from a landmark window over online data streams
Computers & Mathematics with Applications
The design of a query monitoring system
ACM Transactions on Database Systems (TODS)
Efficient query processing on graph databases
ACM Transactions on Database Systems (TODS)
Semantics and implementation of continuous sliding window queries over data streams
ACM Transactions on Database Systems (TODS)
Frequent items in streaming data: An experimental evaluation of the state-of-the-art
Data & Knowledge Engineering
Mining non-derivable frequent itemsets over data stream
Data & Knowledge Engineering
Information processing using data stream management system on Jamdroid
Proceedings of the International Conference on Advances in Computing, Communication and Control
HIDS: a multifunctional generator of hierarchical data streams
ACM SIGMIS Database
A Sliding-Window Approach for Finding Top-k Frequent Itemsets from Uncertain Streams
APWeb/WAIM '09 Proceedings of the Joint International Conferences on Advances in Data and Web Management
Interactive mining of top-K frequent closed itemsets from data streams
Expert Systems with Applications: An International Journal
Spatio-Temporal Sensor Graphs (STSG): A data model for the discovery of spatio-temporal patterns
Intelligent Data Analysis - Knowledge Discovery from Data Streams
Density-based clustering of data streams at multiple resolutions
ACM Transactions on Knowledge Discovery from Data (TKDD)
Data Mining and Knowledge Discovery
Mining frequent itemsets in data streams using the weighted sliding window model
Expert Systems with Applications: An International Journal
Optimal sampling from sliding windows
Proceedings of the twenty-eighth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Space-optimal heavy hitters with strong error bounds
Proceedings of the twenty-eighth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Optimal tracking of distributed heavy hitters and quantiles
Proceedings of the twenty-eighth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
WSFI-Mine: Mining Frequent Patterns in Data Streams
ISNN 2009 Proceedings of the 6th International Symposium on Neural Networks: Advances in Neural Networks - Part II
Finding the frequent items in streams of data
Communications of the ACM - A View of Parallel Computing
Small synopses for group-by query verification on outsourced data streams
ACM Transactions on Database Systems (TODS)
Deterministically Estimating Data Stream Frequencies
COCOA '09 Proceedings of the 3rd International Conference on Combinatorial Optimization and Applications
Journal of Data and Information Quality (JDIQ)
Frequency-based load shedding over a data stream of tuples
Information Sciences: an International Journal
Expert Systems with Applications: An International Journal
A frequent pattern based framework for event detection in sensor network stream data
Proceedings of the Third International Workshop on Knowledge Discovery from Sensor Data
Sliding window-based frequent pattern mining over data streams
Information Sciences: an International Journal
Harnessing the strengths of anytime algorithms for constant data streams
Data Mining and Knowledge Discovery
The Frequent Items Problem, under Polynomial Decay, in the Streaming Model
SPIRE '09 Proceedings of the 16th International Symposium on String Processing and Information Retrieval
Which Is Better for Frequent Pattern Mining: Approximate Counting or Sampling?
DaWaK '09 Proceedings of the 11th International Conference on Data Warehousing and Knowledge Discovery
Multivariable stream data classification using motifs and their temporal relations
Information Sciences: an International Journal
Streaming for large scale NLP: language modeling
NAACL '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Mining frequent itemsets in time-varying data streams
Proceedings of the 18th ACM conference on Information and knowledge management
Probabilistic counting with randomized storage
IJCAI'09 Proceedings of the 21st international joint conference on Artificial intelligence
Thread cooperation in multicore architectures for frequency counting over multiple data streams
Proceedings of the VLDB Endowment
An audit environment for outsourcing of frequent itemset mining
Proceedings of the VLDB Endowment
Mining Local Correlation Patterns in Sets of Sequences
DS '09 Proceedings of the 12th International Conference on Discovery Science
Approximate Frequent Itemset Discovery from Data Stream
AI*IA '09: Proceedings of the XIth International Conference of the Italian Association for Artificial Intelligence Reggio Emilia on Emergent Perspectives in Artificial Intelligence
A heuristic method of finding heavy hitter prefix pairs in IP traffic
IEEE Communications Letters
Methods for finding frequent items in data streams
The VLDB Journal — The International Journal on Very Large Data Bases
Finding frequent items over sliding windows with constant update time
Information Processing Letters
Reducing rule covers with deterministic error bounds
PAKDD'03 Proceedings of the 7th Pacific-Asia conference on Advances in knowledge discovery and data mining
Sampling-based stream mining for network risk management
JSAI'06 Proceedings of the 20th annual conference on New frontiers in artificial intelligence
Discovering correlated items in data streams
PAKDD'07 Proceedings of the 11th Pacific-Asia conference on Advances in knowledge discovery and data mining
Mining disjunctive sequential patterns from news stream
IDEAL'07 Proceedings of the 8th international conference on Intelligent data engineering and automated learning
A robust approach to find effective items in distributed data streams
LSMS'07 Proceedings of the Life system modeling and simulation 2007 international conference on Bio-Inspired computational intelligence and applications
Finding frequent elements in non-bursty streams
ESA'07 Proceedings of the 15th annual European conference on Algorithms
Approximately mining recently representative patterns on data streams
PAKDD'07 Proceedings of the 2007 international conference on Emerging technologies in knowledge discovery and data mining
Finding frequent items in data streams using ESBF
PAKDD'07 Proceedings of the 2007 international conference on Emerging technologies in knowledge discovery and data mining
CLAIM: an efficient method for relaxed frequent closed itemsets mining over stream data
DASFAA'07 Proceedings of the 12th international conference on Database systems for advanced applications
Discovery of frequent distributed event patterns in sensor networks
EWSN'08 Proceedings of the 5th European conference on Wireless sensor networks
Event-based lossy compression for effective and efficient OLAP over data streams
Data & Knowledge Engineering
Aggregate computation over data streams
APWeb'08 Proceedings of the 10th Asia-Pacific web conference on Progress in WWW research and development
Finding heavy hitters over the sliding window of a weighted data stream
LATIN'08 Proceedings of the 8th Latin American conference on Theoretical informatics
Data aggregation in sensor networks: no more a slave to routing
Allerton'09 Proceedings of the 47th annual Allerton conference on Communication, control, and computing
Mining recent approximate frequent items in wireless sensor networks
FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 2
A new algorithm for mining global frequent itemsets in a stream
FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 5
MFISW: a new method for mining frequent itemsets in time and transaction sensitive sliding window
FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 5
Adequacy of data for mining individual friendship pattern from cellular phone call logs
FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 5
A test paradigm for detecting changes in transactional data streams
DASFAA'08 Proceedings of the 13th international conference on Database systems for advanced applications
HiFIND: A high-speed flow-level intrusion detection approach with DoS resiliency
Computer Networks: The International Journal of Computer and Telecommunications Networking
An online framework for catching top spreaders and scanners
Computer Networks: The International Journal of Computer and Telecommunications Networking
Stateful bulk processing for incremental analytics
Proceedings of the 1st ACM symposium on Cloud computing
Mining top-k frequent closed itemsets over data streams using the sliding window model
Expert Systems with Applications: An International Journal
The frequent items problem, under polynomial decay, in the streaming model
Theoretical Computer Science
High-speed per-flow traffic measurement with probabilistic multiplicity counting
INFOCOM'10 Proceedings of the 29th conference on Information communications
Approximating sliding windows by cyclic tree-like histograms for efficient range queries
Data & Knowledge Engineering
Mining top-K frequent itemsets through progressive sampling
Data Mining and Knowledge Discovery
Mining discriminative items in multiple data streams
World Wide Web
Space-optimal heavy hitters with strong error bounds
ACM Transactions on Database Systems (TODS)
Finding top-k elements in data streams
Information Sciences: an International Journal
Sketching techniques for large scale NLP
WAC-6 '10 Proceedings of the NAACL HLT 2010 Sixth Web as Corpus Workshop
Open user schema guided evaluation of streaming RDF queries
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Estimating top-k destinations in data streams
IPMU'10 Proceedings of the Computational intelligence for knowledge-based systems design, and 13th international conference on Information processing and management of uncertainty
Identifying frequent items in a network using gossip
Journal of Parallel and Distributed Computing
Private and continual release of statistics
ICALP'10 Proceedings of the 37th international colloquium conference on Automata, languages and programming: Part II
Hybrid in-memory and on-disk tables for speeding-up table accesses
DEXA'10 Proceedings of the 21st international conference on Database and expert systems applications: Part I
Efficient term cloud generation for streaming web content
ICWE'10 Proceedings of the 10th international conference on Web engineering
Sequential hashing: A flexible approach for unveiling significant patterns in high speed networks
Computer Networks: The International Journal of Computer and Telecommunications Networking
Speed up gradual rule mining from stream data! A B-Tree and OWA-based approach
Journal of Intelligent Information Systems
Mining informative rule set for prediction over a sliding window
ACIIDS'10 Proceedings of the Second international conference on Intelligent information and database systems: Part II
Robust ensemble learning for mining noisy data streams
Decision Support Systems
Lightweight problem determination in DBMSs using data stream analysis techniques
Proceedings of the 2010 Conference of the Center for Advanced Studies on Collaborative Research
Parallelizing weighted frequency counting in high-speed network monitoring
Computer Communications
Clustering distributed sensor data streams using local processing and reduced communication
Intelligent Data Analysis - Ubiquitous Knowledge Discovery
Resource aware distributed knowledge discovery
Ubiquitous knowledge discovery
A geometric approach to monitoring threshold functions over distributed data streams
Ubiquitous knowledge discovery
Discovery of frequent patterns in transactional data streams
Transactions on large-scale data- and knowledge-centered systems II
Finding heavy distinct hitters in data streams
Proceedings of the twenty-third annual ACM symposium on Parallelism in algorithms and architectures
Unsupervised topographic learning for spatiotemporal data mining
Advances in Artificial Intelligence - Special issue on machine learning paradigms for modeling spatial and temporal information in multimedia data mining
Mining hot calling contexts in small space
Proceedings of the 32nd ACM SIGPLAN conference on Programming language design and implementation
Expert Systems with Applications: An International Journal
Mining frequent itemsets over distributed data streams by continuously maintaining a global synopsis
Data Mining and Knowledge Discovery
Space-efficient tracking of persistent items in a massive data stream
Proceedings of the 5th ACM international conference on Distributed event-based system
A generic approach for mining indirect association rules in data streams
IEA/AIE'11 Proceedings of the 24th international conference on Industrial engineering and other applications of applied intelligent systems conference on Modern approaches in applied intelligence - Volume Part I
Mining approximate frequent closed flows over packet streams
DaWaK'11 Proceedings of the 13th international conference on Data warehousing and knowledge discovery
Data-driven modeling and analysis of online social networks
WAIM'11 Proceedings of the 12th international conference on Web-age information management
Private and Continual Release of Statistics
ACM Transactions on Information and System Security (TISSEC)
MHUI-max: An efficient algorithm for discovering high-utility itemsets from data streams
Journal of Information Science
Mining frequent patterns across multiple data streams
Proceedings of the 20th ACM international conference on Information and knowledge management
Proceedings of the 15th Symposium on International Database Engineering & Applications
gSketch: on query estimation in graph streams
Proceedings of the VLDB Endowment
Processing frequent items over distributed data streams
APWeb'05 Proceedings of the 7th Asia-Pacific web conference on Web Technologies Research and Development
Search method of time sensitive frequent itemsets in data streams
CIARP'06 Proceedings of the 11th Iberoamerican conference on Progress in Pattern Recognition, Image Analysis and Applications
A scalable distributed stream mining system for highway traffic data
PKDD'06 Proceedings of the 10th European conference on Principle and Practice of Knowledge Discovery in Databases
Maintaining frequent itemsets over high-speed data streams
PAKDD'06 Proceedings of the 10th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining
Variable support mining of frequent itemsets over data streams using synopsis vectors
PAKDD'06 Proceedings of the 10th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining
Dynamically mining frequent patterns over online data streams
ISPA'05 Proceedings of the Third international conference on Parallel and Distributed Processing and Applications
Proceedings of the 4th International Conference on Ubiquitous Information Management and Communication
Fast approximate wavelet tracking on streams
EDBT'06 Proceedings of the 10th international conference on Advances in Database Technology
On futuristic query processing in data streams
EDBT'06 Proceedings of the 10th international conference on Advances in Database Technology
EStream: online mining of frequent sets with precise error guarantee
DaWaK'06 Proceedings of the 8th international conference on Data Warehousing and Knowledge Discovery
Adaptive load shedding for mining frequent patterns from data streams
DaWaK'06 Proceedings of the 8th international conference on Data Warehousing and Knowledge Discovery
An approximate approach for mining recently frequent itemsets from data streams
DaWaK'06 Proceedings of the 8th international conference on Data Warehousing and Knowledge Discovery
MFIS—Mining frequent itemsets on data streams
ADMA'06 Proceedings of the Second international conference on Advanced Data Mining and Applications
SuffixMiner: efficiently mining frequent itemsets in data streams by suffix-forest
FSKD'05 Proceedings of the Second international conference on Fuzzy Systems and Knowledge Discovery - Volume Part II
Efficient computation of frequent and top-k elements in data streams
ICDT'05 Proceedings of the 10th international conference on Database Theory
Error-adaptive and time-aware maintenance of frequency counts over data streams
WAIM '06 Proceedings of the 7th international conference on Advances in Web-Age Information Management
Statistical supports for frequent itemsets on data streams
MLDM'05 Proceedings of the 4th international conference on Machine Learning and Data Mining in Pattern Recognition
Mining recent frequent itemsets in data streams by radioactively attenuating strategy
ADMA'05 Proceedings of the First international conference on Advanced Data Mining and Applications
User subjectivity in change modeling of streaming itemsets
ADMA'05 Proceedings of the First international conference on Advanced Data Mining and Applications
Mining global association rules on an oracle grid by scanning once distributed databases
Euro-Par'05 Proceedings of the 11th international Euro-Par conference on Parallel Processing
Streams, security and scalability
DBSec'05 Proceedings of the 19th annual IFIP WG 11.3 working conference on Data and Applications Security
Online algorithms for mining inter-stream associations from large sensor networks
PAKDD'05 Proceedings of the 9th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining
Approximate scalable bounded space sketch for large data NLP
EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Adaptive spatial partitioning for multidimensional data streams
ISAAC'04 Proceedings of the 15th international conference on Algorithms and Computation
A dynamic layout of sliding window for frequent itemset mining over data streams
Journal of Systems and Software
False-Negative frequent items mining from data streams with bursting
DASFAA'05 Proceedings of the 10th international conference on Database Systems for Advanced Applications
An efficient algorithm for frequent itemset mining on data streams
ICDM'06 Proceedings of the 6th Industrial Conference on Data Mining conference on Advances in Data Mining: applications in Medicine, Web Mining, Marketing, Image and Signal Mining
A false negative approach to mining frequent itemsets from high speed transactional data streams
Information Sciences: an International Journal
A scalable supervised algorithm for dimensionality reduction on streaming data
Information Sciences: an International Journal
Data stream synopsis using SaintEtiQ
FQAS'06 Proceedings of the 7th international conference on Flexible Query Answering Systems
Mining frequent patterns from dynamic data streams with data load management
Journal of Systems and Software
Secure Distributed Data Aggregation
Foundations and Trends in Databases
A randomized algorithm for finding frequent elements in streams using O(log log n) space
ISAAC'11 Proceedings of the 22nd international conference on Algorithms and Computation
A false negative maximal frequent itemset mining algorithm over stream
ADMA'11 Proceedings of the 7th international conference on Advanced Data Mining and Applications - Volume Part I
Randomized algorithms for tracking distributed count, frequencies, and ranks
PODS '12 Proceedings of the 31st symposium on Principles of Database Systems
Towards a variable size sliding window model for frequent itemset mining over data streams
Computers and Industrial Engineering
Re-optimizing data-parallel computing
NSDI'12 Proceedings of the 9th USENIX conference on Networked Systems Design and Implementation
Efficient mining of frequent items coupled with weight and/or support over progressive databases
ICDEM'10 Proceedings of the Second international conference on Data Engineering and Management
A sliding window-based false-negative approach for ubiquitous data stream analysis
International Journal of Communication Systems
International Journal of Sensor Networks
Non-linear data stream compression: foundations and theoretical results
HAIS'12 Proceedings of the 7th international conference on Hybrid Artificial Intelligent Systems - Volume Part I
Don't let the negatives bring you down: sampling from streams of signed updates
Proceedings of the 12th ACM SIGMETRICS/PERFORMANCE joint international conference on Measurement and Modeling of Computer Systems
Clustering categorical data streams
Journal of Computational Methods in Sciences and Engineering
A framework for summarizing and analyzing twitter feeds
Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
Mining frequent patterns in a varying-size sliding window of online transactional data streams
Information Sciences: an International Journal
Space-efficient sampling from social activity streams
Proceedings of the 1st International Workshop on Big Data, Streams and Heterogeneous Source Mining: Algorithms, Systems, Programming Models and Applications
Computers & Mathematics with Applications
Enhancing source selection for live queries over linked data via query log mining
JIST'11 Proceedings of the 2011 joint international conference on The Semantic Web
Approximate frequency counts over data streams
Proceedings of the VLDB Endowment
Approximate answers to OLAP queries on streaming data warehouses
Proceedings of the fifteenth international workshop on Data warehousing and OLAP
Streaming analysis of discourse participants
EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Sketch algorithms for estimating point queries in NLP
EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
DWFIST: leveraging calendar-based pattern mining in data streams
DaWaK'07 Proceedings of the 9th international conference on Data Warehousing and Knowledge Discovery
Recent frequent itemsets mining over data streams
Proceedings of the Second International Conference on Computational Science, Engineering and Information Technology
CR-PRECIS: a deterministic summary structure for update data streams
ESCAPE'07 Proceedings of the First international conference on Combinatorics, Algorithms, Probabilistic and Experimental Methodologies
Streaming algorithms for data in motion
ESCAPE'07 Proceedings of the First international conference on Combinatorics, Algorithms, Probabilistic and Experimental Methodologies
Extrapolation prefix tree for data stream mining using a landmark model
DaWaK'12 Proceedings of the 14th international conference on Data Warehousing and Knowledge Discovery
Improved counter based algorithms for frequent pairs mining in transactional data streams
ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part I
Fast, Scalable, and Context-Sensitive Detection of Trending Topics in Microblog Post Streams
ACM Transactions on Management Information Systems (TMIS)
Responding rapidly to service level violations using virtual appliances
ACM SIGOPS Operating Systems Review
Streaming trend detection in Twitter
International Journal of Web Based Communities
Optimizing adaptive multi-route query processing via time-partitioned indices
Journal of Computer and System Sciences
A sliding window based algorithm for frequent closed itemset mining over data streams
Journal of Systems and Software
Incremental Algorithm for Discovering Frequent Subsequences in Multiple Data Streams
International Journal of Data Warehousing and Mining
Scalable identification and measurement of heavy-hitters
Computer Communications
When Is the Right Time to Refresh Knowledge Discovered from Data?
Operations Research
ProFID: Practical frequent items discovery in peer-to-peer networks
Future Generation Computer Systems
Real time processing of data from patient biodevices
HIKM '11 Proceedings of the Fourth Australasian Workshop on Health Informatics and Knowledge Management - Volume 120
High throughput heavy hitter aggregation for modern SIMD processors
Proceedings of the Ninth International Workshop on Data Management on New Hardware
Pushing constraints into data streams
Proceedings of the 2nd International Workshop on Big Data, Streams and Heterogeneous Source Mining: Algorithms, Systems, Programming Models and Applications
Evaluation of RDF queries via equivalence
Frontiers of Computer Science: Selected Publications from Chinese Universities
Spreader classification based on optimal dynamic bit sharing
IEEE/ACM Transactions on Networking (TON)
Sketch-based geometric monitoring of distributed stream queries
Proceedings of the VLDB Endowment
A lossy counting based approach for learning on streams of graphs on a budget
IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
A methodological overview on anomaly detection
Data Traffic Monitoring and Analysis
Stream mining on univariate uncertain data
Applied Intelligence
Adaptive stratified reservoir sampling over heterogeneous data streams
Information Systems
Mining frequent items in data stream using time fading model
Information Sciences: an International Journal
Efficient frequent itemset mining methods over time-sensitive streams
Knowledge-Based Systems
Mining frequent itemsets in data streams within a time horizon
Data & Knowledge Engineering
Mining top-k frequent patterns over data streams sliding window
Journal of Intelligent Information Systems
A similarity-based approach for data stream classification
Expert Systems with Applications: An International Journal
We present algorithms for computing frequency counts exceeding a user-specified threshold over data streams. Our algorithms are simple and have provably small memory footprints. Although the output is approximate, the error is guaranteed not to exceed a user-specified parameter. Our algorithms can easily be deployed for streams of singleton items, like those found in IP network monitoring. We can also handle streams of variable-sized sets of items, exemplified by a sequence of market basket transactions at a retail store. For such streams, we describe an optimized implementation to compute frequent itemsets in a single pass.
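The abstract does not spell out the algorithms, so the following is only a minimal sketch of one counter-based technique for this problem, commonly known as lossy counting, applied to a stream of singleton items. It is an illustration under stated assumptions, not the authors' implementation: the function names `lossy_count` and `frequent_items`, and the parameter names `epsilon` (the error parameter) and `support` (the frequency threshold), are hypothetical.

```python
import math


def lossy_count(stream, epsilon):
    """One pass over `stream`; returns ({item: [count, delta]}, n).

    Assumption: the stream is split into buckets of width ceil(1/epsilon),
    and rarely seen items are pruned at each bucket boundary.
    """
    width = math.ceil(1 / epsilon)
    entries = {}  # item -> [observed count, maximum possible undercount]
    n = 0
    for item in stream:
        n += 1
        bucket = math.ceil(n / width)  # id of the current bucket
        if item in entries:
            entries[item][0] += 1
        else:
            # A new item may have been seen and pruned in earlier buckets.
            entries[item] = [1, bucket - 1]
        if n % width == 0:
            # Bucket boundary: keep only entries with count + delta > bucket.
            entries = {k: v for k, v in entries.items() if v[0] + v[1] > bucket}
    return entries, n


def frequent_items(entries, n, support, epsilon):
    """Items whose stored count is at least (support - epsilon) * n."""
    return {k for k, (count, _) in entries.items() if count >= (support - epsilon) * n}


if __name__ == "__main__":
    # Ten buckets of ten items: "a" five times, "b" three times, two one-off items.
    stream = []
    for i in range(10):
        stream += ["a"] * 5 + ["b"] * 3 + [f"x{i}", f"y{i}"]
    entries, n = lossy_count(stream, epsilon=0.1)
    print(frequent_items(entries, n, support=0.2, epsilon=0.1))
```

In this sketch each stored count undercounts the true frequency by at most `epsilon * n`, which is one way to realize the kind of bounded error the abstract describes. Lowering `epsilon` tightens the error at the cost of keeping more entries in memory.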