In many applications, from telephone fraud detection to network management, data arrives in a stream, and statistical summary information about a large number of customers must be maintained online. At present, such applications maintain basic aggregates such as running extrema (MIN, MAX), averages, and standard deviations, which can be computed over data streams in limited space in a straightforward way. However, many applications require more complex aggregates that relate different attributes, so-called correlated aggregates. For example, one might want to compute the percentage of international phone calls that are longer than the average duration of a domestic phone call. Computing this aggregate exactly requires multiple passes over the data stream, which is infeasible.

We propose single-pass techniques for approximate computation of correlated aggregates over both landmark and sliding window views of a data stream of tuples, using a very limited amount of space. We consider both the case where the independent aggregate (the average duration in the example above) is an extremum and the case where it is an average, with any standard aggregate as the dependent aggregate; these cases can serve as building blocks for more sophisticated aggregates. We present an extensive experimental study, based on some real data sets and a wide variety of synthetic ones, to demonstrate the accuracy of our techniques. We show that this effectiveness is explained by the fact that our techniques exploit monotonicity and convergence properties of aggregates over data streams.
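To make the phone-call example concrete, here is a minimal single-pass sketch in Python. It is an illustration only, not the technique proposed in the paper: it assumes a fixed-width histogram over a duration range known in advance and a uniform spread of values inside each bucket. The class name `CorrelatedFraction`, its methods, and the parameters `lo`, `hi`, and `buckets` are all hypothetical.

```python
class CorrelatedFraction:
    """Estimate, in one pass, the fraction of international calls that
    are longer than the running average domestic call duration.

    Assumption (not from the paper): durations fall in [lo, hi) and are
    summarized by a fixed-width histogram with `buckets` cells.
    """

    def __init__(self, lo, hi, buckets):
        self.lo = lo
        self.width = (hi - lo) / buckets
        self.counts = [0] * buckets   # histogram of international durations
        self.intl_total = 0
        self.dom_sum = 0.0            # running sum for the independent aggregate
        self.dom_count = 0

    def add_domestic(self, duration):
        # The independent aggregate (an average) needs only two numbers.
        self.dom_sum += duration
        self.dom_count += 1

    def add_international(self, duration):
        # Clamp out-of-range durations into the first or last bucket.
        idx = int((duration - self.lo) / self.width)
        idx = min(max(idx, 0), len(self.counts) - 1)
        self.counts[idx] += 1
        self.intl_total += 1

    def estimate(self):
        """Return the estimated fraction, or None if either stream is empty."""
        if self.dom_count == 0 or self.intl_total == 0:
            return None
        avg = self.dom_sum / self.dom_count
        pos = (avg - self.lo) / self.width   # average in bucket units
        if pos <= 0:
            return 1.0
        if pos >= len(self.counts):
            return 0.0
        idx = int(pos)
        # Buckets entirely above the average count in full; the bucket
        # containing the average contributes its upper share, assuming
        # values are spread uniformly inside it.
        share = 1.0 - (pos - idx)
        above = sum(self.counts[idx + 1:]) + share * self.counts[idx]
        return above / self.intl_total


est = CorrelatedFraction(lo=0.0, hi=10.0, buckets=10)
for d in (2.0, 4.0):              # domestic calls, average duration 3.0
    est.add_domestic(d)
for d in (1.0, 2.0, 5.0, 7.0):    # international calls
    est.add_international(d)
print(est.estimate())
```

The memory used is fixed by `buckets` regardless of stream length, and the answer can be read off at any time because the histogram does not depend on the current average. The price is accuracy: the error comes from the bucket that contains the average, so a coarse histogram or heavily skewed durations would make the estimate less reliable.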