Introduction to algorithms
An overview of data warehousing and OLAP technology
ACM SIGMOD Record
Querying and mining data streams: you only get one look a tutorial
Proceedings of the 2002 ACM SIGMOD international conference on Management of data
Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals
Data Mining and Knowledge Discovery
An Efficient Index Structure for Shift and Scale Invariant Search of Multi-Attribute Time Sequences
ICDE '02 Proceedings of the 18th International Conference on Data Engineering
Multi-dimensional regression analysis of time-series data streams
VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
StatStream: statistical monitoring of thousands of data streams in real time
VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
Hi-index | 0.00 |
Facing tremendous and potentially infinite stream data, it is impossible to record them entirely. Thus synopses are required to be generated timely to capture the underlying model for stream management systems. Traditionally, curve fitting through Multiple Linear Regression (MLR) is a powerful and efficient modeling tool. In order to further accelerate its processing efficiency, we propose Step-by-step Regression (SR) as a more efficient alternative. As revealed in experiments, it speeds up for more than 40 times. In addition, inspired by previous work, we integrated SR into cube environment through similar compression technique to perform online analytical processing and mining over data stream. Finally, experiments show that SR not only significantly alleviates the computation pressure on the front ends of data stream management systems, but also results in a much smaller stream cube for on line analysis and real-time surveillance.