Enabling Real-Time Querying of Live and Historical Stream Data

  • Authors:
  • Frederick Reiss;Kurt Stockinger;Kesheng Wu;Arie Shoshani;Joseph M. Hellerstein

  • Affiliations:
  • Lawrence Berkeley National Laboratory, USA/ University of California, Berkeley, USA/ IBM Almaden Research Center, USA;Lawrence Berkeley National Laboratory, USA;Lawrence Berkeley National Laboratory, USA;Lawrence Berkeley National Laboratory, USA;University of California, Berkeley, USA

  • Venue:
  • SSDBM '07 Proceedings of the 19th International Conference on Scientific and Statistical Database Management
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Applications that query data streams in order to identify trends, patterns, or anomalies can often benefit from comparing the live stream data with archived historical stream data. However, searching this historical data in real time has been considered so far to be prohibitively expensive. One of the main bottlenecks is the update costs of the indices over the archived data. In this paper, we address this problem by using our highly-efficient bitmap indexing technology (called FastBit) and demonstrate that the index update operations are sufficiently efficient for this bottleneck to be removed. We describe our prototype system based on the TelegraphCQ streaming query processor and the FastBit bitmap index. We present a detailed performance evaluation of our system using a complex query workload for analyzing real network traffic data. The combined system uses TelegraphCQ to analyze streams of traffic information and FastBit to correlate current behaviors with historical trends. We demonstrate that our system can simultaneously analyze (1) live streams with high data rates and (2) a large repository of historical stream data.