Continuously maintaining order statistics over data streams: extended abstract

  • Authors:
  • Xuemin Lin

  • Affiliations:
  • University of New South Wales, Australia

  • Venue:
  • ADC '07 Proceedings of the eighteenth conference on Australasian database - Volume 63
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

A rank query is essentially to find a data element with a given rank against a monotonic order specified on data elements. It has several equivalent variations [8, 17, 30]. Rank queries over data streams have been investigated in the form of quantile computation. A &phis;-quantile (&phis; ∈ (0,1]) of a collection of N data elements is the element with rank [&phis;N] against a monotonic order specified on data elements. Rank and quantile queries have many applications [1, 3, 6, 7, 10, 14-16, 26, 27], including monitoring high speed networks, trends and fleeting opportunities detection in the stock market, sensor data analysis, Web ranking aggregation and log mining, etc. In these applications, they not only play very important roles in the decision making but also have been used in summarizing data distributions of data streams. The following example shows a popular tool to compare the distributions of two data sets (data streams).