Correlation analysis of spatial time series datasets: a filter-and-refine approach

  • Authors:
  • Pusheng Zhang;Yan Huang;Shashi Shekhar;Vipin Kumar

  • Affiliations:
  • Computer Science & Engineering Department, University of Minnesota,, Minneapolis, MN;Computer Science & Engineering Department, University of Minnesota,, Minneapolis, MN;Computer Science & Engineering Department, University of Minnesota,, Minneapolis, MN;Computer Science & Engineering Department, University of Minnesota,, Minneapolis, MN

  • Venue:
  • PAKDD'03 Proceedings of the 7th Pacific-Asia conference on Advances in knowledge discovery and data mining
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

A spatial time series dataset is a collection of time series, each referencing a location in a common spatial framework. Correlation analysis is often used to identify pairs of potentially interacting elements from the cross product of two spatial time series datasets. However, the computational cost of correlation analysis is very high when the dimension of the time series and the number of locations in the spatial frameworks are large. The key contribution of this paper is the use of spatial autocorrelation among spatial neighboring time series to reduce computational cost. A filter-and-refine algorithm based on coning, i.e. grouping of locations, is proposed to reduce the cost of correlation analysis over a pair of spatial time series datasets. Cone-level correlation computation can be used to eliminate (filter out) a large number of element pairs whose correlation is clearly below (or above) a given threshold. Element pair correlation needs to be computed for remaining pairs. Using experimental studies with Earth science datasets, we show that the filter-and-refine approach can save a large fraction of the computational cost, particularly when the minimal correlation threshold is high.