Randomization tests
PERUSE: An Unsupervised Algorithm for Finding Recurrig Patterns in Time Series
ICDM '02 Proceedings of the 2002 IEEE International Conference on Data Mining
Discovery of climate indices using clustering
Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
Assessing data mining results via swap randomization
ACM Transactions on Knowledge Discovery from Data (TKDD)
Mining approximate motifs in time series
DS'06 Proceedings of the 9th international conference on Discovery Science
Hi-index | 0.00 |
Dipoles represent long distance connections between the pressure anomalies of two distant regions that are negatively correlated with each other. Such dipoles have proven important for understanding and explaining the variability in climate in many regions of the world, e.g., the El Nino climate phenomenon is known to be responsible for precipitation and temperature anomalies over large parts of the world. Systematic approaches for dipole detection generate a large number of candidate dipoles, but there exists no method to evaluate the significance of the candidate teleconnections. In this paper, we present a novel method for testing the statistical significance of the class of spatio-temporal teleconnection patterns called as dipoles. One of the most important challenges in addressing significance testing in a spatio-temporal context is how to address the spatial and temporal dependencies that show up as high autocorrelation. We present a novel approach that uses the wild bootstrap to capture the spatio-temporal dependencies, in the special use case of teleconnections in climate data. Our approach to find the statistical significance takes into account the autocorrelation, the seasonality and the trend in the time series over a period of time. This framework is applicable to other problems in spatio-temporal data mining to assess the significance of the patterns.