A binary-classification-based metric between time-series distributions and its use in statistical and learning problems

  • Authors:
  • Daniil Ryabko;Jérémie Mary

  • Affiliations:
  • SequeL-INRIA, LIFL-CNRS, Université de Lille, Villeneuve d'Ascq, France;SequeL-INRIA, LIFL-CNRS, Université de Lille, Villeneuve d'Ascq, France

  • Venue:
  • The Journal of Machine Learning Research
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

A metric between time-series distributions is proposed that can be evaluated using binary classification methods, which were originally developed to work on i.i.d. data. It is shown how this metric can be used for solving statistical problems that are seemingly unrelated to classification and concern highly dependent time series. Specifically, the problems of time-series clustering, homogeneity testing and the three-sample problem are addressed. Universal consistency of the resulting algorithms is proven under most general assumptions. The theoretical results are illustrated with experiments on synthetic and real-world data.