Supporting exact indexing of arbitrarily rotated shapes and periodic time series under Euclidean and warping distance measures

  • Authors:
  • Eamonn Keogh;Li Wei;Xiaopeng Xi;Michail Vlachos;Sang-Hee Lee;Pavlos Protopapas

  • Affiliations:
  • Department of Computer Science and Engineering, UCR, Riverside, USA;Department of Computer Science and Engineering, UCR, Riverside, USA;Department of Computer Science and Engineering, UCR, Riverside, USA;IBM T.J. Watson Research Center, Yorktown Heights, USA;Department of Anthropology, UCR, Riverside, USA;Harvard-Smithsonian Center for Astrophysics, Cambridge, USA

  • Venue:
  • The VLDB Journal — The International Journal on Very Large Data Bases
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Shape matching and indexing is important topic in its own right, and is a fundamental subroutine in most shape data mining algorithms. Given the ubiquity of shape, shape matching is an important problem with applications in domains as diverse as biometrics, industry, medicine, zoology and anthropology. The distance/similarity measure for used for shape matching must be invariant to many distortions, including scale, offset, noise, articulation, partial occlusion, etc. Most of these distortions are relatively easy to handle, either in the representation of the data or in the similarity measure used. However, rotation invariance is noted in the literature as being an especially difficult challenge. Current approaches typically try to achieve rotation invariance in the representation of the data, at the expense of discrimination ability, or in the distance measure, at the expense of efficiency. In this work, we show that we can take the slow but accurate approaches and dramatically speed them up. On real world problems our technique can take current approaches and make them four orders of magnitude faster without false dismissals. Moreover, our technique can be used with any of the dozens of existing shape representations and with all the most popular distance measures including Euclidean distance, dynamic time warping and Longest Common Subsequence. We further show that our indexing technique can be used to index star light curves, an important type of astronomical data, without modification.