Efficient algorithms for similarity measures over sequential data: a look beyond kernels

  • Authors:
  • Konrad Rieck;Pavel Laskov;Klaus-Robert Müller

  • Affiliations:
  • Fraunhofer FIRST.IDA, Berlin, Germany;Fraunhofer FIRST.IDA, Berlin, Germany;Fraunhofer FIRST.IDA, Berlin, Germany

  • Venue:
  • DAGM'06 Proceedings of the 28th conference on Pattern Recognition
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Kernel functions as similarity measures for sequential data have been extensively studied in previous research. This contribution addresses the efficient computation of distance functions and similarity coefficients for sequential data. Two proposed algorithms utilize different data structures for efficient computation and yield a runtime linear in the sequence length. Experiments on network data for intrusion detection suggest the importance of distances and even non-metric similarity measures for sequential data.