A Similarity Measure for Sequences of Categorical Data Based on the Ordering of Common Elements

  • Authors:
  • Cristina Gómez-Alonso;Aida Valls

  • Affiliations:
  • iTAKA Research Group - Intelligent Tech. for Advanced Knowledge Acquisition Department of Computer Science and Mathematics, Universitat Rovira i Virgili, Tarragona, Spain 43007;iTAKA Research Group - Intelligent Tech. for Advanced Knowledge Acquisition Department of Computer Science and Mathematics, Universitat Rovira i Virgili, Tarragona, Spain 43007

  • Venue:
  • MDAI '08 Sabadell Proceedings of the 5th International Conference on Modeling Decisions for Artificial Intelligence
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Similarity measures are usually used to compare items and identify pairs or groups of similar individuals. The similarity measure strongly depends on the type of values to compare. We have faced the problem of considering that the information of the individuals is a sequence of events (i.e. sequences of web pages visited by a certain user or the personal daily schedule). Some measures for numerical sequences exist, but very few methods consider sequences of categorical data. In this paper, we present a new similarity measure for sequences of categorical labels and compare it with the previous approaches.