Computing similarity of run-length encoded strings with affine gap penalty

  • Authors:
  • Jin Wook Kim;Amihood Amir;Gad M. Landau;Kunsoo Park

  • Affiliations:
  • School of Computer Science and Engineering, Seoul National University;Department of Computer Science, Bar-Ilan University and Georgia Tech;Department of Computer Science, University of Haifa and Polytechnic University;School of Computer Science and Engineering, Seoul National University

  • Venue:
  • SPIRE'05 Proceedings of the 12th international conference on String Processing and Information Retrieval
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

The problem of computing similarity of two run-length encoded strings has been studied for various scoring metrics. Many algorithms have been developed for the longest common subsequence metric and some algorithms for the Levenshtein distance metric and the weighted edit distance metric. In this paper we consider similarity based on the affine gap penalty metric which is a more general and rather complicated scoring metric than the weighted edit distance. To compute similarity in this model efficiently, we convert the problem to a path problem on a directed acyclic graph and use some properties of maximum paths in this graph. We present an O(nm′+n′m) time algorithm for computing similarity of two run-length encoded strings in the affine gap penalty model, where n′ and m′ are the lengths of given two run-length encoded strings, and n and m are the decoded lengths of given two strings, respectively.