A practical algorithm to find the best subsequence patterns

  • Authors:
  • Masahiro Hirao;Hiromasa Hoshino;Ayumi Shinohara;Masayuki Takeda;Setsuo Arikawa

  • Affiliations:
  • Department of Informatics, Kyushu University 33, Fukuoka 812-8581, Japan;Matsushita Electric Industrial Co. and Department of Informatics, Kyushu University 33, Fukuoka 812-8581, Japan;Department of Informatics, Kyushu University 33, Fukuoka 812-8581, Japan;Department of Informatics, Kyushu University 33, Fukuoka 812-8581, Japan;Department of Informatics, Kyushu University 33, Fukuoka 812-8581, Japan

  • Venue:
  • Theoretical Computer Science
  • Year:
  • 2003

Quantified Score

Hi-index 5.23

Visualization

Abstract

Given two sets of strings, consider the problem to find a subsequence that is common to one set but never appears in the other set. We regard it to find a subsequence pattern which separates these two sets. The problem is known to be NP-complete. We naturally generalize it to an optimization problem, where we try to find a subsequence pattern which maximally separates these two sets. We provide a practical algorithm to solve it exactly. Our algorithm uses two pruning heuristics based on the properties of subsequence languages, and utilizes the data structure called subsequence automata. We report some experimental results, which show these heuristics and the data structure contribute to reduce the search time.