Algorithms for finding a most similar subforest

  • Authors:
  • Jesper Jansson;Zeshan Peng

  • Affiliations:
  • Department of Computer Science and Communication Engineering, Kyushu University, Higashi-ku, Fukuoka, Japan;Department of Computer Science, The University of Hong Kong, Hong Kong

  • Venue:
  • CPM'06 Proceedings of the 17th Annual conference on Combinatorial Pattern Matching
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Given an ordered labeled forest F (“the target forest”) and an ordered labeled forest G (“the pattern forest”), the most similar subforest problem is to find a subforest F′ of F such that the distance between F′ and G is minimum over all possible F′. This problem generalizes several well-studied problems which have important applications in locating patterns in hierarchical structures such as RNA molecules' secondary structures and XML documents. In this paper, we present efficient algorithms for the most similar subforest problem with forest edit distance for three types of subforests: simple substructures, sibling substructures, and closed subforests.