Mining maximal flexible patterns in a sequence

  • Authors: Hiroki Arimura; Takeaki Uno
  • Affiliations: Graduate School of Information Science and Technology, Hokkaido University, Sapporo, Japan; National Institute of Informatics, Tokyo, Japan
  • Venue: JSAI'07 Proceedings of the 2007 conference on New frontiers in artificial intelligence
  • Year: 2007

Abstract

We consider the problem of enumerating all maximal patterns in an input sequence database for the class of flexible patterns. A maximal pattern (also called a closed pattern) is the most specific pattern in the equivalence class of patterns that have the same list of occurrences in the input. Since our notion of maximal patterns is based on position occurrences, it is weaker than the traditional notion of maximal patterns based on document occurrences. Based on the framework of reverse search, we present MaxFlex, an efficient depth-first search algorithm that enumerates all maximal flexible patterns in a given sequence database without duplicates in O(||T||×|Σ|) time per pattern and O(||T||) space, where ||T|| is the size of the input sequence database T and |Σ| is the size of the alphabet over which the sequences are defined. Hence the enumeration problem for maximal flexible patterns is solvable with polynomial delay and in polynomial space.
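
To make the distinction between position occurrences and document occurrences concrete, the following is a minimal Python sketch. It is not the MaxFlex algorithm. It rests on assumptions that the abstract does not spell out: a flexible pattern is modelled as a non-empty list of constant segments separated by gaps of any length (including zero), and a position occurrence is taken to be a pair (sequence index, start offset) from which the whole pattern can be matched. The names `occurrences` and `documents` are hypothetical helpers introduced only for illustration.

```python
from typing import List, Tuple

# Assumption: a flexible pattern is a non-empty list of constant segments;
# a variable-length gap (zero or more symbols) sits between neighbouring segments.
Pattern = List[str]


def occurrences(pattern: Pattern, database: List[str]) -> List[Tuple[int, int]]:
    """Position occurrences: (sequence index, start offset) pairs at which the
    first segment matches and the remaining segments follow in order."""
    result = []
    first = pattern[0]
    for i, text in enumerate(database):
        for start in range(len(text) - len(first) + 1):
            if not text.startswith(first, start):
                continue
            pos = start + len(first)
            matched = True
            # Greedy leftmost placement of each later segment is enough
            # to decide whether some match exists from this start.
            for segment in pattern[1:]:
                found = text.find(segment, pos)
                if found < 0:
                    matched = False
                    break
                pos = found + len(segment)
            if matched:
                result.append((i, start))
    return result


def documents(pattern: Pattern, database: List[str]) -> List[int]:
    """Document occurrences: indices of sequences containing the pattern."""
    return sorted({i for i, _ in occurrences(pattern, database)})


# A*C and AB*C have the same position occurrences here, so under the
# assumptions above they fall into one equivalence class, and AB*C is
# the more specific of the two.
db = ["ABXCABYC"]
same_class = occurrences(["A", "C"], db) == occurrences(["AB", "C"], db)

# A and AB*B occur in the same document but at different positions, so
# they are equivalent by document occurrences yet not by position occurrences.
db2 = ["ABAB"]
same_docs = documents(["A"], db2) == documents(["AB", "B"], db2)
same_positions = occurrences(["A"], db2) == occurrences(["AB", "B"], db2)
```

In this toy setting, grouping by position occurrences splits patterns into finer equivalence classes than grouping by document occurrences, which is one way to read the abstract's remark that the two notions of maximality differ.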