A frequent pattern mining method for finding planted (l, d)-motifs of unknown length

  • Authors:
  • Caiyan Jia;Ruqian Lu;Lusheng Chen

  • Affiliations:
  • Department of Computer Science, Beijing Jiaotong University, Beijing, China;Shanghai Key Lab of Intelligent Information Processing & Department of Computer Science and Engineering, Fudan University, Shanghai, China and Institute of Mathematics, Chinese Academy of Scie ...;Shanghai Key Lab of Intelligent Information Processing & Department of Computer Science and Engineering, Fudan University, Shanghai, China

  • Venue:
  • RSKT'10 Proceedings of the 5th international conference on Rough set and knowledge technology
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Identification and characterization of gene regulatory binding motifs is one of the fundamental tasks toward systematically understanding the molecular mechanisms of transcriptional regulation. Recently, the problem has been abstracted as the challenge planted (l, d)- motif problem. Previous studies have developed numerous methods to solve the problem. But most of methods need to specify the length l of a motif in advance. In this study, we present an exact and efficient algorithm, called Apriori-Motif, without given l. The algorithm uses breadth first search and prunes the search space quickly by the downward closure property used in Apriori, a classical algorithm of frequent pattern mining. Empirical study shows that Apriori-Motif is better than some existing methods.