Efficient discovery of loosely structured motifs in biological data

  • Authors:
  • Fabio Fassetti;Gianluigi Greco;Giorgio Terracina

  • Affiliations:
  • Università della Calabria, Rende - Italy;Università della Calabria, Rende - Italy;Università della Calabria, Rende - Italy

  • Venue:
  • Proceedings of the 2006 ACM symposium on Applied computing
  • Year:
  • 2006

Quantified Score

Hi-index 0.01

Visualization

Abstract

In the last few years, the completion of the human genome sequencing showed up a wide range of new challenging issues involving raw data analysis. In particular, the discovery of information implicitly encoded in biological sequences is assuming a prominent role in identifying genetic diseases and in deciphering biological mechanisms. This information is usually represented by patterns frequently occurring in the sequences, also called motifs. Because of biological observations, the class of structured motifs have received much attention. This paper gives a contribution in this setting by providing an efficient algorithm for the identification of novel classes of structured motifs, where several kinds of "exceptions" (whose biological relevance recently emerged in the literature) may be tolerated in pattern repetitions.