Fast computation of entropic profiles for the detection of conservation in genomes

  • Authors:
  • Matteo Comin;Morris Antonello

  • Affiliations:
  • Department of Information Engineering, University of Padova, Padova, Italy;Department of Information Engineering, University of Padova, Padova, Italy

  • Venue:
  • PRIB'13 Proceedings of the 8th IAPR international conference on Pattern Recognition in Bioinformatics
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

The information theory has been used for quite some time in the area of computational biology. In this paper we discuss and improve the function Entropic Profile, introduced by Vinga and Almeida in [23]. The Entropic Profiler is a function of the genomic location that captures the importance of that region with respect to the whole genome. We provide a linear time linear space algorithm called Fast Entropic Profile, as opposed to the original quadratic implementation. Moreover we propose an alternative normalization that can be also efficiently implemented. We show that Fast EP is suitable for large genomes and for the discovery of motifs with unbounded length.