Evaluation of different complexity measures for signal detection in genome sequences

  • Authors:
  • Mehdi Kargar;Aijun An

  • Affiliations:
  • York University, Toronto, Canada;York University, Toronto, Canada

  • Venue:
  • Proceedings of the First ACM International Conference on Bioinformatics and Computational Biology
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Analyzing large amounts of data is one of the most challenging problem in modern molecular biology. In this work, different complexity measures and methods are applied to identify the signals in the whole genome of the three prokaryotic organisms. In addition to previous complexity measures, new measures are introduced for representing Open Reading Frames (ORF). We apply classification algorithms to determine which complexity measures can lead to better predictive performance in discriminating genes from pseudo-genes in ORFs. Also, we investigate whether positions and lengths of windows in ORFs have significant impact on distinguishing between genes and pseudo-genes. Different classification algorithms are applied for classifying ORFs into genes and pseudo-genes.