Biological domain identification based in codon usage by means of rule and tree induction

  • Authors:
  • Antonio Neme;Pedro Miramontes

  • Affiliations:
  • IIMAS, UNAM, México;Facultad de Ciencias, UNAM, México

  • Venue:
  • CMSB'04 Proceedings of the 20 international conference on Computational Methods in Systems Biology
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

There are three domains in living nature: archaea, bacteria and eukarya. It has been shown, trough a number of multivariate tools, that codon usage, a 64 dimensional vector that stablishes how often a given organism makes use of each codon, is related to domain. Another method is proposed here based in rule and tree induction from codon usage of several organisms. It is shown that domain can be identified trough codon usage and a simple set of rules. Two methods were applied, CN 2 and C 4.5. Obtained rules describe data better than other methods, in the sense that are topological interpretable and have phenomenological meaning.