An SVM-based Algorithm for Identification of Photosynthesis-specific Genome Features

  • Authors:
  • Gong-Xin Yu;George Ostrouchov;Al Geist;Nagiza F. Samatova

  • Affiliations:
  • -;-;-;-

  • Venue:
  • CSB '03 Proceedings of the IEEE Computer Society Conference on Bioinformatics
  • Year:
  • 2003

Quantified Score

Hi-index 0.02

Visualization

Abstract

This paper presents a novel algorithm for identificationand functional characterization of "key" genomefeatures responsible for a particular biochemical processof interest. The central idea is that individual genomefeatures are identified as "key" features if thediscrimination accuracy between two classes of genomeswith respect to a given biochemical process is sufficientlyaffected by the inclusion or exclusion of these features. Inthis paper, genome features are defined by high-resolutiongene functions. The discrimination procedureutilizes the Support Vector Machine classificationtechnique. The application to the oxygenic photosyntheticprocess resulted in 126 highly confident candidategenome features. While many of these features are well-knowncomponents in the oxygenic photosyntheticprocess, others are completely unknown, even includingsome hypothetical proteins. It is obvious that ouralgorithm is capable of discovering features related to atargeted biochemical process.