Brief communication: Operon prediction based on SVM

  • Authors:
  • Guo-qing Zhang;Zhi-wei Cao;Qing-ming Luo;Yu-dong Cai;Yi-xue Li

  • Affiliations:
  • Hubei Bioinformatics and Molecular Imaging Key Laboratory, Huazhong University of Science and Technology, Wuhan, Hubei 430074, China;Shanghai Center for Bioinformation Technology, 100 Qinzhou Road, Shanghai 200235, China;Hubei Bioinformatics and Molecular Imaging Key Laboratory, Huazhong University of Science and Technology, Wuhan, Hubei 430074, China;Department of Chemistry, College of Sciences, Shanghai University, 99 Shang-Da Road, Shanghai 200436, China and Biomolecular Sciences Department, University of Manchester, Institute of Science and ...;Shanghai Center for Bioinformation Technology, 100 Qinzhou Road, Shanghai 200235, China and Bioinformation Center of Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Yue Y ...

  • Venue:
  • Computational Biology and Chemistry
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

The operon is a specific functional organization of genes found in bacterial genomes. Most genes within operons share common features. The support vector machine (SVM) approach is here used to predict operons at the genomic level. Four features were chosen as SVM input vectors: the intergenic distances, the number of common pathways, the number of conserved gene pairs and the mutual information of phylogenetic profiles. The analysis reveals that these common properties are indeed characteristic of the genes within operons and are different from that of non-operonic genes. Jackknife testing indicates that these input feature vectors, employed with RBF kernel SVM, achieve high accuracy. To validate the method, Escherichia coli K12 and Bacillus subtilis were taken as benchmark genomes of known operon structure, and the prediction results in both show that the SVM can detect operon genes in target genomes efficiently and offers a satisfactory balance between sensitivity and specificity.