Prediction of Binding Sites in the Mouse Genome Using Support Vector Machines

  • Authors:
  • Yi Sun;Mark Robinson;Rod Adams;Alistair Rust;Neil Davey

  • Affiliations:
  • Science and technology research school, University of Hertfordshire, United Kingdom AL10 9AB;Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, USA 48824;Science and technology research school, University of Hertfordshire, United Kingdom AL10 9AB;Institute for Systems Biology, Seattle, USA 98103;Science and technology research school, University of Hertfordshire, United Kingdom AL10 9AB

  • Venue:
  • ICANN '08 Proceedings of the 18th international conference on Artificial Neural Networks, Part II
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Computational prediction of cis-regulatory binding sites is widely acknowledged as a difficult task. There are many different algorithms for searching for binding sites in current use. However, most of them produce a high rate of false positive predictions. Moreover, many algorithmic approaches are inherently constrained with respect to the range of binding sites that they can be expected to reliably predict. We propose to use SVMs to predict binding sites from multiple sources of evidence. We combine random selection under-sampling and the synthetic minority over-sampling technique to deal with the imbalanced nature of the data. In addition, we remove some of the final predicted binding sites on the basis of their biological plausibility. The results show that we can generate a new prediction that significantly improves on the performance of any one of the individual prediction algorithms.