SpliceMachine: predicting splice sites from high-dimensional local context representations

  • Authors:
  • Sven Degroeve;Yvan Saeys;Bernard De Baets;Pierre Rouzé;Yves Van De Peer

  • Affiliations:
  • Department of Plant Systems Biology, Flanders Interuniversity Institute for Biotechnology (VIB) Technologiepark 927, Gent 9052, Belgium;Department of Plant Systems Biology, Flanders Interuniversity Institute for Biotechnology (VIB) Technologiepark 927, Gent 9052, Belgium;Department of Applied Mathematics, Biometrics and Process Control, Ghent University Coupure links 653, Gent 9000, Belgium;Laboratoire associé de l'INRA (France) Technologiepark 927, Gent 9052, Belgium;Department of Plant Systems Biology, Flanders Interuniversity Institute for Biotechnology (VIB) Technologiepark 927, Gent 9052, Belgium

  • Venue:
  • Bioinformatics
  • Year:
  • 2005

Quantified Score

Hi-index 3.84

Visualization

Abstract

Motivation: In this age of complete genome sequencing, finding the location and structure of genes is crucial for further molecular research. The accurate prediction of intron boundaries largely facilitates the correct prediction of gene structure in nuclear genomes. Many tools for localizing these boundaries on DNA sequences have been developed and are available to researchers through the internet. Nevertheless, these tools still make many false positive predictions. Results: This manuscript presents a novel publicly available splice site prediction tool named SpliceMachine that (i) shows state-of-the-art prediction performance on Arabidopsis thaliana and human sequences, (ii) performs a computationally fast annotation and (iii) can be trained by the user on its own data. Availability: Results, figures and software are available at http://www.bioinformatics.psb.ugent.be/supplementary_data/ Contact:sven.degroeve@psb.ugent.be; yves.vandepeer@psb.ugent.be