MultiLoc: prediction of protein subcellular localization using N-terminal targeting sequences, sequence motifs and amino acid composition

  • Authors:
  • Annette Höglund;Pierre Dönnes;Torsten Blum;Hans-Werner Adolph;Oliver Kohlbacher

  • Affiliations:
  • Division for Simulation of Biological Systems, WSI/ZBIT, Eberhard Karls University Tübingen Sand 14, D-72076 Tübingen, Germany;Division for Simulation of Biological Systems, WSI/ZBIT, Eberhard Karls University Tübingen Sand 14, D-72076 Tübingen, Germany;Division for Simulation of Biological Systems, WSI/ZBIT, Eberhard Karls University Tübingen Sand 14, D-72076 Tübingen, Germany;Department of Biochemistry and Center for Bioinformatics, Saarland University D-66041 Saarbrücken, Germany;Division for Simulation of Biological Systems, WSI/ZBIT, Eberhard Karls University Tübingen Sand 14, D-72076 Tübingen, Germany

  • Venue:
  • Bioinformatics
  • Year:
  • 2006

Quantified Score

Hi-index 3.86

Abstract

Motivation: Functional annotation of unknown proteins is a major goal in proteomics. A key annotation is the prediction of a protein's subcellular localization. Numerous prediction techniques have been developed, typically focusing on a single underlying biological aspect or predicting a subset of all possible localizations. An important step is taken towards emulating the protein sorting process by capturing and bringing together biologically relevant information, and addressing the clear need to improve prediction accuracy and localization coverage.

Results: Here we present a novel SVM-based approach for predicting subcellular localization, which integrates N-terminal targeting sequences, amino acid composition and protein sequence motifs. We show how this approach improves the prediction based on N-terminal targeting sequences, by comparing our method TargetLoc against existing methods. Furthermore, MultiLoc performs considerably better than comparable methods predicting all major eukaryotic subcellular localizations, and shows better or comparable results to methods that are specialized on fewer localizations or for one organism.

Availability: http://www-bs.informatik.uni-tuebingen.de/Services/MultiLoc/

Contact: hoeglund@informatik.uni-tuebingen.de
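
As an illustration of one of the feature types named in the Results, the sketch below computes an amino acid composition vector for a whole sequence and for its N-terminal region. It is a minimal sketch and not the MultiLoc implementation: the function names, the 20-residue N-terminal window (`n_term_len`) and the way the two vectors are concatenated are assumptions made for this example. The abstract does not specify how the features are encoded before they reach the SVM.

```python
from collections import Counter

# The 20 standard amino acids, in a fixed order so vectors are comparable.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def aa_composition(sequence):
    """Return the relative frequency of each standard amino acid.

    Characters outside the 20 standard residues (e.g. 'X') are ignored.
    """
    seq = sequence.upper()
    counts = Counter(ch for ch in seq if ch in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        return [0.0] * len(AMINO_ACIDS)
    return [counts[aa] / total for aa in AMINO_ACIDS]


def feature_vector(sequence, n_term_len=20):
    """Concatenate whole-sequence and N-terminal composition.

    n_term_len is a hypothetical window size chosen for illustration only.
    """
    return aa_composition(sequence) + aa_composition(sequence[:n_term_len])
```

A vector built this way could serve as input to any standard SVM library; that step is omitted here because the abstract gives no detail on kernels or parameters.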