MIPS bacterial genomes functional annotation benchmark dataset

  • Authors:
  • Igor V. Tetko;Barbara Brauner;Irmtraud Dunger-Kaltenbach;Goar Frishman;Corinna Montrone;Gisela Fobo;Andreas Ruepp;Alexey V. Antonov;Dimitrij Surmeli;Hans-Wernen Mewes

  • Affiliations:
  • Institute for Bioinformatics (MIPS), GSF National Research Center for Environment and Health Ingolstaedter Landstrasse 1, D-85764 Neuherberg, Germany;Institute for Bioinformatics (MIPS), GSF National Research Center for Environment and Health Ingolstaedter Landstrasse 1, D-85764 Neuherberg, Germany;Institute for Bioinformatics (MIPS), GSF National Research Center for Environment and Health Ingolstaedter Landstrasse 1, D-85764 Neuherberg, Germany;Institute for Bioinformatics (MIPS), GSF National Research Center for Environment and Health Ingolstaedter Landstrasse 1, D-85764 Neuherberg, Germany;Institute for Bioinformatics (MIPS), GSF National Research Center for Environment and Health Ingolstaedter Landstrasse 1, D-85764 Neuherberg, Germany;Institute for Bioinformatics (MIPS), GSF National Research Center for Environment and Health Ingolstaedter Landstrasse 1, D-85764 Neuherberg, Germany;Institute for Bioinformatics (MIPS), GSF National Research Center for Environment and Health Ingolstaedter Landstrasse 1, D-85764 Neuherberg, Germany;Institute for Bioinformatics (MIPS), GSF National Research Center for Environment and Health Ingolstaedter Landstrasse 1, D-85764 Neuherberg, Germany;Institute for Bioinformatics (MIPS), GSF National Research Center for Environment and Health Ingolstaedter Landstrasse 1, D-85764 Neuherberg, Germany;Institute for Bioinformatics (MIPS), GSF National Research Center for Environment and Health Ingolstaedter Landstrasse 1, D-85764 Neuherberg, Germany

  • Venue:
  • Bioinformatics
  • Year:
  • 2005

Quantified Score

Hi-index 3.84

Visualization

Abstract

Motivation: Any development of new methods for automatic functional annotation of proteins according to their sequences requires high-quality data (as benchmark) as well as tedious preparatory work to generate sequence parameters required as input data for the machine learning methods. Different program settings and incompatible protocols make a comparison of the analyzed methods difficult. Results: The MIPS Bacterial Functional Annotation Benchmark dataset (MIPS-BFAB) is a new, high-quality resource comprising four bacterial genomes manually annotated according to the MIPS functional catalogue (FunCat). These resources include precalculated sequence parameters, such as sequence similarity scores, InterPro domain composition and other parameters that could be used to develop and benchmark methods for functional annotation of bacterial protein sequences. These data are provided in XML format and can be used by scientists who are not necessarily experts in genome annotation. Availability: BFAB is available at http://mips.gsf.de/proj/bfab Contact: i.tetko@gsf.de