Constructing ontology-driven protein family databases

  • Authors:
  • K. Wolstencroft;R. Mcentire;R. Stevens;L. Tabernero;A. Brass

  • Affiliations:
  • School of Biological Sciences, Michael Smith Building, University of Manchester Oxford Road, Manchester, M13 9PT, UK;Department of Computer Science, Kilburn Building, University of Manchester Oxford Road, Manchester, M13 9PL, UK;GlaxoSmithKline 709 Swedeland Road, King of Prussia, Pennsylvania, 19406, USA;School of Biological Sciences, Michael Smith Building, University of Manchester Oxford Road, Manchester, M13 9PT, UK;School of Biological Sciences, Michael Smith Building, University of Manchester Oxford Road, Manchester, M13 9PT, UK

  • Venue:
  • Bioinformatics
  • Year:
  • 2005

Quantified Score

Hi-index 3.84

Visualization

Abstract

Motivation:Protein family databases provide a central focus for scientific communities as well as providing useful resources to aide research. However, such resources require constant curation and often become outdated and discontinued. We have developed an ontology-driven system for capturing and managing protein family data that addresses the problems of maintenance and sustainability. Results:Using protein phosphatases and ABC transporters as model protein families, we constructed two protein family database resources around a central DAML+OIL ontology. Each resource contains specialist information about each protein family, providing specialized domain-specific resources based on the same template structure. The formal structure, combined with the extraction of biological data using GO terms, allows for automated update strategies. Despite the functional differences between the two protein families, the ontology model was equally applicable to both, demonstrating the generic nature of the system. Availability: The protein phosphatase resource, PhosphaBase, is freely available on the internet (http://www.bioinf.man.ac.uk/phosphabase). The DAML+OIL ontology for the protein phosphatases and the ABC transporters is available on request from the authors. Contact: kwolstencroft@cs.man.ac.uk