Database note: The iProClass integrated database for protein functional analysis

  • Authors:
  • Cathy H Wu;Hongzhan Huang;Anastasia Nikolskaya;Zhangzhi Hu;Winona C Barker

  • Affiliations:
  • Georgetown University Medical Center, 3900 Reservoir Road, NW, Box 571455, Preclinical Building, Room LR-3, Washington, DC 20057-1455, USA;Georgetown University Medical Center, 3900 Reservoir Road, NW, Box 571455, Preclinical Building, Room LR-3, Washington, DC 20057-1455, USA;Georgetown University Medical Center, 3900 Reservoir Road, NW, Box 571455, Preclinical Building, Room LR-3, Washington, DC 20057-1455, USA;National Biomedical Research Foundation, 3900 Reservoir Road, NW, Washington, DC 20057-1455, USA;National Biomedical Research Foundation, 3900 Reservoir Road, NW, Washington, DC 20057-1455, USA

  • Venue:
  • Computational Biology and Chemistry
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

Increasingly, scientists have begun to tackle gene functions and other complex regulatory processes by studying organisms at the global scales for various levels of biological organization, ranging from genomes to metabolomes and physiomes. Meanwhile, new bioinformatics methods have been developed for inferring protein function using associative analysis of functional properties to complement the traditional sequence homology-based methods. To fully exploit the value of the high-throughput system biology data and to facilitate protein functional studies requires bioinformatics infrastructures that support both data integration and associative analysis. The iProClass database, designed to serve as a framework for data integration in a distributed networking environment, provides comprehensive descriptions of all proteins, with rich links to over 50 databases of protein family, function, pathway, interaction, modification, structure, genome, ontology, literature, and taxonomy. In particular, the database is organized with PIRSF family classification and maps to other family, function, and structure classification schemes. Coupled with the underlying taxonomic information for complete genomes, the iProClass system (http://pir.georgetown.edu/iproclass/) supports associative studies of protein family, domain, function, and structure. A case study of the phosphoglycerate mutases illustrates a systematic approach for protein family and phylogenetic analysis. Such studies may serve as a basis for further analysis of protein functional evolution, and its relationship to the co-evolution of metabolic pathways, cellular networks, and organisms.