Large-scale analysis of non-synonymous coding region single nucleotide polymorphisms

  • Authors:
  • Robert J. Clifford;Michael N. Edmonson;Cu Nguyen;Kenneth H. Buetow

  • Affiliations:
  • Laboratory of Population Genetics, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA;Laboratory of Population Genetics, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA;Laboratory of Population Genetics, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA;Laboratory of Population Genetics, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA

  • Venue:
  • Bioinformatics
  • Year:
  • 2004

Quantified Score

Hi-index 3.84

Visualization

Abstract

Motivation: Single nucleotide polymorphisms (SNPs) are the most common form of genetic variant in humans. SNPs causing amino acid substitutions are of particular interest as candidates for loci affecting susceptibility to complex diseases, such as diabetes and hypertension. To efficiently screen SNPs for disease association, it is important to distinguish neutral variants from deleterious ones. Results: We describe the use of Pfam protein motif models and the HMMER program to predict whether amino acid changes in conserved domains are likely to affect protein function. We find that the magnitude of the change in the HMMER E-value caused by an amino acid substitution is a good predictor of whether it is deleterious. We provide internet-accessible display tools for a genomewide collection of SNPs, including 7391 distinct non-synonymous coding region SNPs in 2683 genes. Availability: http://lpgws.nci.nih.gov/cgi-bin/GeneViewer.cgi