The construction of amino acid substitution matrices for the comparison of proteins with non-standard compositions

  • Authors:
  • Yi-Kuo Yu;Stephen F. Altschul

  • Affiliations:
  • National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health Bethesda, MD 20894, USA;National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health Bethesda, MD 20894, USA

  • Venue:
  • Bioinformatics
  • Year:
  • 2005

Quantified Score

Hi-index 3.84

Visualization

Abstract

Motivation: Amino acid substitution matrices play a central role in protein alignment methods. Standard log-odds matrices, such as those of the PAM and BLOSUM series, are constructed from large sets of protein alignments having implicit background amino acid frequencies. However, these matrices frequently are used to compare proteins with markedly different amino acid compositions, such as transmembrane proteins or proteins from organisms with strongly biased nucleotide compositions. It has been argued elsewhere that standard matrices are not ideal for such comparisons and, furthermore, a rationale has been presented for transforming a standard matrix for use in a non-standard compositional context. Results: This paper presents the mathematical details underlying the compositional adjustment of amino acid or DNA substitution matrices. Availability: Programs implementing the methods described are available from the authors upon request. Contact: altschul@ncbi.nlm.nih.gov