Multipattern consensus regions in multiple aligned protein sequences and their segmentation

  • Authors:
  • David K. Y. Chiu;Yan Wang

  • Affiliations:
  • Department of Computing and Information Science, University of Guelph, Guelph, ON, Canada;Department of Computing and Information Science, University of Guelph, Guelph, ON, Canada

  • Venue:
  • EURASIP Journal on Bioinformatics and Systems Biology
  • Year:
  • 2006

Abstract

Decomposing a biological sequence into its functional regions is an important prerequisite to understanding the molecule. Using a multiple alignment of the sequences, we evaluate a segmentation based on the type of statistical variation pattern at each aligned site. To describe this more general kind of pattern, we introduce multipattern consensus regions: segmented regions defined by conserved as well as interdependent patterns. The proposed consensus region thus considers patterns that are statistically significant and extend over a local neighborhood. To show its relevance to protein sequence analysis, the cancer suppressor gene p53 is examined. The results show significant associations between the detected regions and the tendency toward mutation, location in the 3D structure, and heritable cancer factors that can be inferred from human twin studies.
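
The idea of labeling each aligned site by its variation pattern and then grouping neighboring sites can be illustrated with a minimal sketch. This is not the authors' algorithm: it assumes Shannon entropy as the conservation measure, mutual information between adjacent sites as the interdependence measure, and fixed illustrative thresholds (`entropy_max`, `mi_min`) in place of a proper statistical significance test. All function names are hypothetical.

```python
import math
from collections import Counter
from itertools import groupby


def column_entropy(column):
    """Shannon entropy (in bits) of the symbols observed at one aligned site."""
    counts = Counter(column)
    total = len(column)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def mutual_information(col_a, col_b):
    """Mutual information (in bits) between two aligned sites: H(A) + H(B) - H(A, B)."""
    joint = list(zip(col_a, col_b))
    return column_entropy(col_a) + column_entropy(col_b) - column_entropy(joint)


def label_sites(alignment, entropy_max=0.5, mi_min=0.5):
    """Label each site as conserved, interdependent, or variable.

    The thresholds are illustrative assumptions, not significance tests.
    """
    columns = list(zip(*alignment))
    labels = []
    for i, col in enumerate(columns):
        # Only the immediate neighbors are checked, as a stand-in for a local neighborhood.
        neighbors = [j for j in (i - 1, i + 1) if 0 <= j < len(columns)]
        if column_entropy(col) <= entropy_max:
            labels.append("conserved")
        elif any(mutual_information(col, columns[j]) >= mi_min for j in neighbors):
            labels.append("interdependent")
        else:
            labels.append("variable")
    return labels


def segment(labels):
    """Merge runs of identically labeled sites into (start, end, label) regions."""
    regions = []
    start = 0
    for label, run in groupby(labels):
        length = len(list(run))
        regions.append((start, start + length - 1, label))
        start += length
    return regions


# Toy alignment: four sequences, five aligned sites (made-up data for illustration).
alignment = ["ACDEF", "ACEDF", "ACDEG", "ACEDG"]
regions = segment(label_sites(alignment))
print(regions)
# [(0, 1, 'conserved'), (2, 3, 'interdependent'), (4, 4, 'variable')]
```

In the toy alignment, sites 0 and 1 are invariant, sites 2 and 3 vary but always change together, and site 4 varies independently of its neighbor. With so few sequences, mutual information is easily inflated by chance, which is why a real analysis would rely on a significance test rather than a fixed cutoff.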