Acquisition of rule-based knowledge for analyzing DNA-binding sites in proteins

  • Authors:
  • Shinn-Jang Ho;Chia-Yun Chang;Liang-Tsung Huang;Shiow-Fen Hwang;Shinn-Ying Ho

  • Affiliations:
  • Formosa University, Yunlin, Taiwan;National Chiao Tung Univ., Hsinchu, Taiwan;Feng Chia University, Taichung, Taiwan;Feng Chia University, Taichung, Taiwan;National Chiao Tung Univ., Hsinchu, Taiwan

  • Venue:
  • Proceedings of the 2nd international conference on Scalable information systems
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

This study aims to analyze DNA-binding proteins via acquisition of interpretable knowledge which can accurately predict binding sites in proteins to understand DNA-protein recognition mechanism. For mining accurate and interpretable knowledge, a large-scale dataset consisting of 982 DNA-binding proteins is constructed. This study investigates a novel feature set consisting of 11 features, including solvent accessibility, secondary structure, charge information near the residue, amino acid group and neighbor property. The derived binding and non-binding rules reveal that besides the well-known solvent accessibility, the electric charge distribution near the residue and the amino acid groups also play important roles in prediction of binding sites. The interpretable and accurate knowledge is helpful for biologist to analyze DNA-binding proteins.