Improving prediction of zinc binding sites by modeling the linkage between residues close in sequence

  • Authors:
  • Sauro Menchetti;Andrea Passerini;Paolo Frasconi;Claudia Andreini;Antonio Rosato

  • Affiliations:
  • Machine Learning and Neural Networks Group, Dipartimento di Sistemi e Informatica, Università degli Studi di Firenze, Italy;Machine Learning and Neural Networks Group, Dipartimento di Sistemi e Informatica, Università degli Studi di Firenze, Italy;Machine Learning and Neural Networks Group, Dipartimento di Sistemi e Informatica, Università degli Studi di Firenze, Italy;Magnetic Resonance Center (CERM), Dipartimento di Chimica, Università degli Studi di Firenze, Italy;Magnetic Resonance Center (CERM), Dipartimento di Chimica, Università degli Studi di Firenze, Italy

  • Venue:
  • RECOMB'06 Proceedings of the 10th annual international conference on Research in Computational Molecular Biology
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

We describe and empirically evaluate machine learning methods for the prediction of zinc binding sites from protein sequences. We start by observing that a data set consisting of single residues as examples is affected by autocorrelation and we propose an ad-hoc remedy in which sequentially close pairs of candidate residues are classified as being jointly involved in the coordination of a zinc ion. We develop a kernel for this particular type of data that can handle variable length gaps between candidate coordinating residues. Our empirical evaluation on a data set of non redundant protein chains shows that explicit modeling the correlation between residues close in sequence allows us to gain a significant improvement in the prediction performance.