MMRF for Proteome annotation applied to human protein disease prediction

  • Authors:
  • Beatriz García-Jiménez;Agapito Ledezma;Araceli Sanchis

  • Affiliations:
  • Universidad Carlos III de Madrid, Leganés, Madrid, Spain;Universidad Carlos III de Madrid, Leganés, Madrid, Spain;Universidad Carlos III de Madrid, Leganés, Madrid, Spain

  • Venue:
  • ILP'10 Proceedings of the 20th international conference on Inductive logic programming
  • Year:
  • 2010

Abstract

Knowing the biological processes in which each gene and protein participates is essential for designing disease treatments. These annotations are still unknown for many genes and proteins. Since deriving annotations from in vivo experiments is costly, computational predictors are needed for different kinds of annotation, such as metabolic pathway, interaction network, protein family, tissue, and disease. Biological data have an intrinsic relational structure: genes and proteins can be grouped by many criteria. This structure makes it hard to find good hypotheses when an attribute-value representation is used. Hence, we propose the generic Modular Multi-Relational Framework (MMRF) to predict different kinds of gene and protein annotation using Relational Data Mining (RDM). Applying MMRF to annotate human proteins with diseases confirms that group knowledge (mainly protein-protein interaction pairs) improves the prediction, in particular doubling the area under the precision-recall curve.