DEEPER: a full parsing based approach to protein relation extraction

  • Authors:
  • Timur Fayruzov;Martine De Cock;Chris Cornelis;Véronique Hoste

  • Affiliations:
  • Department of Applied Mathematics and Computer Science, Ghent University Association, Gent, Belgium;Department of Applied Mathematics and Computer Science, Ghent University Association, Gent, Belgium;Department of Applied Mathematics and Computer Science, Ghent University Association, Gent, Belgium;School of Translation Studies, Ghent University Association, Gent, Belgium

  • Venue:
  • EvoBIO'08 Proceedings of the 6th European conference on Evolutionary computation, machine learning and data mining in bioinformatics
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Lexical variance in biomedical texts poses a challenge to automatic protein relation mining. We therefore propose a new approach that relies only on more general language structures such as parsing and dependency information for the construction of feature vectors that can be used by standard machine learning algorithms in deciding whether a sentence describes a protein interaction or not. As our approach is not dependent on the use of specific interaction keywords, it is applicable to heterogeneous corpora. Evaluation on benchmark datasets shows that our method is competitive with existing state-of-the-art algorithms for the extraction of protein interactions.