Learning to Learn Biological Relations from a Small Training Set

  • Authors:
  • Laura Alonso I Alemany;Santiago Bruno

  • Affiliations:
  • NLP Group Facultad de Matemática Astronomía y Física (FaMAF), UNC, Córdoba, Argentina;NLP Group Facultad de Matemática Astronomía y Física (FaMAF), UNC, Córdoba, Argentina

  • Venue:
  • CICLing '09 Proceedings of the 10th International Conference on Computational Linguistics and Intelligent Text Processing
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we present different ways to improve a basic machine learning approach to identify relations between biological named entities as annotated in the Genia corpus. The main difficulty with learning from the Genia event-annotated corpus is the small amount of examples that are available for each relation type. We compare different ways to address the data sparseness problem: using the corpus as the initial seed of a bootstrapping procedure, generalizing classes of relations via the Genia ontology and generalizing classes via clustering.