Building frame-based corpus on the basis of ontological domain knowledge

  • Authors:
  • He Tan;Rajaram Kaliyaperumal;Nirupama Benis

  • Affiliations:
  • Institutionen för datavetenskap, Linköpings universitet, Sweden;Institutionen för medicinsk teknik, Linköpings universitet, Sweden;Institutionen för medicinsk teknik, Linköpings universitet, Sweden

  • Venue:
  • BioNLP '11 Proceedings of BioNLP 2011 Workshop
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Semantic Role Labeling (SRL) plays a key role in many NLP applications. The development of SRL systems for the biomedical domain is frustrated by the lack of large domain-specific corpora that are labeled with semantic roles. Corpus development has been very expensive and time-consuming. In this paper we propose a method for building frame-based corpus on the basis of domain knowledge provided by ontologies. We believe that ontologies, as a structured and semantic representation of domain knowledge, can instruct and ease the tasks in building the corpora. In the paper we present a corpus built by using the method. We compared it to BioFrameNet, and examined the gaps between the semantic classification of the target words in the domain-specific corpus and in FrameNet and Prop-Bank/VerbNet.