Score-Based approach for anaphora resolution in drug-drug interactions documents

  • Authors:
  • Isabel Segura-Bedmar;Mario Crespo;Cesar de Pablo-Sánchez

  • Affiliations:
  • Computer Science Department, Universidad Carlos III de Madrid, Leganés, Madrid, Spain;Computer Science Department, Universidad Carlos III de Madrid, Leganés, Madrid, Spain;Computer Science Department, Universidad Carlos III de Madrid, Leganés, Madrid, Spain

  • Venue:
  • NLDB'09 Proceedings of the 14th international conference on Applications of Natural Language to Information Systems
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Drug-drug interactions are frequently reported in biomedical literature and Information Extraction (IE) techniques have been devised as a useful instrument for managing this knowledge. Nevertheless, IE at the sentence level has a limited effect because there are frequent references to previous entities in the discourse, a phenomenon known as ‘anaphora'. The problem of resolving pronominal and nominal anaphora to improve a system that detects drug interactions is addressed in this paper. To our knowledge, this is the first research article that tackles this issue. A corpus and a system for the evaluation of drug anaphora resolution have been developed and an analysis of the phenomena is also included. The system uses a domain-specific syntactic and semantic parser, UMLS Metamap Transfer (MMTx) [1], to select anaphoric expressions and candidate references. It is shown that a combination of the domain-specific syntax and semantic information with generic heuristics can be leveraged to produce good results comparable to other related domains. Furthermore, the analysis of the errors suggests that the use of additional semantic knowledge is needed to improve results and deal with this linguistic phenomenon in this particular domain.