Exploring domain differences for the design of pronoun resolution systems for biomedical text

  • Authors: Ngan L. T. Nguyen; Jin-Dong Kim

  • Affiliations: University of Tokyo, Tokyo, Japan; University of Tokyo, Tokyo, Japan

  • Venue: COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
  • Year: 2008

Abstract

Much research effort has gone into the problem of anaphora resolution, or pronoun resolution, particularly for news texts. To selectively reuse previous work when solving the same problem in a new domain, we carried out a comparative study of three corpora: MUC and ACE for news texts, and GENIA for biomedical papers. Our corpus analysis and experimental results show significant differences in the use of pronouns between the two domains. By properly accounting for the characteristics of a domain, we can therefore improve the performance of pronoun resolution for that domain.