Terminological paraphrase extraction from scientific literature based on predicate argument tuples

  • Authors:
  • Sung-Pil Choi;Sung-Hyon Myaeng

  • Affiliations:
  • ;

  • Venue:
  • Journal of Information Science
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Terminological paraphrases (TPs) are sentences or phrases that express the concepts of terminologies in a different form. Here we propose an effective way to identify and extract TPs from large-scale scientific literature databases. We propose a novel method for effectively retrieving sentences that contain a given terminological concept based on semantic units called predicate-argument tuples. This method enables effective textual similarity computations and minimized errors based on six TP ranking models. For evaluation, we constructed an evaluation collection for the TP recognition task by extracting TPs from a target literature database using the proposed method. Through the two experiments, we learned that scientific literature contain many TPs that could not have been identified so far. Also, the experimental results showed the potential and extensibility of our proposed methods to extract the TPs.