Automatic paraphrase acquisition from news articles

  • Authors:
  • Yusuke Shinyama;Satoshi Sekine;Kiyoshi Sudo

  • Affiliations:
  • New York University, New York, NY;New York University, New York, NY;New York University, New York, NY

  • Venue:
  • HLT '02 Proceedings of the second international conference on Human Language Technology Research
  • Year:
  • 2002

Quantified Score

Hi-index 0.01

Visualization

Abstract

Paraphrases play an important role in the variety and complexity of natural language documents. However, they add to the difficulty of natural language processing. Here we describe a procedure for obtaining paraphrases from news articles. Articles derived from different newspapers can contain paraphrases if they report the same event on the same day. We exploit this feature by using Named Entity recognition. Our approach is based on the assumption that Named Entities are preserved across paraphrases. We applied our method to articles of two domains and obtained notable examples. Although this is our initial attempt at automatically extracting paraphrases from a corpus, the results are promising.