Annotation and data mining of the Penn Discourse TreeBank

  • Authors:
  • Rashmi Prasad;Eleni Miltsakaki;Aravind Joshi;Bonnie Webber

  • Affiliations:
  • University of Pennsylvania, Philadelphia, PA;University of Pennsylvania, Philadelphia, PA;University of Pennsylvania, Philadelphia, PA;University of Edinburgh, Edinburgh, Scotland

  • Venue:
  • DiscAnnotation '04 Proceedings of the 2004 ACL Workshop on Discourse Annotation
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

The Penn Discourse TreeBank (PDTB) is a new resource built on top of the Penn Wall Street Journal corpus, in which discourse connectives are annotated along with their arguments. Its use of standoff annotation allows integration with a stand-off version of the Penn TreeBank (syntactic structure) and PropBank (verbs and their arguments), which adds value for both linguistic discovery and discourse modeling. Here we describe the PDTB and some experiments in linguistic discovery based on the PDTB alone, as well as on the linked PTB and PDTB corpora.