Finding event, temporal and causal structure in text: a machine learning approach

  • Authors:
  • James H. Martin;Steven John Bethard

  • Affiliations:
  • University of Colorado at Boulder;University of Colorado at Boulder

  • Venue:
  • Finding event, temporal and causal structure in text: a machine learning approach
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Humans often describe their experiences through the event, temporal and causal structures they perceive. These structures are often expressed in textual forms, for example in timelines, where text is summarized by aligning events with the times at which they occurred. These same sorts of temporal-causal structures are also useful for a variety of computational tasks, like summarization and question answering. However, to reason over such structures they must first be extracted from their textual representations and organized into a machine readable form. This work demonstrates that various important parts of the event, temporal and causal structure of a text can be extracted automatically using machine learning methods. Events, which serve as the basic anchors of temporal and causal relations, can be extracted with F-measures in the 70s and 80s using a word-chunking approach. Temporal relations between adjacent events in some common syntactic constructions can be identified with almost 90% accuracy using pair-wise classification. Causal relations are much more difficult, but initial work suggests that even this task may become tractable to machine learning methods. Analyses of the various tasks lead to several conclusions about how best to approach the automatic extraction of temporal-causal structure. Tasks with little linguistic motivation had low agreement between humans and low machine learning model performance. Tasks with clear annotation guidelines based on known linguistic constructions had much higher inter-annotator agreement and much better model performance. Thus, future progress will depend on careful task selection guided by linguistic knowledge about how event, temporal and causal relations are expressed in text.