Incremental cue phrase learning and bootstrapping method for causality extraction using cue phrase and word pair probabilities

  • Authors:
  • Du-Seong Chang;Key-Sun Choi

  • Affiliations:
  • Spoken Language Research Team, KT, 17 Woomyeon-dong, Seocho-gu, Seoul 137-792, Republic of Korea;Division of Computer Science, Korea Advanced Institute of Science and Technology, BOLA, KORTERM, 373-1 Guseong-dong, Yuseong-gu, Daejeon 305-701, Republic of Korea

  • Venue:
  • Information Processing and Management: an International Journal
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

This work aims to extract possible causal relations that exist between noun phrases. Some causal relations are manifested by lexical patterns like causal verbs and their sub-categorization. We use lexical patterns as a filter to find causality candidates and we transfer the causality extraction problem to the binary classification. To solve the problem, we introduce probabilities for word pair and concept pair that could be part of causal noun phrase pairs. We also use the cue phrase probability that could be a causality pattern. These probabilities are learned from the raw corpus in an unsupervised manner. With this probabilistic model, we increase both precision and recall. Our causality extraction shows an F-score of 77.37%, which is an improvement of 21.14 percentage points over the baseline model. The long distance causal relation is extracted with the binary tree-styled cue phrase. We propose an incremental cue phrase learning method based on the cue phrase confidence score that was measured after each causal classifier learning step. A better recall of 15.37 percentage points is acquired after the cue phrase learning.