Discovering event evolution graphs from news corpora

  • Authors:
  • Christopher C. Yang;Xiaodong Shi;Chih-Ping Wei

  • Affiliations:
  • College of Information Science and Technology, Drexel University, Philadelphia, PA and Digital Library Laboratory, The Chinese University of Hong Kong, Shatin, Hong Kong;Digital Library Laboratory, The Chinese University of Hong Kong, Shatin, Hong Kong;Institute of Service Science, National Tsing Hua University, Hsinchu, Taiwan

  • Venue:
  • IEEE Transactions on Systems, Man, and Cybernetics, Part A: Systems and Humans
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Given the advance of Internet technologies, we can now easily extract hundreds or thousands of news stories of any ongoing incidents from newswires such as CNN.com, but the volume of information is too large for us to capture the blueprint. Information retrieval techniques such as Topic Detection and Tracking are able to organize news stories as events, in a flat hierarchical structure, within a topic. However, they are incapable of presenting the complex evolution relationships between the events. We are interested to learn not only what the major events are but also how they develop within the topic. It is beneficial to identify the seminal events, the intermediary and ending events, and the evolution of these events. In this paper, we propose to utilize the event timestamp, event content similarity, temporal proximity, and document distributional proximity to model the event evolution relationships between events in an incident. An event evolution graph is constructed to present the underlying structure of events for efficient browsing and extracting of information. Case study and experiments are presented to illustrate and show the performance of our proposed technique. It is found that our proposed technique outperforms the baseline technique and other comparable techniques in previous work.