Term committee based event identification within news topics

  • Authors:
  • Kuo Zhang;JuanZi Li;Gang Wu;KeHong Wang

  • Affiliations:
  • Tsinghua University, Beijing, China;Tsinghua University, Beijing, China;Tsinghua University, Beijing, China;Tsinghua University, Beijing, China

  • Venue:
  • PAKDD'08 Proceedings of the 12th Pacific-Asia conference on Advances in knowledge discovery and data mining
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Most previous research focus on organizing news set into flat collections of stories. However, a topic in news is more than a mere collection of stories: it is characterized by a definite structure of inter-related events. Stories within a topic usually share some terms which are related to the topic other than a specific event, so stories of different events are usually very similar to each other within a topic. To deal with this problem, we propose a new event identification method based on the term committee. We first capture some tight term clusters as term committees of potential events, and then use them to reweight the key terms in a story. The experimental results on two Linguistic Data Consortium (LDC) datasets show that the proposed method for event identification outperforms previous methods significantly.