A novel burst-based text representation model for scalable event detection

  • Authors:
  • Wayne Xin Zhao;Rishan Chen;Kai Fan;Hongfei Yan;Xiaoming Li

  • Affiliations:
  • Peking University, China;Peking University, China;Peking University, China;Beihang University, China;Peking University, China and Beihang University, China

  • Venue:
  • ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Short Papers - Volume 2
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Mining retrospective events from text streams has been an important research topic. Classic text representation model (i.e., vector space model) cannot model temporal aspects of documents. To address it, we proposed a novel burst-based text representation model, denoted as BurstVSM. BurstVSM corresponds dimensions to bursty features instead of terms, which can capture semantic and temporal information. Meanwhile, it significantly reduces the number of non-zero entries in the representation. We test it via scalable event detection, and experiments in a 10-year news archive show that our methods are both effective and efficient.