Discovering event episodes from news corpora: a temporal-based approach

  • Authors:
  • Chih-Ping Wei;Yen-Hsien Lee;Yu-Sheng Chiang;Jyun-Da Chen;Christopher C. C. Yang

  • Affiliations:
  • National Tsing Hua University, Hsinchu, Taiwan, R.O.C.;National Chiayi University, Chiayi, Taiwan, R.O.C.;IBM, Taiwan Taipei, Taiwan, R.O.C.;Industrial Tech. Research Institute, Hsinchu, Taiwan, R.O.C.;Drexel University, Philadelphia, PA

  • Venue:
  • Proceedings of the 11th International Conference on Electronic Commerce
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

When performing environmental scanning, organizations typically deal with a numerous of events and topics about their core business, relevant technique standards, competitors, and market, where each event or topic to monitor or track generally is associated with many news documents. To reduce information overload and information fatigues when monitoring or tracking such events, it is essential to develop an effective event episode discovery mechanism for organizing all news documents pertaining to an event of interest. In this study, we propose a new metric, referred to as TFxIDFTempo and develop a temporal-based event episode discovery technique that uses the proposed TFxIDFTempo metric as its feature selection method and document representation scheme. Using the traditional TFxIDF-based HAC technique as performance benchmarks, our empirical evaluation results suggest that the proposed temporal-based event episode discovery technique outperforms its benchmark in cluster recall and cluster precision.