News story clustering from both what and how aspects: using bag of word model and affinity propagation

  • Authors:
  • Wei-Ta Chu;Chao-Chin Huang;Wen-Fang Cheng

  • Affiliations:
  • National Chung Cheng University, Chiayi, Taiwan Roc;National Chung Cheng University, Chiayi, Taiwan Roc;Industrial Technology Research Institute, Hsinchu, Taiwan Roc

  • Venue:
  • AIEMPro '11 Proceedings of the 2011 ACM international workshop on Automated media analysis and production for novel TV services
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

The 24-hour news TV channels repeat the same news stories again and again. In this paper we cluster hundreds of news stories broadcasted in a day into dozens of clusters according to topics, and thus facilitate efficient browsing and summarization. The proposed system automatically removes commercial breaks and detects anchorpersons, and then determines boundaries of news stories. Semantic concepts, the bag of visual word model and the bag of trajectory model are used to describe what and how objects present in news stories. After measuring similarity between stories by the earth mover's distance, the affinity propagation algorithm is utilized to cluster stories of the same topic together. The experimental results show that with the proposed methods sophisticated news stories can be effectively clustered.