Topic word selection for blogs by topic richness using web search result clustering

  • Authors:
  • Jinhee Park;Sungwoo Lee;Hye-Wuk Jung;Jee-Hyong Lee

  • Affiliations:
  • Sungkyunkwan University;Sungkyunkwan University;Sungkyunkwan University;Sungkyunkwan University

  • Venue:
  • Proceedings of the 6th International Conference on Ubiquitous Information Management and Communication
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Blogs are one of popular services to publish and archive posts with personal opinions on the web. The topics of a blog can be used for classification, recommendation, opinion mining, and ranking, etc. In this paper, we propose a method for extracting important topic words from a blog. Our method selects topic words by measuring whether the blog includes rich content on the word. To measure the richness of a blog on candidate topic words, we compare web search results by the candidate words with the content of the blog. We used document clustering and cluster matching in order to compare them.