Topic Detection from Blog Documents Using Users' Interests

  • Authors:
  • Yuichiro Sekiguchi;Harumi Kawashima;Hidenori Okuda;Masahiro Oku

  • Affiliations:
  • NTT Corporation, Japan;NTT Corporation, Japan;NTT Corporation, Japan;NTT Corporation, Japan

  • Venue:
  • MDM '06 Proceedings of the 7th International Conference on Mobile Data Management
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we describe a method to detect topic words from blog documents. We define "topic words" as words frequently used by people who share the same interests. In this method, each blogger's interests are extracted from each blog site, and interest similarities between bloggers are calculated. Unusual words that are used by bloggers who have a high level of similarity are then extracted as topic words. We evaluated the precision of this method using blog documents, and the results show that the proposed method is superior (by 4.4 %) to the traditional TF-IDF method in terms of precision.