Subject-based extraction of a latent blog community

  • Authors:
  • Seok-Ho Yoon; Jung-Hwan Shin; Sang-Wook Kim; Sunju Park; Jae Bum Lee

  • Affiliations:
  • Seok-Ho Yoon, Jung-Hwan Shin, Sang-Wook Kim: Department of Electronics, Communication, and Computer Engineering, Hanyang University, 17 Haengdang-dong, Seongdong-gu, Seoul 133-791, Republic of Korea
  • Sunju Park: School of Business, Yonsei University, 262 Seongsanno, Seodaemun-gu, Seoul 120-749, Republic of Korea
  • Jae Bum Lee: NHN Corp., Venture Town Bldg., 25-1 Jeongja-dong, Bundang-gu, Seongnam City, Gyeonggi-do 463-844, Republic of Korea

  • Venue:
  • Information Sciences: an International Journal
  • Year:
  • 2012

Abstract

The blogosphere contains posts relevant to a particular subject and blogs that show interest in that subject. In this paper, we define the set of such posts and blogs as a blog community and propose a method for extracting the blog community associated with a particular subject. The method rests on two observations: blogs that have performed actions (e.g., read, comment, trackback, scrap) on posts about a subject are the ones interested in that subject, and posts that have received actions from such blogs are the ones that address the subject. The method starts with a small number of manually selected seed posts on the subject. It then selects the blogs whose actions on the seed posts exceed a threshold, followed by the posts whose received actions exceed a threshold. Repeating these two steps gradually expands the blog community. The paper presents several techniques for improving the accuracy of the method, and the experimental results show that it achieves higher accuracy than methods proposed in prior research. The paper also discusses business applications of the extracted community, such as target marketing, market monitoring, improving search results, finding power bloggers, and revitalizing the blogosphere.
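
The two alternating selection steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `extract_community`, the representation of actions as `(blog, post)` pairs, the count-based thresholds (applied with `>=`), and the stopping rule are all assumptions made for illustration, and the accuracy-improving techniques mentioned in the abstract are not modelled.

```python
from collections import Counter


def extract_community(actions, seed_posts, blog_threshold=2,
                      post_threshold=2, max_rounds=10):
    """Expand a (blogs, posts) community from seed posts.

    actions: list of (blog, post) pairs, one per action (read, comment, ...).
    All thresholds are hypothetical action counts, not values from the paper.
    """
    posts = set(seed_posts)
    blogs = set()
    for _ in range(max_rounds):
        # Step 1: keep blogs that acted on community posts often enough.
        blog_counts = Counter(b for b, p in actions if p in posts)
        new_blogs = blogs | {b for b, c in blog_counts.items()
                             if c >= blog_threshold}
        # Step 2: keep posts that received enough actions from those blogs.
        post_counts = Counter(p for b, p in actions if b in new_blogs)
        new_posts = posts | {p for p, c in post_counts.items()
                             if c >= post_threshold}
        # Assumed stopping rule: stop when neither set grows.
        if new_blogs == blogs and new_posts == posts:
            break
        blogs, posts = new_blogs, new_posts
    return blogs, posts


# Toy data (invented): b1 and b2 act on both seed posts and on p3;
# b3 touches p3 and p4 only once each.
example_actions = [
    ("b1", "p1"), ("b1", "p2"), ("b2", "p1"), ("b2", "p2"),
    ("b1", "p3"), ("b2", "p3"), ("b3", "p3"), ("b3", "p4"),
]
community_blogs, community_posts = extract_community(
    example_actions, seed_posts={"p1", "p2"})
```

With this toy data, `p3` joins the community because both selected blogs acted on it, while `b3` and `p4` stay out because they fall below the assumed thresholds.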