Finding Key Bloggers, One Post At A Time

  • Authors:
  • Wouter Weerkamp;Krisztian Balog;Maarten de Rijke

  • Affiliations:
  • ISLA, University of Amsterdam, The Netherlands. Email: weerkamp,kbalog,mdr@science.uva.nl;ISLA, University of Amsterdam, The Netherlands. Email: weerkamp,kbalog,mdr@science.uva.nl;ISLA, University of Amsterdam, The Netherlands. Email: weerkamp,kbalog,mdr@science.uva.nl

  • Venue:
  • Proceedings of the 2008 conference on ECAI 2008: 18th European Conference on Artificial Intelligence
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

User generated content in general, and blogs in particular, form an interesting and relatively little explored domain for mining knowledge. We address the task of blog distillation: to find blogs that are principally devoted to a given topic, as opposed to blogs that merely happen to discuss the topic in passing. Working in the setting of statistical language modeling, we model the task by aggregating a blogger's blog posts to collect evidence of relevance to the topic and persistence of interest in the topic. This approach achieves state-of-the-art performance. On top of this baseline, we extend our model by incorporating a number of blog-specific features, concerning document structure, social structure, and temporal structure. These blog-specific features yield further improvements.