Key blog distillation: ranking aggregates

Authors:
Craig Macdonald;Iadh Ounis
Affiliations:
University of Glasgow, Glasgow, United Kingdom;University of Glasgow, Glasgow, United Kingdom
Venue:
Proceedings of the 17th ACM conference on Information and knowledge management
Year:
2008

Citing 10
Cited 19

Pivoted document length normalization

SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
Improving two-stage ad-hoc retrieval for short queries

Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Information Retrieval

Information Retrieval
Simple BM25 extension to multiple weighted fields

Proceedings of the thirteenth ACM international conference on Information and knowledge management
Introduction to Data Mining, (First Edition)

Introduction to Data Mining, (First Edition)
Formal models for expert finding in enterprise corpora

SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Voting for candidates: adapting data fusion techniques for an expert search task

CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
Combining fields for query expansion and adaptive query expansion

Information Processing and Management: an International Journal
Expertise drift and query expansion in expert search

Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
A study of blog search

ECIR'06 Proceedings of the 28th European conference on Advances in Information Retrieval

A two-stage model for blog feed search

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Blog track research at TREC

ACM SIGIR Forum
Improving web search relevance and freshness with content previews

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Quantifying sentiment and influence in blogspaces

Proceedings of the First Workshop on Social Media Analytics
Relevance stability in blog retrieval

Proceedings of the 2011 ACM Symposium on Applied Computing
TEMPER: a temporal relevance feedback method

ECIR'11 Proceedings of the 33rd European conference on Advances in information retrieval
Learning models for ranking aggregates

ECIR'11 Proceedings of the 33rd European conference on Advances in information retrieval
Blog feed search with a post index

Information Retrieval
Find me opinion sources in blogosphere: a unified framework for opinionated blog feed retrieval

Proceedings of the fifth ACM international conference on Web search and data mining
Linguistic aggregation methods in blog retrieval

Information Processing and Management: an International Journal
Utilizing local evidence for blog feed search

Information Retrieval
Employing document dependency in blog search

Journal of the American Society for Information Science and Technology
Expertise Retrieval

Foundations and Trends in Information Retrieval
Information Retrieval on the Blogosphere

Foundations and Trends in Information Retrieval
A data-centric approach to feed search in blogs

International Journal of Web Engineering and Technology
Diversity in blog feed retrieval

Proceedings of the 21st ACM international conference on Information and knowledge management
Generalizing diversity detection in blog feed retrieval

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Quality biased thread retrieval using the voting model

Proceedings of the 18th Australasian Document Computing Symposium
Feature identification for topical relevance assessment in feed search engines

Intelligent Data Analysis

Quantified Score

Hi-index	0.00

Visualization

Abstract

Searchers on the blogosphere often have a need to identify other key bloggers with similar interests to their own. However, a main difference of this blog distillation task from normal adhoc or Web document retrieval is that each blog can be seen as an aggregate of its constituent posts. On the other hand, we show that the task is similar to the expert search task, where a person's expertise is derived from the aggregate of their publications or emails. In this paper, we investigate several aspects of blog retrieval: Firstly, we experiment whether a blog should be represented as a whole unit, or as by considering each of its posts as indicators of its relevance, showing that expert search techniques can be adapted for blog search; Secondly, we examine whether indexing only the XML feed provided by each blog (and which is often incomplete) is sufficient, or whether the full-text of each blog post should be downloaded; Lastly, we use approaches to detect the central or recurring interests of each blog to increase the retrieval effectiveness of the system. Using the TREC 2007 Blog dataset, the results show that our proposed expert search paradigm is indeed useful in identifying key bloggers, achieving high retrieval effectiveness.