Retrieval and feedback models for blog feed search

Authors:
Jonathan L. Elsas;Jaime Arguello;Jamie Callan;Jaime G. Carbonell
Affiliations:
Carnegie Mellon University, Pittsburgh, PA, USA;Carnegie Mellon University, Pittsburgh, PA, USA;Carnegie Mellon University, Pittsburgh, PA, USA;Carnegie Mellon University, Pittsburgh, PA, USA
Venue:
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Year:
2008

Citing 7
Cited 64

The anatomy of a large-scale hypertextual Web search engine

WWW7 Proceedings of the seventh international conference on World Wide Web 7
Relevance based language models

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Relevant document distribution estimation method for resource selection

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
A study of smoothing methods for language models applied to information retrieval

ACM Transactions on Information Systems (TOIS)
A Markov random field model for term dependencies

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Improving the estimation of relevance models using large external corpora

SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Retrieval and feedback models for blog feed search

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval

Retrieval and feedback models for blog feed search

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Blog site search using resource selection

Proceedings of the 17th ACM conference on Information and knowledge management
Adaptive subjective triggers for opinionated document retrieval

Proceedings of the Second ACM International Conference on Web Search and Data Mining
Using Contextual Information to Improve Search in Email Archives

ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
Query dependent pseudo-relevance feedback based on wikipedia

Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Blog distillation using random walks

Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
It pays to be picky: an evaluation of thread retrieval in online forums

Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Image tag clarity: in search of visual-representative tags for social images

WSM '09 Proceedings of the first SIGMM workshop on Social media
Social reader: following social networks in the wilds of the blogosphere

WSM '09 Proceedings of the first SIGMM workshop on Social media
Facet-based opinion retrieval from blogs

Information Processing and Management: an International Journal
Experimental Results on the Aggregation Methods in Blog Distillation

WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 03
Identifying Influential Bloggers: Time Does Matter

WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
Beyond hyperlinks: organizing information footprints in search logs to support effective browsing

Proceedings of the 18th ACM conference on Information and knowledge management
What makes categories difficult to classify?: a study on predicting classification performance for categories

Proceedings of the 18th ACM conference on Information and knowledge management
Online community search using thread structure

Proceedings of the 18th ACM conference on Information and knowledge management
An improved feedback approach using relevant local posts for blog feed retrieval

Proceedings of the 18th ACM conference on Information and knowledge management
A generative blog post retrieval model that uses query expansion based on external collections

ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2 - Volume 2
Effectiveness of Aggregation Methods in Blog Distillation

FQAS '09 Proceedings of the 8th International Conference on Flexible Query Answering Systems
A brief survey of computational approaches in social computing

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
A two-stage model for blog feed search

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Learning hidden variable models for blog retrieval

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Investigation on smoothing and aggregation methods in blog retrieval

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Blog track research at TREC

ACM SIGIR Forum
Tagging and linking web forum posts

CoNLL '10 Proceedings of the Fourteenth Conference on Computational Natural Language Learning
Using the past to score the present: extending term weighting models through revision history analysis

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Improving web search relevance and freshness with content previews

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
iAVATAR: an interactive tool for finding and visualizing visual-representative tags in image search

Proceedings of the VLDB Endowment
Identifying influential bloggers using blogs semantics

Proceedings of the 8th International Conference on Frontiers of Information Technology
A probabilistic model for opinionated blog feed retrieval

Proceedings of the 20th international conference companion on World wide web
Federated Search

Foundations and Trends in Information Retrieval
Relevance stability in blog retrieval

Proceedings of the 2011 ACM Symposium on Applied Computing
TEMPER: a temporal relevance feedback method

ECIR'11 Proceedings of the 33rd European conference on Advances in information retrieval
Learning models for ranking aggregates

ECIR'11 Proceedings of the 33rd European conference on Advances in information retrieval
Time-based relevance models

Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
A multi-collection latent topic model for federated search

Information Retrieval
Blog feed search with a post index

Information Retrieval
External query reformulation for text-based image retrieval

SPIRE'11 Proceedings of the 18th international conference on String processing and information retrieval
TEXplorer: keyword-based object search and exploration in multidimensional text databases

Proceedings of the 20th ACM international conference on Information and knowledge management
On relevance, time and query expansion

Proceedings of the 20th ACM international conference on Information and knowledge management
Online community search using conversational structures

Information Retrieval
Learning to rank with multi-aspect relevance for vertical search

Proceedings of the fifth ACM international conference on Web search and data mining
Find me opinion sources in blogosphere: a unified framework for opinionated blog feed retrieval

Proceedings of the fifth ACM international conference on Web search and data mining
Blog opinion retrieval based on topic-opinion mixture model

PAKDD'10 Proceedings of the 14th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part II
Linguistic aggregation methods in blog retrieval

Information Processing and Management: an International Journal
Utilizing local evidence for blog feed search

Information Retrieval
Employing document dependency in blog search

Journal of the American Society for Information Science and Technology
To what problem is distributed information retrieval the solution?

Journal of the American Society for Information Science and Technology
Information Retrieval on the Blogosphere

Foundations and Trends in Information Retrieval
A data-centric approach to feed search in blogs

International Journal of Web Engineering and Technology
Exploiting External Collections for Query Expansion

ACM Transactions on the Web (TWEB)
Diversity in blog feed retrieval

Proceedings of the 21st ACM international conference on Information and knowledge management
Hierarchical target type identification for entity-oriented queries

Proceedings of the 21st ACM international conference on Information and knowledge management
Ranking distributed knowledge repositories

TPDL'12 Proceedings of the Second international conference on Theory and Practice of Digital Libraries
Robust query rewriting using anchor data

Proceedings of the sixth ACM international conference on Web search and data mining
Distributed information retrieval and applications

ECIR'13 Proceedings of the 35th European conference on Advances in Information Retrieval
Exploiting Forum Thread Structures to Improve Thread Clustering

Proceedings of the 2013 Conference on the Theory of Information Retrieval
Generalizing diversity detection in blog feed retrieval

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Retrieving opinions from discussion forums

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Constructing query-specific knowledge bases

Proceedings of the 2013 workshop on Automated knowledge base construction
Leveraging related entities for knowledge base acceleration

Proceedings of the 4th international workshop on Web-scale knowledge representation retrieval and reasoning
Collaborative pseudo-relevance feedback

Expert Systems with Applications: An International Journal
Finding keywords in blogs: Efficient keyword extraction in blog mining via user behaviors

Expert Systems with Applications: An International Journal
Feature identification for topical relevance assessment in feed search engines

Intelligent Data Analysis
Social reader: towards browsing the social web

Multimedia Tools and Applications

Quantified Score

Hi-index	0.00

Visualization

Abstract

Blog feed search poses different and interesting challenges from traditional ad hoc document retrieval. The units of retrieval, the blogs, are collections of documents, the blog posts. In this work we adapt a state-of-the-art federated search model to the feed retrieval task, showing a significant improvement over algorithms based on the best performing submissions in the TREC 2007 Blog Distillation task[12]. We also show that typical query expansion techniques such as pseudo-relevance feedback using the blog corpus do not provide any significant performance improvement and in many cases dramatically hurt performance. We perform an in-depth analysis of the behavior of pseudo-relevance feedback for this task and develop a novel query expansion technique using the link structure in Wikipedia. This query expansion technique provides significant and consistent performance improvements for this task, yielding a 22% and 14% improvement in MAP over the unexpanded query for our baseline and federated algorithms respectively.