From machu_picchu to "rafting the urubamba river": anticipating information needs via the entity-query graph

Authors:
Ilaria Bordino;Gianmarco De Francisci Morales;Ingmar Weber;Francesco Bonchi
Affiliations:
Yahoo! Research, Barcelona, Spain;Yahoo! Research, Barcelona, Spain;Qatar Computing Research Institute, Doha, Qatar;Yahoo! Research, Barcelona, Spain
Venue:
Proceedings of the sixth ACM international conference on Web search and data mining
Year:
2013

Abstract

We study the problem of anticipating user search needs based on browsing activity. Given the web page p that a user is currently visiting, we want to recommend a small and diverse set of search queries that are relevant to the content of p, but also non-obvious and serendipitous. We introduce a novel method that is based on the content of the page visited, rather than on past browsing patterns as in previous literature. This content-based approach can be used even for previously unseen pages. We represent the topics of a page by the set of Wikipedia entities extracted from it. To obtain useful query suggestions for these entities, we exploit a novel graph model that we call EQGraph (Entity-Query Graph), which contains entities, queries, and transitions between entities, between queries, and from entities to queries. We run Personalized PageRank on this graph to expand the set of entities extracted from a page into a richer set of entities, and to associate these entities with relevant query suggestions. We develop an efficient implementation that handles large graph instances and suggests queries from a large and diverse pool. A user study shows that our method produces relevant and interesting recommendations, and that it outperforms an alternative method based on reverse IR.
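
To make the core idea of the abstract concrete, the following is a minimal sketch of Personalized PageRank over a toy entity-query graph. It is not the authors' implementation: the graph contents, the `e:`/`q:` node prefixes, the uniform edge weights, the damping value `alpha=0.85`, and the helper names `personalized_pagerank` and `suggest_queries` are all assumptions made for illustration. The random walk restarts at the entities extracted from the page, and the highest-scoring query nodes are returned as suggestions.

```python
def personalized_pagerank(edges, seeds, alpha=0.85, iters=50):
    """Power iteration for Personalized PageRank.

    edges: dict mapping node -> {neighbor: weight} (directed, weighted)
    seeds: dict mapping node -> restart weight (the page's entities)
    """
    nodes = set(edges) | set(seeds)
    for nbrs in edges.values():
        nodes.update(nbrs)
    total = sum(seeds.values())
    restart = {n: seeds.get(n, 0.0) / total for n in nodes}
    rank = dict(restart)
    for _ in range(iters):
        nxt = {n: (1 - alpha) * restart[n] for n in nodes}
        for u in nodes:
            out = edges.get(u, {})
            out_weight = sum(out.values())
            if out_weight == 0:
                # Dangling node: send its mass back to the restart distribution.
                for n in nodes:
                    nxt[n] += alpha * rank[u] * restart[n]
            else:
                for v, w in out.items():
                    nxt[v] += alpha * rank[u] * w / out_weight
        rank = nxt
    return rank


# Hypothetical toy graph: entity->entity, entity->query, query->query edges.
EQ_GRAPH = {
    "e:machu_picchu": {"e:urubamba_river": 1.0, "e:cusco": 1.0,
                       "q:machu picchu tickets": 1.0},
    "e:urubamba_river": {"e:machu_picchu": 1.0,
                         "q:rafting the urubamba river": 1.0},
    "e:cusco": {"q:cusco hotels": 1.0},
    "q:machu picchu tickets": {"q:inca trail permits": 1.0},
}


def suggest_queries(graph, page_entities, k=2):
    """Rank query nodes by their score when restarting at the page's entities."""
    seeds = {e: 1.0 for e in page_entities}
    rank = personalized_pagerank(graph, seeds)
    queries = [n for n in rank if n.startswith("q:")]
    return sorted(queries, key=lambda q: rank[q], reverse=True)[:k]


if __name__ == "__main__":
    for q in suggest_queries(EQ_GRAPH, ["e:machu_picchu"], k=3):
        print(q)
```

In this sketch, a query such as "rafting the urubamba river" can receive score even though it is not linked directly to the seed entity, because the walk first moves to a related entity. That is the sense in which entity-to-entity transitions expand the page's entity set before queries are attached.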