Query expansion using random walk models

Authors:
Kevyn Collins-Thompson;Jamie Callan
Affiliations:
Carnegie Mellon University, Pittsburgh, PA;Carnegie Mellon University, Pittsburgh, PA
Venue:
Proceedings of the 14th ACM international conference on Information and knowledge management
Year:
2005

Citing 21
Cited 55

On the use of spreading activation methods in automatic information

SIGIR '88 Proceedings of the 11th annual international ACM SIGIR conference on Research and development in information retrieval
Query expansion using lexical-semantic relations

SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
Word sense disambiguation for large text databases

Word sense disambiguation for large text databases
Query expansion using local and global document analysis

SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
Application of Spreading Activation Techniques in InformationRetrieval

Artificial Intelligence Review
The impact of query structure and query expansion on retrieval performance

Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
A language modeling approach to information retrieval

Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
The anatomy of a large-scale hypertextual Web search engine

WWW7 Proceedings of the seventh international conference on World Wide Web 7
Applications of linear algebra in information retrieval and hypertext analysis

PODS '99 Proceedings of the eighteenth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
The Association Factor in Information Retrieval

Journal of the ACM (JACM)
Semantic Clustering of Index Terms

Journal of the ACM (JACM)
Authoritative sources in a hyperlinked environment

Journal of the ACM (JACM)
A language modeling approach to information retrieval

A language modeling approach to information retrieval
Document language models, query models, and risk minimization for information retrieval

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Information Retrieval

Information Retrieval
Latent Semantic Kernels

Journal of Intelligent Information Systems
Vector space model of information retrieval: a reevaluation

SIGIR '84 Proceedings of the 7th annual international ACM SIGIR conference on Research and development in information retrieval
Evaluating high accuracy retrieval techniques

Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Why current IR engines fail

Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Learning random walk models for inducing word dependency distributions

ICML '04 Proceedings of the twenty-first international conference on Machine learning
A generative theory of relevance

A generative theory of relevance

Contextual search and name disambiguation in email using graphs

SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Constructing better document and query models with markov chains

CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
Representing documents with named entities for story link detection (SLD)

CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
Introduction to special issue on reasoning in natural language information processing

ACM Transactions on Asian Language Information Processing (TALIP)
Latent concept expansion using markov random fields

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Exploiting underrepresented query aspects for automatic query expansion

Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Searching ontologies based on content: experiments in the biomedical domain

Proceedings of the 4th international conference on Knowledge capture
SLOQUE: slot-based query expansion for complex questions

Proceedings of the ACM first workshop on CyberInfrastructure: information management in eScience
Extending query translation to cross-language query expansion with markov chain models

Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Learning to rank typed graph walks: local and global approaches

Proceedings of the 9th WebKDD and 1st SNA-KDD 2007 workshop on Web mining and social network analysis
Discovering key concepts in verbose queries

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
The query-flow graph: model and applications

Proceedings of the 17th ACM conference on Information and knowledge management
Modeling multi-step relevance propagation for expert finding

Proceedings of the 17th ACM conference on Information and knowledge management
Wikipedia pages as entry points for book search

Proceedings of the Second ACM International Conference on Web Search and Data Mining
Statistical Language Models for Information Retrieval A Critical Review

Foundations and Trends in Information Retrieval
Query Expansion Using External Evidence

ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
Effective query expansion for federated search

Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Learning graph walk based similarity measures for parsed text

EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Hierarchical location and topic based query expansion

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
Query-URL bipartite based approach to personalized query recommendation

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
Users, Queries and Documents: A Unified Representation for Web Mining

WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
Reducing the risk of query expansion via robust constrained optimization

Proceedings of the 18th ACM conference on Information and knowledge management
A graphical framework for contextual search and name disambiguation in email

TextGraphs-1 Proceedings of the First Workshop on Graph Based Methods for Natural Language Processing
Enhancing Web Search by Aggregating Results of Related Web Queries

WISE '09 Proceedings of the 10th International Conference on Web Information Systems Engineering
Random walks for text semantic similarity

TextGraphs-4 Proceedings of the 2009 Workshop on Graph-based Methods for Natural Language Processing
Finding Related Search Engine Queries by Web Community Based Query Enrichment

World Wide Web
Use of topicality and information measures to improve document representation for story link detection

ECIR'07 Proceedings of the 29th European conference on IR research
Learning to efficiently rank

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Multi-style language model for web scale information retrieval

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Multilingual PRF: english lends a helping hand

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Aspect presence verification conditional on other aspects

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
PageRank without hyperlinks: Structural reranking using links induced by language models

ACM Transactions on Information Systems (TOIS)
Multilingual pseudo-relevance feedback: performance study of assisting languages

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
A unified optimization framework for robust pseudo-relevance feedback algorithms

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Generating advertising keywords from video content

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Improving graph-walk-based similarity with reranking: Case studies for personal information management

ACM Transactions on Information Systems (TOIS)
Using Markov chains to exploit word relationships in information retrieval

Large Scale Semantic Access to Content (Text, Image, Video, and Sound)
A unified representation of web logs for mining applications

Information Retrieval
Social annotation in query expansion: a machine learning approach

Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Query expansion in folksonomies

SAMT'10 Proceedings of the 5th international conference on Semantic and digital media technologies
Improving retrieval accuracy of difficult queries through generalizing negative document language models

Proceedings of the 20th ACM international conference on Information and knowledge management
A Survey of Automatic Query Expansion in Information Retrieval

ACM Computing Surveys (CSUR)
Query suggestion by constructing term-transition graphs

Proceedings of the fifth ACM international conference on Web search and data mining
Tapping into knowledge base for concept feedback: leveraging conceptnet to improve search results for difficult queries

Proceedings of the fifth ACM international conference on Web search and data mining
An efficient framework for constructing generalized locally-induced text metrics

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Two
Natural language technology and query expansion: issues, state-of-the-art and perspectives

Journal of Intelligent Information Systems
Web query disambiguation using PageRank

Journal of the American Society for Information Science and Technology
Thesaurus-based feedback to support mixed search and browsing environments

ECDL'07 Proceedings of the 11th European conference on Research and Advanced Technology for Digital Libraries
Computing text semantic relatedness using the contents and links of a hypertext encyclopedia

Artificial Intelligence
Experiments on pseudo relevance feedback using graph random walks

SPIRE'12 Proceedings of the 19th international conference on String Processing and Information Retrieval
An efficient approach to suggesting topically related web queries using hidden topic model

World Wide Web
Query expansion using path-constrained random walks

Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
Compact query term selection using topically related text

Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
QUBiC: An adaptive approach to query-based recommendation

Journal of Intelligent Information Systems
A novel neighborhood based document smoothing model for information retrieval

Information Retrieval

Quantified Score

Hi-index	0.00

Visualization

Abstract

It has long been recognized that capturing term relationships is an important aspect of information retrieval. Even with large amounts of data, we usually only have significant evidence for a fraction of all potential term pairs. It is therefore important to consider whether multiple sources of evidence may be combined to predict term relations more accurately. This is particularly important when trying to predict the probability of relevance of a set of terms given a query, which may involve both lexical and semantic relations between the terms.We describe a Markov chain framework that combines multiple sources of knowledge on term associations. The stationary distribution of the model is used to obtain probability estimates that a potential expansion term reflects aspects of the original query. We use this model for query expansion and evaluate the effectiveness of the model by examining the accuracy and robustness of the expansion methods, and investigate the relative effectiveness of various sources of term evidence. Statistically significant differences in accuracy were observed depending on the weighting of evidence in the random walk. For example, using co-occurrence data later in the walk was generally better than using it early, suggesting further improvements in effectiveness may be possible by learning walk behaviors.