Learning in a pairwise term-term proximity framework for information retrieval

Authors:
Ronan Cummins;Colm O'Riordan
Affiliations:
Digital Enterprise Research Institute, Galway, Ireland;Dept. of Information Technology, Galway, Ireland
Venue:
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Year:
2009

Citing 15
Cited 13

Genetic programming: on the programming of computers by means of natural selection

Genetic programming: on the programming of computers by means of natural selection
A language modeling approach to information retrieval

Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
A vector space model for automatic indexing

Communications of the ACM
A probabilistic model of information retrieval: development and comparative experiments

Information Processing and Management: an International Journal
Enhancing the Set-Based Model Using Proximity Information

SPIRE 2002 Proceedings of the 9th International Symposium on String Processing and Information Retrieval
Learning to Rank

Information Retrieval
An information retrieval model using the fuzzy proximity degree of term occurences

Proceedings of the 2005 ACM symposium on Applied computing
An exploration of axiomatic approaches to information retrieval

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Evolving local and global weighting schemes in information retrieval

Information Retrieval
Term proximity scoring for ad-hoc retrieval on very large text collections

SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
An exploration of proximity measures in information retrieval

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Investigation of partial query proximity in web search

Proceedings of the 17th international conference on World Wide Web
Exploiting proximity feature in bigram language model for information retrieval

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Term proximity scoring for keyword-based retrieval systems

ECIR'03 Proceedings of the 25th European conference on IR research
Contextual proximity based term-weighting for improved web information retrieval

KSEM'07 Proceedings of the 2nd international conference on Knowledge science, engineering and management

Learning concept importance using a weighted dependence model

Proceedings of the third ACM international conference on Web search and data mining
Positional relevance model for pseudo-relevance feedback

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Examining the information retrieval process from an inductive perspective

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
An analysis of learned proximity functions

RIAO '10 Adaptivity, Personalization and Fusion of Heterogeneous Information
Identifying disease diagnosis factors by proximity-based mining of medical texts

ACIIDS'11 Proceedings of the Third international conference on Intelligent information and database systems - Volume Part II
Finding images of difficult entities in the long tail

Proceedings of the 20th ACM international conference on Information and knowledge management
Improving retrievability of patents in prior-art search

ECIR'2010 Proceedings of the 32nd European conference on Advances in Information Retrieval
Measuring the ability of score distributions to model relevance

AIRS'11 Proceedings of the 7th Asia conference on Information Retrieval Technology
Predicting query performance directly from score distributions

AIRS'11 Proceedings of the 7th Asia conference on Information Retrieval Technology
Modeling higher-order term dependencies in information retrieval using query hypergraphs

SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Permutation indexing: fast approximate retrieval from large corpora

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
On segmentation of eCommerce queries

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Enhancement of passage scorers by proximity-based term occurrence weighting

International Journal of Intelligent Information and Database Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

Traditional ad hoc retrieval models do not take into account the closeness or proximity of terms. Document scores in these models are primarily based on the occurrences or non-occurrences of query-terms considered independently of each other. Intuitively, documents in which query-terms occur closer together should be ranked higher than documents in which the query-terms appear far apart. This paper outlines several term-term proximity measures and develops an intuitive framework in which they can be used to fully model the proximity of all query-terms for a particular topic. As useful proximity functions may be constructed from many proximity measures, we use a learning approach to combine proximity measures to develop a useful proximity function in the framework. An evaluation of the best proximity functions show that there is a significant improvement over the baseline ad hoc retrieval model and over other more recent methods that employ the use of single proximity measures.