A proximity language model for information retrieval

Authors:
Jinglei Zhao;Yeogirl Yun
Affiliations:
iZENEsoft, Inc., Shanghai, China;Wisenut, Inc., Seoul, South Korea
Venue:
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Year:
2009

Citing 15
Cited 20

The use of phrases and structured queries in information retrieval

SIGIR '91 Proceedings of the 14th annual international ACM SIGIR conference on Research and development in information retrieval
OHSUMED: an interactive retrieval evaluation and new large test collection for research

SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
Some inconsistencies and misidentified modeling assumptions in probabilistic information retrieval

ACM Transactions on Information Systems (TOIS)
A hidden Markov model information retrieval system

Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
A general language model for information retrieval

Proceedings of the eighth international conference on Information and knowledge management
Shortest-substring retrieval and ranking

ACM Transactions on Information Systems (TOIS)
Relevance ranking for one to three term queries

Information Processing and Management: an International Journal
Document language models, query models, and risk minimization for information retrieval

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Biterm language models for document retrieval

SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
Capturing term dependencies using a language model based on sentence trees

Proceedings of the eleventh international conference on Information and knowledge management
A model of lexical attraction and repulsion

ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
A study of smoothing methods for language models applied to information retrieval

ACM Transactions on Information Systems (TOIS)
Dependence language model for information retrieval

Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
A Markov random field model for term dependencies

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
An exploration of proximity measures in information retrieval

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval

Positional relevance model for pseudo-relevance feedback

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Language pyramid and multi-scale text analysis

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Using various term dependencies according to their utilities

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
An analysis of learned proximity functions

RIAO '10 Adaptivity, Personalization and Fusion of Heterogeneous Information
Modeling term proximity for probabilistic information retrieval models

Information Sciences: an International Journal
Identifying disease diagnosis factors by proximity-based mining of medical texts

ACIIDS'11 Proceedings of the Third international conference on Intelligent information and database systems - Volume Part II
Combining strategies for XML retrieval

INEX'10 Proceedings of the 9th international conference on Initiative for the evaluation of XML retrieval: comparative evaluation of focused retrieval
A New Language Model Combining Single and Compound Terms

WI-IAT '11 Proceedings of the 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Volume 01
Does word sense disambiguation improve information retrieval?

Proceedings of the fourth workshop on Exploiting semantic annotations in information retrieval
A study of the integration of passage-, document-, and cluster-based information for re-ranking search results

Information Retrieval
High-performance processing of text queries with tunable pruned term and term pair indexes

ACM Transactions on Information Systems (TOIS)
Improving retrievability of patents in prior-art search

ECIR'2010 Proceedings of the 32nd European conference on Advances in Information Retrieval
Proximity-based rocchio's model for pseudo relevance

SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Reduction of training noises for text classifiers

ACIIDS'13 Proceedings of the 5th Asian conference on Intelligent Information and Database Systems - Volume Part II
Modeling term dependencies with quantum language models for IR

Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
A novel neighborhood based document smoothing model for information retrieval

Information Retrieval
Exploiting proximity feature in statistical translation models for information retrieval

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Enhancement of passage scorers by proximity-based term occurrence weighting

International Journal of Intelligent Information and Database Systems
Latent word context model for information retrieval

Information Retrieval
Semantic concept-enriched dependence model for medical information retrieval

Journal of Biomedical Informatics

Quantified Score

Hi-index	0.01

Visualization

Abstract

The proximity of query terms in a document is a very important information to enable ranking models go beyond the "bag of word" assumption in information retrieval. This paper studies the integration of term proximity information into the unigram language modeling. A new proximity language model (PLM) is proposed which views query terms' proximity centrality as the Dirichlet hyper-parameter that weights the parameters of the unigram document language model. Several forms of proximity measure are developed to be used in PLM which could compute a query term's proximate centrality in a specific document. In experiments, the proximity language model is compared with the basic language model and previous works that combine the proximity information with language model using linear score combination. The experiment results show that the proposed model performs better in both top precision and average precision.