Title language model for information retrieval

Authors:
Rong Jin;Alex G. Hauptmann;Cheng Xiang Zhai
Affiliations:
Carnegie Mellon University;Carnegie Mellon University;Carnegie Mellon University
Venue:
SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
Year:
2002

Citing 8
Cited 36

A language modeling approach to information retrieval

Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
A hidden Markov model information retrieval system

Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Information retrieval as statistical translation

Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Applying summarization techniques for term selection in relevance feedback

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Document language models, query models, and risk minimization for information retrieval

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Relevance based language models

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
A study of smoothing methods for language models applied to Ad Hoc information retrieval

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
The mathematics of statistical machine translation: parameter estimation

Computational Linguistics - Special issue on using large corpora: II

Searching the workplace web

WWW '03 Proceedings of the 12th international conference on World Wide Web
Empirical development of an exponential probabilistic model for text retrieval: using textual analysis to build a better model

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Analysis of anchor text for web search

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Probabilistic model for contextual retrieval

Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Regularizing translation models for better automatic image annotation

Proceedings of the thirteenth ACM international conference on Information and knowledge management
Simplified similarity scoring using term ranks

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Integrating word relationships into language models

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Discovering "title-like" terms

Information Processing and Management: an International Journal
Learn to weight terms in information retrieval using category information

ICML '05 Proceedings of the 22nd international conference on Machine learning
Two-stage statistical language models for text database selection

Information Retrieval
Using thematic information in statistical headline generation

MultiSumQA '03 Proceedings of the ACL 2003 workshop on Multilingual summarization and question answering - Volume 12
Context-sensitive semantic smoothing for the language modeling approach to genomic IR

SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
A trigger language model-based IR system

COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Parsimonious translation models for information retrieval

Information Processing and Management: an International Journal
Enhancing relevance scoring with chronological term rank

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Retrieval models for question and answer archives

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
User language model for collaborative personalized search

ACM Transactions on Information Systems (TOIS)
Statistical Language Models for Information Retrieval A Critical Review

Foundations and Trends in Information Retrieval
Multinomial randomness models for retrieval with document fields

ECIR'07 Proceedings of the 29th European conference on IR research
Estimation of statistical translation models based on mutual information for ad hoc information retrieval

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Multi-style language model for web scale information retrieval

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Web spam detection: new classification features based on qualified link analysis and language models

IEEE Transactions on Information Forensics and Security
Clickthrough-based translation models for web search: from word models to phrase models

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Clickthrough-based latent semantic models for web search

Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Blog feed search with a post index

Information Retrieval
Wikipedia-based semantic smoothing for the language modeling approach to information retrieval

ECIR'2010 Proceedings of the 32nd European conference on Advances in Information Retrieval
An article language model for BBS search

ICWE'05 Proceedings of the 5th international conference on Web Engineering
Literal-matching-biased link analysis

AIRS'04 Proceedings of the 2004 international conference on Asian Information Retrieval Technology
Axiomatic analysis of translation language model for information retrieval

ECIR'12 Proceedings of the 34th European conference on Advances in Information Retrieval
Building enriched web page representations using link paths

Proceedings of the 23rd ACM conference on Hypertext and social media
Translation techniques in cross-language information retrieval

ACM Computing Surveys (CSUR)
Learning lexicon models from search logs for query expansion

EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Classifying and ranking: the first step towards mining inside vertical search engines

DEXA'07 Proceedings of the 18th international conference on Database and Expert Systems Applications
Modeling click-through based word-pairs for web search

Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
Exploiting proximity feature in statistical translation models for information retrieval

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Towards Concept-Based Translation Models Using Search Logs for Query Expansion

Proceedings of the 21st ACM international conference on Information and knowledge management

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we propose a new language model, namely, a title language model, for information retrieval. Different from the traditional language model used for retrieval, we define the conditional probability P(Q|D) as the probability of using query Q as the title for document D. We adopted the statistical translation model learned from the title and document pairs in the collection to compute the probability P(Q|D). To avoid the sparse data problem, we propose two new smoothing methods. In the experiments with four different TREC document collections, the title language model for information retrieval with the new smoothing method outperforms both the traditional language model and the vector space model for IR significantly.