Data-driven text features for sponsored search click prediction

Authors:
Benyah Shaparenko;Özgür Çetin;Rukmini Iyer
Affiliations:
Cornell University, Ithaca, NY;Yahoo! Labs, New York, NY;Yahoo! Labs, Santa Clara, CA
Venue:
Proceedings of the Third International Workshop on Data Mining and Audience Intelligence for Advertising
Year:
2009

Citing 30
Cited 12

Term-weighting approaches in automatic text retrieval

Information Processing and Management: an International Journal
Optimum polynomial retrieval functions based on the probability ranking principle

ACM Transactions on Information Systems (TOIS)
Elements of information theory

Elements of information theory
Probabilistic retrieval based on staged logistic regression

SIGIR '92 Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval
A language modeling approach to information retrieval

Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Information retrieval as statistical translation

Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
IR evaluation methods for retrieving highly relevant documents

SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
A vector space model for automatic indexing

Communications of the ACM
A study of smoothing methods for language models applied to Ad Hoc information retrieval

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Text Categorization with Suport Vector Machines: Learning with Many Relevant Features

ECML '98 Proceedings of the 10th European Conference on Machine Learning
Optimizing search engines using clickthrough data

Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Pattern Classification (2nd Edition)

Pattern Classification (2nd Edition)
Corpus structure, language models, and ad hoc information retrieval

Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Eye-tracking analysis of user behavior in WWW search

Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Accurately interpreting clickthrough data as implicit feedback

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Impedance coupling in content-targeted advertising

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
A generative theory of relevance

A generative theory of relevance
Finding advertising keywords on web pages

Proceedings of the 15th international conference on World Wide Web
Generating query substitutions

Proceedings of the 15th international conference on World Wide Web
Predicting clicks: estimating the click-through rate for new ads

Proceedings of the 16th international conference on World Wide Web
The influence of caption features on clickthrough patterns in web search

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
A semantic approach to contextual advertising

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Query rewriting using active learning for sponsored search

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
An Interior-Point Method for Large-Scale l1-Regularized Logistic Regression

The Journal of Machine Learning Research
A noisy-channel approach to contextual advertising

Proceedings of the 1st international workshop on Data mining and audience intelligence for advertising
Online learning from click data for sponsored search

Proceedings of the 17th international conference on World Wide Web
Contextual advertising by combining relevance with click feedback

Proceedings of the 17th international conference on World Wide Web
Searching question and answer archives

Searching question and answer archives
Optimizing relevance and revenue in ad search: a query substitution approach

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Consistent phrase relevance measures

Proceedings of the 2nd International Workshop on Data Mining and Audience Intelligence for Advertising

A collaborative filtering approach to ad recommendation using the query-ad click graph

Proceedings of the 18th ACM conference on Information and knowledge management
Improving ad relevance in sponsored search

Proceedings of the third ACM international conference on Web search and data mining
Conversion rate based bid adjustment for sponsored search

Proceedings of the 19th international conference on World wide web
Temporal click model for sponsored search

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Minimally-supervised extraction of entities from text advertisements

HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Computational advertising: leveraging user interaction & contextual factors for improved ad retrieval & ranking

Proceedings of the 20th international conference companion on World wide web
The sum of its parts: reducing sparsity in click estimation with query segments

Information Retrieval
Multi-objective optimization for sponsored search

Proceedings of the Sixth International Workshop on Data Mining for Online Advertising and Internet Economy
Captions and biases in diagnostic search

ACM Transactions on the Web (TWEB)
Psychological advertising: exploring user psychology for click prediction in sponsored search

Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining
New features for query dependent sponsored search click prediction

Proceedings of the 22nd international conference on World Wide Web companion
Sampling dilemma: towards effective data sampling for click prediction in sponsored search

Proceedings of the 7th ACM international conference on Web search and data mining

Quantified Score

Hi-index	0.01

Visualization

Abstract

Much search engine revenue comes from sponsored search ads displayed with algorithmic search results. To maximize revenue, it is essential to choose a good slate of ads for each query, requiring accurate prediction of whether or not users will click on an ad. Click prediction is relatively easy for the ads that have been displayed many times, and have significant click history, but in the long tail with minimal or no click history, other features are needed to predict user response. In this work, we investigate the use of novel text features for this problem, within the context of a state-of-the-art sponsored search system. In particular, we propose the use of detailed word-pair indicator features between the query and ad. We compare the new features to the traditional vector-space and language modeling features extracted in a typical information-retrieval style. We evaluate these approaches in a maximum-entropy ranking model using the click-view data from a commercial search-engine traffic. We show that the word-pair features are highly helpful for sponsored search click prediction, not only improving over the sophisticated click-history feedback based systems, but also compensating for the lack of click history to some extent. In contrast, we find that the language and vector-space modeling approaches are significantly less effective.