Term-weighting approaches in automatic text retrieval
Information Processing and Management: an International Journal
Optimum polynomial retrieval functions based on the probability ranking principle
ACM Transactions on Information Systems (TOIS)
Elements of information theory
Elements of information theory
Probabilistic retrieval based on staged logistic regression
SIGIR '92 Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval
A language modeling approach to information retrieval
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Information retrieval as statistical translation
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
IR evaluation methods for retrieving highly relevant documents
SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
A vector space model for automatic indexing
Communications of the ACM
A study of smoothing methods for language models applied to Ad Hoc information retrieval
Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Text Categorization with Suport Vector Machines: Learning with Many Relevant Features
ECML '98 Proceedings of the 10th European Conference on Machine Learning
Optimizing search engines using clickthrough data
Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Pattern Classification (2nd Edition)
Pattern Classification (2nd Edition)
Corpus structure, language models, and ad hoc information retrieval
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Eye-tracking analysis of user behavior in WWW search
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Accurately interpreting clickthrough data as implicit feedback
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Impedance coupling in content-targeted advertising
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
A generative theory of relevance
A generative theory of relevance
Finding advertising keywords on web pages
Proceedings of the 15th international conference on World Wide Web
Generating query substitutions
Proceedings of the 15th international conference on World Wide Web
Predicting clicks: estimating the click-through rate for new ads
Proceedings of the 16th international conference on World Wide Web
The influence of caption features on clickthrough patterns in web search
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
A semantic approach to contextual advertising
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Query rewriting using active learning for sponsored search
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
An Interior-Point Method for Large-Scale l1-Regularized Logistic Regression
The Journal of Machine Learning Research
A noisy-channel approach to contextual advertising
Proceedings of the 1st international workshop on Data mining and audience intelligence for advertising
Online learning from click data for sponsored search
Proceedings of the 17th international conference on World Wide Web
Contextual advertising by combining relevance with click feedback
Proceedings of the 17th international conference on World Wide Web
Searching question and answer archives
Searching question and answer archives
Optimizing relevance and revenue in ad search: a query substitution approach
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Consistent phrase relevance measures
Proceedings of the 2nd International Workshop on Data Mining and Audience Intelligence for Advertising
A collaborative filtering approach to ad recommendation using the query-ad click graph
Proceedings of the 18th ACM conference on Information and knowledge management
Improving ad relevance in sponsored search
Proceedings of the third ACM international conference on Web search and data mining
Conversion rate based bid adjustment for sponsored search
Proceedings of the 19th international conference on World wide web
Temporal click model for sponsored search
Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Minimally-supervised extraction of entities from text advertisements
HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Proceedings of the 20th international conference companion on World wide web
The sum of its parts: reducing sparsity in click estimation with query segments
Information Retrieval
Multi-objective optimization for sponsored search
Proceedings of the Sixth International Workshop on Data Mining for Online Advertising and Internet Economy
Captions and biases in diagnostic search
ACM Transactions on the Web (TWEB)
Psychological advertising: exploring user psychology for click prediction in sponsored search
Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining
New features for query dependent sponsored search click prediction
Proceedings of the 22nd international conference on World Wide Web companion
Sampling dilemma: towards effective data sampling for click prediction in sponsored search
Proceedings of the 7th ACM international conference on Web search and data mining
Hi-index | 0.01 |
Much search engine revenue comes from sponsored search ads displayed with algorithmic search results. To maximize revenue, it is essential to choose a good slate of ads for each query, requiring accurate prediction of whether or not users will click on an ad. Click prediction is relatively easy for the ads that have been displayed many times, and have significant click history, but in the long tail with minimal or no click history, other features are needed to predict user response. In this work, we investigate the use of novel text features for this problem, within the context of a state-of-the-art sponsored search system. In particular, we propose the use of detailed word-pair indicator features between the query and ad. We compare the new features to the traditional vector-space and language modeling features extracted in a typical information-retrieval style. We evaluate these approaches in a maximum-entropy ranking model using the click-view data from a commercial search-engine traffic. We show that the word-pair features are highly helpful for sponsored search click prediction, not only improving over the sophisticated click-history feedback based systems, but also compensating for the lack of click history to some extent. In contrast, we find that the language and vector-space modeling approaches are significantly less effective.