Statistical analysis with missing data
Statistical analysis with missing data
Class-based n-gram models of natural language
Computational Linguistics
IR evaluation methods for retrieving highly relevant documents
SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
Query clustering using user logs
ACM Transactions on Information Systems (TOIS)
Neural Networks for Pattern Recognition
Neural Networks for Pattern Recognition
Some Solutions to the Missing Feature Problem in Vision
Advances in Neural Information Processing Systems 5, [NIPS Conference]
Optimizing search engines using clickthrough data
Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Optimizing web search using web click-through data
Proceedings of the thirteenth ACM international conference on Information and knowledge management
Accurately interpreting clickthrough data as implicit feedback
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Linear discriminant model for information retrieval
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Learning to rank using gradient descent
ICML '05 Proceedings of the 22nd international conference on Machine learning
Improving web search ranking by incorporating user behavior information
SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Random walks on the click graph
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Extracting semantic relations from query logs
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Active exploration for learning rankings from clickthrough data
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Confidence-weighted linear classification
Proceedings of the 25th international conference on Machine learning
Learning query intent from regularized click graphs
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
BrowseRank: letting web users vote for page importance
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Query suggestion using hitting time
Proceedings of the 17th ACM conference on Information and knowledge management
A machine learning approach for improved BM25 retrieval
Proceedings of the 18th ACM conference on Information and knowledge management
Mining search engine clickthrough log for matching N-gram features
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
Classification-enhanced ranking
Proceedings of the 19th international conference on World wide web
Exploring web scale language models for search query processing
Proceedings of the 19th international conference on World wide web
Sampling high-quality clicks from noisy click data
Proceedings of the 19th international conference on World wide web
Bayesian Browsing Model: Exact Inference of Document Relevance from Petabyte-Scale Data
ACM Transactions on Knowledge Discovery from Data (TKDD)
Learning phrase-based spelling error models from clickthrough data
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Clickthrough-based translation models for web search: from word models to phrase models
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
A large scale ranker-based system for search query spelling correction
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Smoothing click counts for aggregated vertical search
ECIR'11 Proceedings of the 33rd European conference on Advances in information retrieval
Jigs and lures: associating web queries with structured entities
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Inferring and using location metadata to personalize web search
Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Clickthrough-based latent semantic models for web search
Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Mining Concept Sequences from Large-Scale Search Logs for Context-Aware Query Suggestion
ACM Transactions on Intelligent Systems and Technology (TIST)
Discovering missing click-through query language information for web search
Proceedings of the 20th ACM international conference on Information and knowledge management
Reranking search results for sparse queries
Proceedings of the 20th ACM international conference on Information and knowledge management
Personalizing web search results by reading level
Proceedings of the 20th ACM international conference on Information and knowledge management
Extracting search-focused key n-grams for relevance ranking in web search
Proceedings of the fifth ACM international conference on Web search and data mining
Probabilistic models for personalizing web search
Proceedings of the fifth ACM international conference on Web search and data mining
Evaluating the effectiveness of search task trails
Proceedings of the 21st international conference on World Wide Web
Modeling the impact of short- and long-term behavior on search personalization
SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Mining search query logs for spoken language understanding
SDCTD '12 NAACL-HLT Workshop on Future Directions and Needs in the Spoken Dialog Community: Tools and Data
Learning lexicon models from search logs for query expansion
EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Domain dependent query reformulation for web search
Proceedings of the 21st ACM international conference on Information and knowledge management
Personalizing atypical web search sessions
Proceedings of the sixth ACM international conference on Web search and data mining
A low rank structural large margin method for cross-modal ranking
Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
Query expansion using path-constrained random walks
Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
Modeling click-through based word-pairs for web search
Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
Toward whole-session relevance: exploring intrinsic diversity in web search
Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
Enhancing personalized search by mining and modeling task behavior
Proceedings of the 22nd international conference on World Wide Web
A vlHMM approach to context-aware search
ACM Transactions on the Web (TWEB)
Cross-media semantic representation via bi-directional learning to rank
Proceedings of the 21st ACM international conference on Multimedia
Learning deep structured semantic models for web search using clickthrough data
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Through-the-looking glass: utilizing rich post-search trail statistics for web search
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Mining search and browse logs for web search: A Survey
ACM Transactions on Intelligent Systems and Technology (TIST) - Survey papers, special sections on the semantic adaptive social web, intelligent systems for health informatics, regular papers
Towards Concept-Based Translation Models Using Search Logs for Query Expansion
Proceedings of the 21st ACM international conference on Information and knowledge management
Hi-index | 0.00 |
Incorporating features extracted from clickthrough data (called clickthrough features) has been demonstrated to significantly improve the performance of ranking models for Web search applications. Such benefits, however, are severely limited by the data sparseness problem, i.e., many queries and documents have no or very few clicks. The ranker thus cannot rely strongly on clickthrough features for document ranking. This paper presents two smoothing methods to expand clickthrough data: query clustering via Random Walk on click graphs and a discounting method inspired by the Good-Turing estimator. Both methods are evaluated on real-world data in three Web search domains. Experimental results show that the ranking models trained on smoothed clickthrough features consistently outperform those trained on unsmoothed features. This study demonstrates both the importance and the benefits of dealing with the sparseness problem in clickthrough data.