This paper presents two new document ranking models for Web search, based on semantic representation methods and on the statistical translation-based approach to information retrieval (IR). Assuming that a query is parallel to the titles of the documents clicked on for that query, a large number of query-title pairs are constructed from clickthrough data, and two latent semantic models are learned from these pairs. The first is a bilingual topic model within the language modeling framework. It ranks documents for a query by the likelihood of the query being a semantics-based translation of the documents. The semantic representation is language independent and is learned from query-title pairs, under the assumption that a query and its paired titles share the same distribution over semantic topics. The second is a discriminative projection model within the vector space modeling framework. Unlike Latent Semantic Analysis and its variants, the projection matrix in our model, which maps term vectors into the semantic space, is learned discriminatively, so that the distance between a query and its paired title, both represented as vectors in the projected semantic space, is smaller than the distance between the query and the titles of other documents that received no clicks for that query. Both models are evaluated on the Web search task using a real-world data set. Results show that they significantly outperform their corresponding state-of-the-art baseline models.
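The idea behind the discriminative projection model can be illustrated with a small sketch. It is not the paper's implementation: the abstract does not state the loss function, the optimizer, or the distance measure, so the squared Euclidean distance, the hinge loss with a margin, the plain stochastic gradient descent, the toy vocabulary, and all names below (`project`, `sq_dist`, `train`, `TRIPLES`) are assumptions chosen for illustration. Each training triple holds a query, the title of a clicked document, and the title of a document with no clicks for that query, all as bag-of-words term vectors.

```python
import random

# Hypothetical toy vocabulary; each text is a bag-of-words count vector over it.
VOCAB = ["cheap", "flights", "airfare", "pasta", "recipe"]

# (query, clicked title, unclicked title) triples -- illustrative data only.
TRIPLES = [
    ([1, 1, 0, 0, 0], [1, 0, 1, 0, 0], [0, 0, 0, 1, 1]),  # "cheap flights"
    ([0, 0, 0, 1, 0], [0, 0, 0, 1, 1], [1, 0, 1, 0, 0]),  # "pasta"
]


def project(W, x):
    """Map a term vector x (length V) into the k-dim semantic space: W^T x."""
    k = len(W[0])
    return [sum(W[i][j] * x[i] for i in range(len(x))) for j in range(k)]


def sq_dist(W, a, b):
    """Squared Euclidean distance between a and b after projection by W."""
    pa, pb = project(W, a), project(W, b)
    return sum((u - v) ** 2 for u, v in zip(pa, pb))


def train(triples, vocab_size, k=2, margin=1.0, lr=0.05, epochs=200, seed=0):
    """Learn a V x k projection matrix with a pairwise hinge loss (assumed).

    For each triple the loss is
        max(0, margin + dist(q, clicked) - dist(q, unclicked)),
    so a violated triple pulls the clicked title toward the query and
    pushes the unclicked title away.
    """
    rng = random.Random(seed)
    W = [[rng.uniform(-0.1, 0.1) for _ in range(k)] for _ in range(vocab_size)]
    for _ in range(epochs):
        for q, t_pos, t_neg in triples:
            loss = margin + sq_dist(W, q, t_pos) - sq_dist(W, q, t_neg)
            if loss <= 0:
                continue  # constraint already satisfied with the margin
            d_pos = [a - b for a, b in zip(q, t_pos)]
            d_neg = [a - b for a, b in zip(q, t_neg)]
            p_pos = project(W, d_pos)  # W^T (q - t_pos)
            p_neg = project(W, d_neg)  # W^T (q - t_neg)
            # Gradient of ||W^T d||^2 w.r.t. W[i][j] is 2 * d[i] * (W^T d)[j].
            for i in range(vocab_size):
                for j in range(k):
                    grad = 2 * (d_pos[i] * p_pos[j] - d_neg[i] * p_neg[j])
                    W[i][j] -= lr * grad
    return W


W = train(TRIPLES, vocab_size=len(VOCAB))
for q, t_pos, t_neg in TRIPLES:
    print("clicked:", sq_dist(W, q, t_pos), "unclicked:", sq_dist(W, q, t_neg))
```

At ranking time, under the same assumptions, a query and each candidate title would be projected with the learned matrix and the candidates sorted by increasing distance to the query. A real system would use sparse vectors over a large vocabulary and many more unclicked titles per query.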