Question-answer topic model for question retrieval in community question answering

Authors:
Zongcheng Ji;Fei Xu;Bin Wang;Ben He
Affiliations:
Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China;Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China;Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China;Graduate University of Chinese Academy of Sciences, Beijing, China
Venue:
Proceedings of the 21st ACM international conference on Information and knowledge management
Year:
2012

Abstract

The major challenge for Question Retrieval (QR) in Community Question Answering (CQA) is the lexical gap between the queried question and the historical questions. This paper proposes a novel Question-Answer Topic Model (QATM) that alleviates the lexical gap by learning latent topics aligned across question-answer pairs, under the assumption that a question and its paired answer share the same topic distribution. Experiments on a real-world CQA dataset from Yahoo! Answers show two results. First, properly combining the question part and the answer part captures more knowledge than using either part alone or simply mixing the two. Second, combining QATM with the state-of-the-art translation-based language model significantly improves QR performance; here the topic information and the translation information are both learned from the question-answer pairs, but at two different levels of semantic granularity.
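
The shared-topic assumption can be illustrated with a small generative sketch. It is not the authors' implementation: the abstract gives neither the parameterization nor the inference procedure, so the separate per-topic word distributions for questions and answers (`PHI_Q`, `PHI_A`), the toy vocabularies, and the function names are all hypothetical. The only point taken from the abstract is that one topic distribution (`THETA`) is reused for both sides of a pair.

```python
import random

# Hypothetical toy parameters (not from the paper): 2 topics, tiny vocabularies.
THETA = [0.7, 0.3]  # topic distribution shared by one question-answer pair
PHI_Q = [  # assumed per-topic word distributions for the question side
    {"laptop": 0.6, "battery": 0.4},
    {"flight": 0.5, "visa": 0.5},
]
PHI_A = [  # assumed per-topic word distributions for the answer side
    {"charger": 0.5, "replace": 0.5},
    {"embassy": 0.5, "airline": 0.5},
]


def draw_words(theta, phi, n_words, rng):
    """Draw n_words by sampling a topic from theta, then a word from phi[topic]."""
    topics = list(range(len(theta)))
    words = []
    for _ in range(n_words):
        z = rng.choices(topics, weights=theta)[0]
        vocab = list(phi[z].keys())
        probs = list(phi[z].values())
        words.append(rng.choices(vocab, weights=probs)[0])
    return words


def sample_qa_pair(theta, phi_q, phi_a, n_q, n_a, rng):
    """Generate a question and its answer from the SAME topic distribution."""
    question = draw_words(theta, phi_q, n_q, rng)
    answer = draw_words(theta, phi_a, n_a, rng)
    return question, answer


rng = random.Random(0)
question, answer = sample_qa_pair(THETA, PHI_Q, PHI_A, 5, 8, rng)
print(question)
print(answer)
```

Because both sides draw their topics from the same `THETA`, a question word such as "battery" and an answer word such as "charger" tend to co-occur under one topic even though they never match lexically. That is the intuition behind using aligned topics to bridge the lexical gap.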