Deep Learning Approaches to Semantic Relevance Modeling for Chinese Question-Answer Pairs

Authors:
Baoxun Wang;Bingquan Liu;Xiaolong Wang;Chengjie Sun;Deyuan Zhang
Affiliations:
Harbin Institute of Technology;Harbin Institute of Technology;Harbin Institute of Technology;Harbin Institute of Technology;Harbin Institute of Technology
Venue:
ACM Transactions on Asian Language Information Processing (TALIP)
Year:
2011

Citing 18
Cited 1

Bridging the lexical chasm: statistical approaches to answer-finding

SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
Training products of experts by minimizing contrastive divergence

Neural Computation
Retrieving answers from frequently asked questions pages on the web

Proceedings of the 14th ACM international conference on Information and knowledge management
Finding similar questions in large question and answer archives

Proceedings of the 14th ACM international conference on Information and knowledge management
An intelligent discussion-bot for answering student queries in threaded discussions

Proceedings of the 11th international conference on Intelligent user interfaces
A framework to predict the quality of answers with non-textual features

SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
A fast learning algorithm for deep belief nets

Neural Computation
Detection of question-answer pairs in email conversations

COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Finding high-quality content in social media

WSDM '08 Proceedings of the 2008 International Conference on Web Search and Data Mining
Finding question-answer pairs from online forums

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Semantic hashing

International Journal of Approximate Reasoning
A classification-based approach to question answering in discussion boards

Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Bridging lexical gaps between queries and questions on large online Q&A collections with compact translation models

EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Extracting chatbot knowledge from online discussion forums

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Combining lexical semantic resources with question & answer archives for translation-based answer finding

ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2 - Volume 2
Answering opinion questions with random walks on graphs

ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2 - Volume 2
Extracting Chinese question-answer pairs from online forums

SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
Metadata-aware measures for answer summarization in community Question Answering

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics

Adapting deep RankNet for personalized search

Proceedings of the 7th ACM international conference on Web search and data mining

Quantified Score

Hi-index	0.00

Visualization

Abstract

The human-generated question-answer pairs in the Web social communities are of great value for the research of automatic question-answering technique. Due to the large amount of noise information involved in such corpora, it is still a problem to detect the answers even though the questions are exactly located. Quantifying the semantic relevance between questions and their candidate answers is essential to answer detection in social media corpora. Since both the questions and their answers usually contain a small number of sentences, the relevance modeling methods have to overcome the problem of word feature sparsity. In this article, the deep learning principle is introduced to address the semantic relevance modeling task. Two deep belief networks with different architectures are proposed by us to model the semantic relevance for the question-answer pairs. According to the investigation of the textual similarity between the community-driven question-answering (cQA) dataset and the forum dataset, a learning strategy is adopted to promote our models’ performance on the social community corpora without hand-annotating work. The experimental results show that our method outperforms the traditional approaches on both the cQA and the forum corpora.