Query clustering using user logs
ACM Transactions on Information Systems (TOIS)
The Perceptron Algorithm with Uneven Margins
ICML '02 Proceedings of the Nineteenth International Conference on Machine Learning
Structured use of external knowledge for event-based open domain question answering
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Discovery of inference rules for question-answering
Natural Language Engineering
Analysis of Statistical Question Classification for Fact-Based Questions
Information Retrieval
COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Extracting paraphrases from a parallel corpus
ACL '01 Proceedings of the 39th Annual Meeting on Association for Computational Linguistics
Learning to paraphrase: an unsupervised approach using multiple-sequence alignment
NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
Synonymous collocation extraction using translation information
ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
An analysis of the AskMSR question-answering system
EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
Interrogative reformulation patterns and acquisition of question paraphrases
PARAPHRASE '03 Proceedings of the second international workshop on Paraphrasing - Volume 16
Paraphrasing with bilingual parallel corpora
ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
Automatic paraphrase acquisition from news articles
HLT '02 Proceedings of the second international conference on Human Language Technology Research
Web-based unsupervised learning for query formulation in question answering
IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
Extracting paraphrase patterns from bilingual parallel corpora
Natural Language Engineering
Answering learners' questions by retrieving question paraphrases from social Q&A sites
EANL '08 Proceedings of the Third Workshop on Innovative Use of NLP for Building Educational Applications
Learning the relative usefulness of questions in community QA
EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
A utility-driven approach to question ranking in social QA
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Paraphrasing with search engine query logs
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Improving question recommendation by exploiting information need
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Unsupervised identification of synonymous query intent templates for attribute intents
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Hi-index | 0.00 |
Question paraphrasing is critical in many Natural Language Processing (NLP) applications, especially for question reformulation in question answering (QA). However, choosing an appropriate data source and developing effective methods are challenging tasks. In this paper, we propose a method that exploits Encarta logs to automatically identify question paraphrases and extract templates. Questions from Encarta logs are partitioned into small clusters, within which a perceptron classier is used for identifying question paraphrases. Experiments are conducted and the results have shown: (1) Encarta log data is an eligible data source for question paraphrasing and the user clicks in the data are indicative clues for recognizing paraphrases; (2) the supervised method we present is effective, which can evidently outperform the unsupervised method. Besides, the features introduced to identify paraphrases are sound; (3) the obtained question paraphrase templates are quite effective in question reformulation, enhancing the MRR from 0.2761 to 0.4939 with the questions of TREC QA 2003.