Variations in relevance judgments and the measurement of retrieval effectiveness
Information Processing and Management: an International Journal
Evaluating Natural Language Processing Systems: An Analysis and Review
Evaluating Natural Language Processing Systems: An Analysis and Review
The TIPSTER SUMMAC Text Summarization Evaluation
EACL '99 Proceedings of the ninth conference on European chapter of the Association for Computational Linguistics
MUC3 '91 Proceedings of the 3rd conference on Message understanding
Query-relevant summarization using FAQs
ACL '00 Proceedings of the 38th Annual Meeting on Association for Computational Linguistics
Using query term order for result summarisation
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Summarization from medical documents: a survey
Artificial Intelligence in Medicine
Investigating sentence weighting components for automatic summarisation
Information Processing and Management: an International Journal
Predicting the readability of short web summaries
Proceedings of the Second ACM International Conference on Web Search and Data Mining
Progress in information retrieval
ECIR'06 Proceedings of the 28th European conference on Advances in Information Retrieval
Improving search result summaries by using searcher behavior data
Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
Hi-index | 0.00 |
The aim of our research is to produce and assess short summaries to aid users' relevance judgements, for example for a search engine result page. In this paper we present our new metric for measuring summary quality based on representativeness and judgeability, and compare the summary quality of our system to that of Google. We discuss the basis for constructing our evaluation methodology in contrast to previous relevant open evaluations, arguing that the elements which make up an evaluation methodology: the tasks, data and metrics, are interdependent and the way in which they are combined is critical to the effectiveness of the methodology. The paper discusses the relationship between these three factors as implemented in our own work, as well as in SUMMAC/MUC/DUC.