Evaluating web search result summaries

Authors:
Shao Fen Liang;Siobhan Devlin;John Tait
Affiliations:
The University of Sunderland School of Computing and Technology, Sunderland, UK;The University of Sunderland School of Computing and Technology, Sunderland, UK;The University of Sunderland School of Computing and Technology, Sunderland, UK
Venue:
ECIR'06 Proceedings of the 28th European conference on Advances in Information Retrieval
Year:
2006

Citing 8
Cited 4

Variations in relevance judgments and the measurement of retrieval effectiveness

Information Processing and Management: an International Journal
Evaluating Natural Language Processing Systems: An Analysis and Review

Evaluating Natural Language Processing Systems: An Analysis and Review
Evaluating message understanding systems: an analysis of the third message understanding conference (MUC-3)

Computational Linguistics
The TIPSTER SUMMAC Text Summarization Evaluation

EACL '99 Proceedings of the ninth conference on European chapter of the Association for Computational Linguistics
MUC-3 evaluation metrics

MUC3 '91 Proceedings of the 3rd conference on Message understanding
Query-relevant summarization using FAQs

ACL '00 Proceedings of the 38th Annual Meeting on Association for Computational Linguistics
Using query term order for result summarisation

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Summarization from medical documents: a survey

Artificial Intelligence in Medicine

Investigating sentence weighting components for automatic summarisation

Information Processing and Management: an International Journal
Predicting the readability of short web summaries

Proceedings of the Second ACM International Conference on Web Search and Data Mining
Progress in information retrieval

ECIR'06 Proceedings of the 28th European conference on Advances in Information Retrieval
Improving search result summaries by using searcher behavior data

Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval

Quantified Score

Hi-index	0.00

Visualization

Abstract

The aim of our research is to produce and assess short summaries to aid users' relevance judgements, for example for a search engine result page. In this paper we present our new metric for measuring summary quality based on representativeness and judgeability, and compare the summary quality of our system to that of Google. We discuss the basis for constructing our evaluation methodology in contrast to previous relevant open evaluations, arguing that the elements which make up an evaluation methodology: the tasks, data and metrics, are interdependent and the way in which they are combined is critical to the effectiveness of the methodology. The paper discusses the relationship between these three factors as implemented in our own work, as well as in SUMMAC/MUC/DUC.