Manual and automatic evaluation of summaries

  • Authors:
  • Chin-Yew Lin; Eduard Hovy

  • Affiliations:
  • USC Information Sciences Institute, Marina del Rey, CA; USC Information Sciences Institute, Marina del Rey, CA

  • Venue:
  • AS '02 Proceedings of the ACL-02 Workshop on Automatic Summarization - Volume 4
  • Year:
  • 2002


Abstract

In this paper we discuss manual and automatic evaluations of summaries using data from the Document Understanding Conference 2001 (DUC-2001). We first show the instability of manual evaluation: in particular, the low inter-human agreement indicates that more reference summaries are needed. To investigate the feasibility of automated summary evaluation based on the recent BLEU method from machine translation, we use accumulative n-gram overlap scores between system and human summaries. The initial results show encouraging correlations with human judgments, as measured by the Spearman rank-order correlation coefficient. However, relative rankings of systems need to take this instability into account.
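
The abstract does not give the scoring formula, but a minimal sketch of the kind of accumulative n-gram overlap it describes might look like the following. This is an illustrative assumption, not the authors' implementation; the function names, the clipping of matches, and the averaging over n-gram sizes 1..4 are all choices made here for the example.

```python
# Sketch (assumed, not from the paper): n-gram overlap between a system
# summary and a set of human reference summaries, accumulated over n = 1..4.
from collections import Counter

def ngrams(tokens, n):
    """Return a Counter of all n-grams (as tuples) in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))

def ngram_overlap_score(system_tokens, reference_token_lists, max_n=4):
    """Accumulative n-gram overlap: clipped matches divided by the number of
    reference n-grams, averaged over n-gram sizes 1..max_n (one simple way
    to accumulate; the paper may weight or combine sizes differently)."""
    scores = []
    for n in range(1, max_n + 1):
        sys_counts = ngrams(system_tokens, n)
        matches, total = 0, 0
        for ref_tokens in reference_token_lists:
            ref_counts = ngrams(ref_tokens, n)
            total += sum(ref_counts.values())
            # Clip each match at the count observed in the system summary.
            matches += sum(min(c, sys_counts.get(g, 0)) for g, c in ref_counts.items())
        scores.append(matches / total if total else 0.0)
    return sum(scores) / len(scores)

# Toy usage: score one system summary against two reference summaries.
system = "the cat sat on the mat".split()
references = ["the cat was on the mat".split(), "a cat sat on a mat".split()]
print(round(ngram_overlap_score(system, references), 3))
```

Automatic scores computed this way for each system could then be rank-correlated with the corresponding human judgments, for example with `scipy.stats.spearmanr`, to obtain the Spearman rank-order correlation coefficient mentioned in the abstract.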