A compositional context sensitive multi-document summarizer: exploring the factors that influence summarization

Authors:
Ani Nenkova; Lucy Vanderwende; Kathleen McKeown
Affiliations:
Stanford University; Microsoft Research; Columbia University
Venue:
SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Year:
2006


Abstract

The usual approach to automatic summarization is sentence extraction, in which key sentences from the input documents are selected based on a suite of features. Although word frequency is often used as a feature in summarization, its impact on system performance has not been isolated. In this paper, we study how three frequency-related factors contribute to summarization: content word frequency, composition functions for estimating sentence importance from word frequency, and adjustment of frequency weights based on context. We carry out our analysis using datasets from the Document Understanding Conferences, studying not only the impact of these features on automatic summarizers but also their role in human summarization. Our research shows that a frequency-based summarizer can achieve performance comparable to that of state-of-the-art systems, but only with a good composition function; context sensitivity improves performance and significantly reduces repetition.
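
The three factors named in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the abstract does not specify the composition function or the context adjustment, so the choices below (averaging word probabilities to score a sentence, and squaring the probability of words already covered by the summary) are assumptions made for illustration, as are the stopword list and the names `content_words` and `summarize`.

```python
import re
from collections import Counter

# Hypothetical stopword list, for illustration only.
STOPWORDS = {"the", "a", "an", "of", "in", "on", "and", "to", "is", "are",
             "was", "were", "for", "that", "by", "with"}


def content_words(sentence):
    """Lowercase the sentence and keep alphabetic tokens that are not stopwords."""
    return [w for w in re.findall(r"[a-z]+", sentence.lower())
            if w not in STOPWORDS]


def summarize(sentences, max_sentences=2):
    """Greedy frequency-based sentence extraction (illustrative sketch)."""
    tokenized = [content_words(s) for s in sentences]

    # Factor 1: content word frequency, as a probability over the whole input.
    counts = Counter(w for words in tokenized for w in words)
    total = sum(counts.values())
    prob = {w: c / total for w, c in counts.items()}

    def score(i):
        # Factor 2: composition function. Assumed here: average word probability.
        return sum(prob[w] for w in tokenized[i]) / len(tokenized[i])

    chosen = []
    remaining = [i for i, words in enumerate(tokenized) if words]
    while remaining and len(chosen) < max_sentences:
        best = max(remaining, key=score)
        chosen.append(best)
        remaining.remove(best)
        # Factor 3: context adjustment. Assumed here: square the probability of
        # every word in the chosen sentence so repeated content scores lower.
        for w in set(tokenized[best]):
            prob[w] **= 2
    return [sentences[i] for i in chosen]


sentences = [
    "The storm hit the coast.",
    "The storm hit the coast hard.",
    "Rescue teams arrived.",
]
summary = summarize(sentences, max_sentences=2)
print(summary)
```

In this toy input the first sentence scores highest on average word probability. Once its words are down-weighted, the near-duplicate second sentence falls below the third, so the summary avoids repeating the same content.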