Two uses of anaphora resolution in summarization

  • Authors:
  • Josef Steinberger;Massimo Poesio;Mijail A. Kabadjov;Karel Jeek

  • Affiliations:
  • University of West Bohemia, Univerzitni 8, Pilsen 306 14, Czech Republic;University of Essex, Wivenhoe Park, Colchester CO4 3SQ, United Kingdom and Universitá di Trento, Rovereto, TN 38100, Italy;University of Essex, Wivenhoe Park, Colchester CO4 3SQ, United Kingdom;University of West Bohemia, Univerzitni 8, Pilsen 306 14, Czech Republic

  • Venue:
  • Information Processing and Management: an International Journal
  • Year:
  • 2007

Quantified Score

Hi-index 0.01

Visualization

Abstract

We propose a new method for using anaphoric information in Latent Semantic Analysis (LSA), and discuss its application to develop an LSA-based summarizer which achieves a significantly better performance than a system not using anaphoric information, and a better performance by the rouge measure than all but one of the single-document summarizers participating in DUC-2002. Anaphoric information is automatically extracted using a new release of our own anaphora resolution system, GUITAR, which incorporates proper noun resolution. Our summarizer also includes a new approach for automatically identifying the dimensionality reduction of a document on the basis of the desired summarization percentage. Anaphoric information is also used to check the coherence of the summary produced by our summarizer, by a reference checker module which identifies anaphoric resolution errors caused by sentence extraction.