Dependency-based sentence alignment for multiple document summarization

  • Authors:
  • Tsutomu Hirao;Jun Suzuki;Hideki Isozaki;Eisaku Maeda

  • Affiliations:
  • NTT Communication Science Laboratories, NTT Corp., Seika-cho, Soraku-gun, Kyoto, Japan;NTT Communication Science Laboratories, NTT Corp., Seika-cho, Soraku-gun, Kyoto, Japan;NTT Communication Science Laboratories, NTT Corp., Seika-cho, Soraku-gun, Kyoto, Japan;NTT Communication Science Laboratories, NTT Corp., Seika-cho, Soraku-gun, Kyoto, Japan

  • Venue:
  • COLING '04 Proceedings of the 20th international conference on Computational Linguistics
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we describe a method of automatic sentence alignment for building extracts from abstracts in automatic summarization research. Our method is based on two steps. First, we introduce the "dependency tree path" (DTP). Next, we calculate the similarity between DTPs based on the ESK (Extended String Subsequence Kernel), which considers sequential patterns. By using these procedures, we can derive one-to-many or many-to-one correspondences among sentences. Experiments using different similarity measures show that DTP consistently improves the alignment accuracy and that ESK gives the best performance.