A study for documents summarization based on personal annotation

  • Authors:
  • Haiqin Zhang;Zheng Chen Wei-ying Ma;Qingsheng Cai

  • Affiliations:
  • University of Science and Technology of China;Microsoft Research Asia;University of Science and Technology of China

  • Venue:
  • HLT-NAACL-DUC '03 Proceedings of the HLT-NAACL 03 on Text summarization workshop - Volume 5
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

For one document, current summarization systems produce a uniform version of summary for all users. Personalized summarizations are necessary in order to represent users' preferences and interests. Annotation is getting important for document sharing and collaborative filtering, which in fact record users' dynamic behaviors compared to traditional steady profiles. In this paper we introduce a new summarization system based on users' annotations. Annotations and their contexts are extracted to represent features of sentences, which are given different weights for representation of the document. Our system produces two versions of summaries for each document: generic summary without considering annotations and annotation-based summary. Since annotation is a kind of personal data, annotation-based summary is tailored to user's interests to some extent. We show by experiments that annotations can help a lot in improving summarization performance compared to no annotation consideration. At the same time, we make an extensive study on users' annotating behaviors and annotations distribution, and propose a variety of techniques to evaluate the relationships between annotations and summaries, such as how the number of annotations affects the summarizing performance. A study about collaborative filtering is also made to evaluate the summarization based on annotations of similar users.