A behavioural mode research on user-focus summarization

  • Authors:
  • Chong Teng;Naixue Xiong;Yanxiang He;Laurence T. Yang;Dexi Liu

  • Affiliations:
  • Computer School of Wuhan University, China;Department of Computer Science, Georgia State University, USA;Computer School of Wuhan University, China;Department of Computer Science, St. Francis Xavier University, Canada;School of Information Technology, Jiangxi University of Finance and Economics, China

  • Venue:
  • Mathematical and Computer Modelling: An International Journal
  • Year:
  • 2010

Quantified Score

Hi-index 0.98

Visualization

Abstract

Different persons often choose different contents in multi-document as summary. To optimize summarization, we will focus on the selection of content and seeking their valuable features. Statistical methods for automatic summarization are very important. In this paper, we research the correlation between the eigenvalue of content unit in the original document cluster and the probability of the content unit to be selected as a human summary based on a statistical method. When a Basic Element and word are considered as a content unit, we draw conclusions, in user-focus summarization. It is excellent that the BE is regarded as content unit granularity, and it is proved that the frequency eigenvalue of the BE is more suitable to embody content units' weightiness than the TFIDF value. Moreover, the paper reveals that the given topic on user-focus summarization is helpful for the selection of content unit and quality of summarization. They often choose those content units as a summary in which the emerging frequency is relatively high in the sentences including the content unit of a given topic and neighboring sentences. Through researching potential behavioural modes about manual summary, we will put these effect factors of summarization quality into the process of content unit selection and summary generation to optimize automatic summarization.