Automatic generic document summarization based on non-negative matrix factorization

  • Authors:
  • Ju-Hong Lee; Sun Park; Chan-Min Ahn; Daeho Kim

  • Affiliations:
  • Department of Computer Science and Information Engineering, Inha University, Incheon, Republic of Korea; Department of Computer Engineering, Honam University, Gwangju, Republic of Korea; Department of Computer Science and Information Engineering, Inha University, Incheon, Republic of Korea; Department of Communication and Information, Inha University, Incheon, Republic of Korea

  • Venue:
  • Information Processing and Management: an International Journal
  • Year:
  • 2009

Abstract

Existing unsupervised methods use Latent Semantic Analysis (LSA) for sentence selection. However, the results are less meaningful because LSA uses singular vectors as the bases for selecting sentences from the given documents, and singular vector components can take negative values. We propose a new unsupervised method that uses Non-negative Matrix Factorization (NMF) to select sentences for automatic generic document summarization. The proposed method imposes non-negativity constraints, which more closely resemble the human cognition process. As a result, it selects more meaningful sentences for generic document summarization than LSA does.
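
The idea in the abstract can be illustrated with a minimal sketch. It is not taken from the paper: the function names, the raw-count term weighting, the multiplicative-update solver, and the scoring rule (each sentence's loadings on the latent features, weighted by each feature's share of the total mass in H) are all assumptions chosen for illustration.

```python
import numpy as np


def term_sentence_matrix(sentences):
    """Build a non-negative term-by-sentence count matrix (assumed weighting: raw counts)."""
    vocab = sorted({w for s in sentences for w in s.lower().split()})
    index = {w: i for i, w in enumerate(vocab)}
    A = np.zeros((len(vocab), len(sentences)))
    for j, s in enumerate(sentences):
        for w in s.lower().split():
            A[index[w], j] += 1.0
    return A


def nmf(A, r, iters=200, seed=0, eps=1e-9):
    """Approximate A by W @ H with W, H >= 0 using multiplicative updates."""
    rng = np.random.default_rng(seed)
    m, n = A.shape
    W = rng.random((m, r)) + eps
    H = rng.random((r, n)) + eps
    for _ in range(iters):
        # Each update multiplies by a non-negative ratio, so signs never flip.
        H *= (W.T @ A) / (W.T @ W @ H + eps)
        W *= (A @ H.T) / (W @ H @ H.T + eps)
    return W, H


def select_sentences(sentences, r=2, k=2):
    """Return the k highest-scoring sentences in document order, plus W and H."""
    A = term_sentence_matrix(sentences)
    W, H = nmf(A, r)
    # Assumed scoring rule: weight each latent feature by its share of H,
    # then score each sentence by its weighted loadings on those features.
    weights = H.sum(axis=1) / H.sum()
    scores = weights @ H
    top = sorted(np.argsort(scores)[::-1][:k])
    return [sentences[j] for j in top], W, H


if __name__ == "__main__":
    doc = [
        "cats chase mice",
        "dogs chase cats",
        "stocks fell sharply today",
        "markets and stocks fell",
    ]
    summary, W, H = select_sentences(doc, r=2, k=2)
    print(summary)
```

Because every entry of W and H is non-negative, each column of H can be read as how strongly a sentence expresses each latent feature, with no cancellation between positive and negative components. The singular vectors returned by `np.linalg.svd` on the same matrix generally contain entries of mixed sign, which is the interpretability problem the abstract attributes to LSA.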