A Statistical Approach for Automatic Text Summarization by Extraction

Authors:
Munehs Chandra;Vikrant Gupta;Santosh Kr. Paul
Affiliations:
-;-;-
Venue:
CSNT '11 Proceedings of the 2011 International Conference on Communication Systems and Network Technologies
Year:
2011

Citing 0
Cited 2

A picture paints a thousand words: a method of generating image-text timelines

Proceedings of the 21st ACM international conference on Information and knowledge management
A cross-media evolutionary timeline generation framework based on iterative recommendation

Proceedings of the 3rd ACM conference on International conference on multimedia retrieval

Quantified Score

Hi-index	0.00

Visualization

Abstract

Automatic Document Summarization is a highly interdisciplinary research area related with computer science as well as cognitive psychology. This Summarization is to compress an original document into a summarized version by extracting almost all of the essential concepts with text mining techniques. This research focuses on developing a statistical automatic text summarization approach, Kmixture probabilistic model, to enhancing the quality of summaries. KSRS employs the K-mixture probabilistic model to establish term weights in a statistical sense, and further identifies the term relationships to derive the semantic relationship significance (SRS) of nouns. Sentences are ranked and extracted based on their semantic relationship significance values. The objective of this research is thus to propose a statistical approach to text summarization. We propose a K-mixture semantic relationship significance (KSRS) approach to enhancing the quality of document summary results. The K-mixture probabilistic model is used to determine the term weights. Term relationships are then investigated to develop the semantic relationship of nouns that manifests sentence semantics. Sentences with significant semantic relationship, nouns are extracted to form the summary accordingly.