Generating Text Summaries through the Relative Importance of Topics

Authors:
Joel Larocca Neto;Alexandre Santos;Celso A. A. Kaestner;Alex Alves Freitas
Affiliations:
-;-;-;-
Venue:
IBERAMIA-SBIA '00 Proceedings of the International Joint Conference, 7th Ibero-American Conference on AI: Advances in Artificial Intelligence
Year:
2000

Citing 4
Cited 8

An algorithm for suffix stripping

Readings in information retrieval
Term-weighting approaches in automatic text retrieval

Readings in information retrieval
Foundations of statistical natural language processing

Foundations of statistical natural language processing
TextTiling: A Quantitative Approach to Discourse

TextTiling: A Quantitative Approach to Discourse

Combining Multiple Features for Automatic Text Summarization through Machine Learning

PROPOR '08 Proceedings of the 8th international conference on Computational Processing of the Portuguese Language
A new approach to improving multilingual summarization using a genetic algorithm

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Experiments with CST-based multidocument summarization

TextGraphs-5 Proceedings of the 2010 Workshop on Graph-based Methods for Natural Language Processing
Selecting a feature set to summarize texts in brazilian portuguese

IBERAMIA-SBIA'06 Proceedings of the 2nd international joint conference, and Proceedings of the 10th Ibero-American Conference on AI 18th Brazilian conference on Advances in Artificial Intelligence
Text summarisation in progress: a literature review

Artificial Intelligence Review
SABIO: an automatic portuguese text summarizer through artificial neural networks in a more biologically plausible model

PROPOR'06 Proceedings of the 7th international conference on Computational Processing of the Portuguese Language
A zipf-like distant supervision approach for multi-document summarization using wikinews articles

SPIRE'12 Proceedings of the 19th international conference on String Processing and Information Retrieval
Cross-lingual training of summarization systems using annotated corpora in a foreign language

Information Retrieval

Quantified Score

Hi-index	0.00

Visualization

Abstract

This work proposes a new extractive text-summarization algorithm based on the importance of the topics contained in a document. The basic ideas of the proposed algorithm are as follows. At first the document is partitioned by using the TextTiling algorithm, which identifies topics (coherent segments of text) based on the TF-IDF metric. Then for each topic the algorithm computes a measure of its relative relevance in the document. This measure is computed by using the notion of TF-ISF (Term Frequency - Inverse Sentence Frequency), which is our adaptation of the well-known TF-IDF (Term Frequency - Inverse Document Frequency) measure in information retrieval. Finally, the summary is generated by selecting from each topic a number of sentences proportional to the importance of that topic.