Terms derived from frequent sequences for extractive text summarization

Authors:
Yulia Ledeneva;Alexander Gelbukh;René Arnulfo García-Hernández
Affiliations:
Natural Language and Text Processing Laboratory, Center for Computing Research, National Polytechnic Institute, Mexico;Natural Language and Text Processing Laboratory, Center for Computing Research, National Polytechnic Institute, Mexico;Instituto Tecnologico de Toluca, Mexico
Venue:
CICLing'08 Proceedings of the 9th international conference on Computational linguistics and intelligent text processing
Year:
2008

Cites 13 works (first list below); cited by 3 works (second list below)

Term-weighting approaches in automatic text retrieval

Information Processing and Management: an International Journal
A trainable document summarizer

SIGIR '95 Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval
Modern Information Retrieval

Modern Information Retrieval
Automatic evaluation of summaries using N-gram co-occurrence statistics

NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
Automated text summarization and the SUMMARIST system

TIPSTER '98 Proceedings of a workshop on held at Baltimore, Maryland: October 13-15, 1998
Random-Walk Term Weighting for Improved Text Classification

ICSC '07 Proceedings of the International Conference on Semantic Computing
Random walks on text structures

CICLing'06 Proceedings of the 7th international conference on Computational Linguistics and Intelligent Text Processing
Multi-document summarization based on BE-Vector clustering

CICLing'06 Proceedings of the 7th international conference on Computational Linguistics and Intelligent Text Processing
Deriving event relevance from the ontology constructed with formal concept analysis

CICLing'06 Proceedings of the 7th international conference on Computational Linguistics and Intelligent Text Processing
A new algorithm for fast discovery of maximal sequential patterns in a document collection

CICLing'06 Proceedings of the 7th international conference on Computational Linguistics and Intelligent Text Processing
Summarisation through discourse structure

CICLing'05 Proceedings of the 6th international conference on Computational Linguistics and Intelligent Text Processing
Automatic extraction and learning of keyphrases from scientific articles

CICLing'05 Proceedings of the 6th international conference on Computational Linguistics and Intelligent Text Processing
Using word sequences for text summarization

TSD'06 Proceedings of the 9th international conference on Text, Speech and Dialogue

Effect of Preprocessing on Extractive Summarization with Maximal Frequent Sequences

MICAI '08 Proceedings of the 7th Mexican International Conference on Artificial Intelligence: Advances in Artificial Intelligence
Text Summarization by Sentence Extraction Using Unsupervised Learning

MICAI '08 Proceedings of the 7th Mexican International Conference on Artificial Intelligence: Advances in Artificial Intelligence
RACE: a scalable and elastic parallel system for discovering repeats in very long sequences

Proceedings of the VLDB Endowment

Abstract

Automatic text summarization helps the user quickly understand large volumes of information. We present a language- and domain-independent statistical method for single-document extractive summarization, i.e., producing a text summary by extracting some of the sentences of the given text. We show experimentally that words that are parts of bigrams that repeat more than once in the text are good terms for describing the text's contents, as are the so-called maximal frequent sequences. We also show that using the term's frequency as its weight gives good results (where we count only the occurrences of a term in repeating bigrams).
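
The term-selection and weighting idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the tokenizer, the sentence splitter, the function names (`split_sentences`, `term_weights`, `summarize`), the `min_repeats` parameter, and the rule of scoring a sentence by the summed weights of its distinct terms are all assumptions made for illustration. The abstract states only that terms are words from repeating bigrams and that a term's weight is its frequency counted within those bigrams.

```python
import re
from collections import Counter


def tokenize(sentence):
    # Assumed tokenizer: lowercase alphanumeric runs, no stemming or stop-word removal.
    return re.findall(r"[a-z0-9]+", sentence.lower())


def split_sentences(text):
    # Assumed naive splitter: break after ., ! or ? followed by whitespace.
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def term_weights(sentences, min_repeats=2):
    """Weight each word by how often it occurs inside a repeating bigram."""
    tokenized = [tokenize(s) for s in sentences]
    # Count bigrams within sentences only (an assumption: none span a boundary).
    bigram_counts = Counter(
        (a, b) for toks in tokenized for a, b in zip(toks, toks[1:])
    )
    repeating = {bg for bg, n in bigram_counts.items() if n >= min_repeats}
    weights = Counter()
    for toks in tokenized:
        for i, word in enumerate(toks):
            in_left = i > 0 and (toks[i - 1], word) in repeating
            in_right = i + 1 < len(toks) and (word, toks[i + 1]) in repeating
            if in_left or in_right:
                weights[word] += 1  # one count per occurrence in a repeating bigram
    return weights


def summarize(text, n_sentences=1):
    """Return the top-scoring sentences in their original order."""
    sentences = split_sentences(text)
    weights = term_weights(sentences)
    # Assumed scoring rule: sum of the weights of the distinct terms in a sentence.
    scores = [sum(weights[w] for w in set(tokenize(s))) for s in sentences]
    top = sorted(range(len(sentences)), key=lambda i: (-scores[i], i))[:n_sentences]
    return [sentences[i] for i in sorted(top)]
```

With `min_repeats=2`, a bigram qualifies when it occurs more than once in the text, matching the abstract's wording. Ties between equally scored sentences are broken in favor of the earlier sentence, which is another illustrative choice.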