Lexical cohesion based topic modeling for summarization

Authors:
Gonenc Ercan; Ilyas Cicekli
Affiliations:
Dept. of Computer Engineering, Bilkent University, Ankara, Turkey; Dept. of Computer Engineering, Bilkent University, Ankara, Turkey
Venue:
CICLing'08 Proceedings of the 9th international conference on Computational linguistics and intelligent text processing
Year:
2008

Citing 5

Lexical cohesion computed by thesaural relations as an indicator of the structure of text
Computational Linguistics

Automatic evaluation of summaries using N-gram co-occurrence statistics
NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1

Feature-rich part-of-speech tagging with a cyclic dependency network
NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1

Using lexical chains for keyword extraction
Information Processing and Management: an International Journal

Improving word sense disambiguation in lexical chaining
IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence

Cited 7

Top-down cohesion segmentation in summarization
STEP '08 Proceedings of the 2008 Conference on Semantics in Text Processing

Text summarization of Turkish texts using latent semantic analysis
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics

Automatic categorization and summarization of documentaries
Journal of Information Science

Text summarization using Latent Semantic Analysis
Journal of Information Science

Text summarisation in progress: a literature review
Artificial Intelligence Review

Extraction of the contents in the web texts by content-density distribution
International Journal of Knowledge Engineering and Soft Data Paradigms

Extraction of web texts using content-density distribution
AIRS'11 Proceedings of the 7th Asia conference on Information Retrieval Technology


Abstract

In this paper, we attack the problem of forming extracts for text summarization. Forming an extract involves selecting the most representative and significant sentences from the text. Our method uses the lexical cohesion structure of the text to evaluate the significance of sentences. Lexical chains have been used in summarization research to analyze lexical cohesion structure and to represent the topics in a text. Our algorithm represents topics by sets of co-located lexical chains in order to take advantage of more lexical cohesion clues. It then segments the text with respect to each topic and finds the most important topic segments. Our summarization algorithm achieves better results than some other lexical chain based algorithms.
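
The pipeline outlined in the abstract (lexical chains, topics as sets of co-located chains, per-topic segmentation, selection of important segments) can be illustrated with a toy sketch. This is not the authors' implementation, and the abstract does not specify any of the details used here. The `CONCEPTS` table is a hypothetical stand-in for thesaurus relations, and the Jaccard threshold `min_overlap`, the gap limit `max_gap`, and the scoring of a segment by its length are all assumptions made only for illustration.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical toy thesaurus: word -> concept label (stand-in for real thesaural relations).
CONCEPTS = {"car": "vehicle", "truck": "vehicle", "engine": "vehicle",
            "road": "traffic", "highway": "traffic",
            "bank": "finance", "loan": "finance"}

def build_chains(sentences):
    """Map each concept to the sorted sentence indices where one of its words occurs."""
    chains = defaultdict(set)
    for i, sent in enumerate(sentences):
        for word in sent.lower().split():
            concept = CONCEPTS.get(word.strip(".,"))
            if concept:
                chains[concept].add(i)
    return {c: sorted(idx) for c, idx in chains.items()}

def group_colocated(chains, min_overlap=0.5):
    """Merge chains whose sentence sets overlap (Jaccard >= min_overlap) into topics."""
    parent = {c: c for c in chains}

    def find(c):
        while parent[c] != c:
            c = parent[c]
        return c

    for a, b in combinations(chains, 2):
        sa, sb = set(chains[a]), set(chains[b])
        if len(sa & sb) / len(sa | sb) >= min_overlap:
            parent[find(a)] = find(b)
    topics = defaultdict(set)
    for c in chains:
        topics[find(c)].update(chains[c])
    return [sorted(s) for s in topics.values()]

def segments(indices, max_gap=1):
    """Split a topic's sentence indices into runs whose gaps are at most max_gap."""
    runs, current = [], [indices[0]]
    for i in indices[1:]:
        if i - current[-1] <= max_gap:
            current.append(i)
        else:
            runs.append(current)
            current = [i]
    runs.append(current)
    return runs

def extract(sentences, n=2):
    """Return the first sentence of each of the n longest topic segments (assumed scoring)."""
    topics = group_colocated(build_chains(sentences))
    segs = [seg for topic in topics for seg in segments(topic)]
    segs.sort(key=lambda s: (-len(s), s[0]))  # longest first, ties by position
    chosen = sorted({seg[0] for seg in segs[:n]})
    return [sentences[i] for i in chosen]
```

Calling `extract` on a list of sentence strings returns the selected sentences in document order. A realistic system would replace `CONCEPTS` with relations from a thesaurus and use a more informed segment score; the sketch only shows how the stages named in the abstract could fit together.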