Automatic condensation of electronic publications by sentence selection
Information Processing and Management: an International Journal - Special issue: summarizing text
New Methods in Automatic Extracting
Journal of the ACM (JACM)
TextTiling: segmenting text into multi-paragraph subtopic passages
Computational Linguistics
Abstract generation based on rhetorical structure extraction
COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 1
A method for abstracting newspaper articles by using surface clues
COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 2
Fast generation of abstracts from general domain text corpora by extracting relevant sentences
COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 2
Using position, fonts and cited references to retrieve scientific documents
Journal of Information Science
Hi-index | 0.00 |
This paper proposes a method of selecting important sentences from a text based on the evaluation of the connectivity between sentences by using surface information. We assume that the title of a text is the most concise statement which expresses the most essential information of the text, and that the closer a sentence relates to an important sentence, the more important this sentence is. The importance of a sentence is defined as the connectivity between the sentence and the title. The connectivity between two sentences is measured based on correference between a pronoun and a preceding (pro)noun, and on lexical cohesion of lexical items. In an experiment with 80 English texts, which consist of an average of 29.0 sentences, the proposed method has marked recall of 78.2% and precision of 57.7%, with the selection ratio being 25%. The recall and precision values surpass those achieved by conventional methods, which means that our method is more effective in abridging relatively short texts.