Attention, intentions, and the structure of discourse
Computational Linguistics
Automatic text structuring experiments
Text-based intelligent systems
Automatic structuring and retrieval of large text files
Communications of the ACM
Textual context analysis for information retrieval
Proceedings of the 20th annual international ACM SIGIR conference on Research and development in information retrieval
ECDL '97 Proceedings of the First European Conference on Research and Advanced Technology for Digital Libraries
Multi-Paragraph Segmentation of Expository Texts
Multi-Paragraph Segmentation of Expository Texts
TextTiling: A Quantitative Approach to Discourse
TextTiling: A Quantitative Approach to Discourse
Lexical cohesion computed by thesaural relations as an indicator of the structure of text
Computational Linguistics
Text segmentation based on similarity between words
ACL '93 Proceedings of the 31st annual meeting on Association for Computational Linguistics
Word association norms, mutual information, and lexicography
ACL '89 Proceedings of the 27th annual meeting on Association for Computational Linguistics
Combining multiple knowledge sources for discourse segmentation
ACL '95 Proceedings of the 33rd annual meeting on Association for Computational Linguistics
An automatic method of finding topic boundaries
ACL '94 Proceedings of the 32nd annual meeting on Association for Computational Linguistics
Text Segmentation into Paragraphs Based on Local Text Cohesion
TSD '01 Proceedings of the 4th International Conference on Text, Speech and Dialogue
Using collocations for topic segmentation and link detection
COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Grasping related words of unknown word for automatic extension of lexical dictionary
Proceedings of the 1st international conference on Forensic applications and techniques in telecommunications, information, and multimedia and workshop
Integrated Computer-Aided Engineering
Hi-index | 0.00 |
A method is presented for segmenting text into subtopic areas. The proportion of related pairwise words is calculated between adjacent windows of text to determine their lexical similarity. The lexical cohesion relations of reiteration and collocation are used to identify related words. These relations are automatically located using a combination of three linguistic features: word repetition, collocation and relation weights. This method is shown to successfully detect known subject changes in text and corresponds well to the segmentations placed by test subjects.