This paper describes work to enhance a sentence-based summarizer with notions of salience, dynamically adjustable summary size, discourse segmentation, and awareness of topic shifts. Our experiments study strategies to diversify the application of a baseline summarizer by making it aware of finer-grained 'aboutness', capable of discerning changes of topic, and sensitive to longer-than-usual documents. Evaluated against the corpus used in the development of the baseline summarizer, summaries derived either by segmentation analysis alone or by a mix of strategies for combining salience calculation and topic shift detection are shown to be of comparable, and under certain conditions even better, quality. We describe the summarization and segmentation procedures, outline a number of strategies for mixing the two, evaluate the overall impact of discourse segmentation, and suggest an interface design that uses the notion of topic shifts to contextualize a summary and to mediate between it and the full source document.