The Journal of Machine Learning Research
A Bayesian Hierarchical Model for Learning Natural Scene Categories
CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 2 - Volume 02
Automatic evaluation of summaries using N-gram co-occurrence statistics
NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
A Probabilistic Approach to Multi-document Summarization for Generating a Tiled Summary
ICCIMA '05 Proceedings of the Sixth International Conference on Computational Intelligence and Multimedia Applications
Pachinko allocation: DAG-structured mixture models of topic correlations
ICML '06 Proceedings of the 23rd international conference on Machine learning
ACL '04 Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
Mixtures of hierarchical topics with Pachinko allocation
Proceedings of the 24th international conference on Machine learning
Focused multi-document summarization: human summarization activity vs. automated systems techniques
Journal of Computing Sciences in Colleges
CorrRank: update summarization based on topic correlation analysis
ICIC'10 Proceedings of the Advanced intelligent computing theories and applications, and 6th international conference on Intelligent computing
A latent topic extracting method based on events in a document and its application
HLT-SS '11 Proceedings of the ACL 2011 Student Session
Toward a Unified Framework for Standard and Update Multi-Document Summarization
ACM Transactions on Asian Language Information Processing (TALIP)
Incorporating word correlation into tag-topic model for semantic knowledge acquisition
Proceedings of the 21st ACM international conference on Information and knowledge management
Obtaining single document summaries using latent dirichlet allocation
ICONIP'12 Proceedings of the 19th international conference on Neural Information Processing - Volume Part IV
TopicDSDR: combining topic decomposition and data reconstruction for summarization
WAIM'13 Proceedings of the 14th international conference on Web-Age Information Management
Hi-index | 0.00 |
Extraction based Multi-Document Summarization Algorithms consist of choosing sentences from the documents using some weighting mechanism and combining them into a summary. In this article we use Latent Dirichlet Allocation to capture the events being covered by the documents and form the summary with sentences representing these different events. Our approach is distinguished from existing approaches in that we use mixture models to capture the topics and pick up the sentences without paying attention to the details of grammar and structure of the documents. Finally we present the evaluation of the algorithms on the DUC 2002 Corpus multi-document summarization tasks using the ROUGE evaluator to evaluate the summaries. Compared to DUC 2002 winners, our algorithms gave significantly better ROUGE-1 recall measures.