We introduce the task of summarizing a stream of short documents on microblogs such as Twitter. On microblogs, users post thousands of short documents on a given topic, such as a sports match or a TV drama. Two notable characteristics of microblog data are that documents are often highly redundant and are aligned on a timeline. There can be thousands of documents on a single event within a topic, and two very similar documents can refer to two distinct events when they are temporally distant. We examine microblog data to better understand these characteristics, and propose a summarization model for a stream of short documents on a timeline, along with a fast approximate algorithm for generating a summary. We empirically show that our model generates good summaries on datasets of microblog documents about sports matches.