The use of MMR, diversity-based reranking for reordering documents and producing summaries
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Automated multi-document summarization in NeATS
HLT '02 Proceedings of the second international conference on Human Language Technology Research
Identifying the influential bloggers in a community
WSDM '08 Proceedings of the 2008 International Conference on Web Search and Data Mining
Document summarization using conditional random fields
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Succinct and informative cluster descriptions for document repositories
WAIM '06 Proceedings of the 7th international conference on Advances in Web-Age Information Management
Hi-index | 0.00 |
Analyzing/tracking weblogs by given communities (ATWC) is increasingly important for sociologists and government agencies, etc. This paper introduces an approach to address the needs of ATWC by using concise discriminative weblog collection representatives (DCRs). DCRs are aimed at helping users to quickly identify the major themes/trends in such collections, and to quickly identify important shifts/differences in major themes and trends of blogs by given communities over time and space. We propose to use the quality of DCR-based classifiers to measure DCRs' quality. We present algorithms for constructing DCRs, report experimental results to evaluate the efficiency of the algorithms and the quality of the DCRs they construct, and provide real-data examples to demonstrate the usefulness of DCRs for ATWC.