The blogosphere has grown into a mainstream forum of social interaction as well as a commercially attractive source of information and influence. Tools are needed to better understand how the communities that form around individual blogs are constituted, both to facilitate new personal, socially focused browsing paradigms and to understand how blog content is consumed, which is of interest to blog authors, big media, and search. We present a novel approach to blog subcommunity characterization: we model individual blog readers using mixtures from Ngram Topic over Time (NTOT), an extension to the LDA family that jointly models phrases and time, and then cluster the readers with Affinity Propagation under a number of similarity measures. We experiment with two datasets: a small set of blogs whose authors provide feedback, and a set of popular, highly commented blogs, which provides indicators of the algorithm's scalability and of its interpretability without prior knowledge of a given blog. The results offer useful insight to blog authors about their commenting community, and for unfamiliar blogs they are observed to offer an integrated perspective on the topics of discussion and the members engaged in those discussions. Our approach also holds promise as a component of solutions to related problems, such as online entity resolution and role discovery.
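To make the clustering input concrete, the following is a minimal sketch of how a pairwise similarity matrix over readers might be built from per-reader topic mixtures. The abstract does not name the similarity measures used, so the choice of negative Jensen-Shannon divergence here is an assumption, as are the function names, the reader names, and the toy mixtures. A matrix of this kind is the sort of input that a similarity-based method such as Affinity Propagation takes.

```python
import math


def js_divergence(p, q):
    """Jensen-Shannon divergence (base 2) between two discrete distributions."""
    # Midpoint distribution between p and q.
    m = [(a + b) / 2 for a, b in zip(p, q)]

    def kl(x, y):
        # Kullback-Leibler divergence; zero-probability terms contribute 0.
        return sum(a * math.log2(a / b) for a, b in zip(x, y) if a > 0)

    return 0.5 * kl(p, m) + 0.5 * kl(q, m)


def similarity_matrix(mixtures):
    """Pairwise similarities, defined here (as an assumption) as -JS divergence."""
    names = list(mixtures)
    return {
        a: {b: -js_divergence(mixtures[a], mixtures[b]) for b in names}
        for a in names
    }


# Hypothetical topic mixtures for three readers over three topics.
readers = {
    "reader_a": [0.8, 0.1, 0.1],
    "reader_b": [0.7, 0.2, 0.1],
    "reader_c": [0.1, 0.1, 0.8],
}

sim = similarity_matrix(readers)
for name, row in sim.items():
    print(name, {other: round(value, 3) for other, value in row.items()})
```

With base-2 logarithms the divergence lies between 0 and 1, so every similarity falls in the range from -1 to 0, and readers with closer topic mixtures score nearer to 0.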