Probabilistic latent semantic indexing
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Information diffusion through blogspace
Proceedings of the 13th international conference on World Wide Web
A probabilistic approach to spatiotemporal theme pattern mining on weblogs
Proceedings of the 15th international conference on World Wide Web
A comparison of feature selection methods for an evolving RSS feed corpus
Information Processing and Management: an International Journal - Special issue: Informetrics
Detecting cyber security threats in weblogs using probabilistic models
PAISI'07 Proceedings of the 2007 Pacific Asia conference on Intelligence and security informatics
ECIR'06 Proceedings of the 28th European conference on Advances in Information Retrieval
Applications of Data Mining in E-Business Finance: Introduction
Proceedings of the 2008 conference on Applications of Data Mining in E-Business and Finance
Hi-index | 0.00 |
Blogs, or weblogs, have rapidly gained in popularity over the past decade. Because of the huge volume of existing blog posts, information in the blogosphere is difficult to access and retrieve. Existing studies have focused on analyzing personal blogs, but few have looked at corporate blogs, the numbers of which are dramatically rising. In this paper, we use probabilistic latent semantic analysis to detect keywords from corporate blogs with respect to certain topics. We then demonstrate how this method can represent the blogosphere in terms of topics with measurable keywords, hence tracking popular conversations and topics in the blogosphere. By applying a probabilistic approach, we can improve information retrieval in blog search and keywords detection, and provide an analytical foundation for the future of corporate blog search and mining.