Weblogs, or blogs, have rapidly gained popularity over the past few years. In particular, the growth of business blogs, written by businesses and companies or providing commentary on them, opens up new opportunities for developing blog-specific search and mining techniques. In this paper, we propose probabilistic models for blog search and mining using two machine learning techniques, Latent Semantic Analysis (LSA) and Probabilistic Latent Semantic Analysis (PLSA). We implement the models on our database of business blogs, with the aim of achieving higher precision and recall. The probabilistic model segments the business blogs into separate topic areas, which is useful for keyword detection in the blogosphere. We also study various term-weighting schemes and factor values in detail, which reveals interesting patterns in our database of business blogs. Our study points to domain-driven data mining techniques that can strengthen business intelligence in complex enterprise applications.
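To make the roles of the term-weighting scheme and the factor value concrete, the following is a minimal sketch of LSA on a toy term-document matrix. It is not the implementation used in the paper: the tf-idf variant, the toy counts, the vocabulary, and the function names (`tfidf`, `lsa`, `doc_similarity`) are all assumptions chosen for illustration, and `k` stands in for the number of latent factors.

```python
import numpy as np

def tfidf(counts):
    """Weight a terms x documents count matrix by inverse document frequency.

    Assumed scheme for illustration: weight = tf * log(N / df).
    """
    n_docs = counts.shape[1]
    df = np.count_nonzero(counts, axis=1)        # documents containing each term
    idf = np.log(n_docs / np.maximum(df, 1))     # guard against df == 0
    return counts * idf[:, None]

def lsa(weighted, k):
    """Truncated SVD keeping the k largest singular values (the 'factors')."""
    U, s, Vt = np.linalg.svd(weighted, full_matrices=False)
    return U[:, :k], s[:k], Vt[:k, :]

def doc_similarity(s, Vt):
    """Cosine similarity between documents in the k-dimensional latent space."""
    docs = (s[:, None] * Vt).T                   # documents x k coordinates
    norms = np.linalg.norm(docs, axis=1, keepdims=True)
    docs = docs / np.maximum(norms, 1e-12)
    return docs @ docs.T

# Hypothetical vocabulary and counts: documents 0-1 discuss products,
# documents 2-3 discuss finance, and "blog" appears in every document.
terms = ["product", "launch", "market", "earnings", "profit", "shares", "blog"]
counts = np.array([
    [2, 1, 0, 0],
    [1, 2, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 3, 1],
    [0, 0, 1, 2],
    [0, 0, 1, 1],
    [1, 1, 1, 1],
], dtype=float)

weighted = tfidf(counts)
U, s, Vt = lsa(weighted, k=2)
sim = doc_similarity(s, Vt)
print(np.round(sim, 3))
```

In this toy setup the term that occurs in every document receives zero weight, and with two factors the documents fall into two groups that correspond to the two subject areas. Changing the weighting function or the value of `k` changes the latent space, which is the kind of variation the study of term-weighting schemes and factor values refers to.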