Pivoted document length normalization
SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
Probabilistic latent semantic indexing
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Information diffusion through blogspace
Proceedings of the 13th international conference on World Wide Web
A probabilistic approach to spatiotemporal theme pattern mining on weblogs
Proceedings of the 15th international conference on World Wide Web
A comparison of feature selection methods for an evolving RSS feed corpus
Information Processing and Management: an International Journal - Special issue: Informetrics
ECIR'06 Proceedings of the 28th European conference on Advances in Information Retrieval
Getting insights from the voices of customers: Conversation mining at a contact center
Information Sciences: an International Journal
TSVM-HMM: Transductive SVM based hidden Markov model for automatic image annotation
Expert Systems with Applications: An International Journal
A blog article recommendation generating mechanism using an SBACPSO algorithm
Expert Systems with Applications: An International Journal
Expert Systems with Applications: An International Journal
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 3 - Volume 3
Evaluation of novelty metrics for sentence-level novelty mining
Information Sciences: an International Journal
Applying text and data mining techniques to forecasting the trend of petitions filed to e-People
Expert Systems with Applications: An International Journal
Detecting novel business blogs
ICICS'09 Proceedings of the 7th international conference on Information, communications and signal processing
Web-based geographic search engine for location-aware search in Singapore
Expert Systems with Applications: An International Journal
Multilingual novelty detection
Expert Systems with Applications: An International Journal
Dimensionality reduction techniques for blog visualization
Expert Systems with Applications: An International Journal
An intelligent system for sentence retrieval and novelty mining
International Journal of Knowledge Engineering and Data Mining
A tag-topic model for blog mining
Expert Systems with Applications: An International Journal
Database optimization for novelty mining of business blogs
Expert Systems with Applications: An International Journal
Dimensionality reduction for blog tag mining
International Journal of Web Engineering and Technology
Applying the data fusion technique to blog opinion retrieval
Expert Systems with Applications: An International Journal
Identifying the signs of fraudulent accounts using data mining techniques
Computers in Human Behavior
Blogger-Link-Topic model for blog mining
PAKDD'11 Proceedings of the 15th international conference on New Frontiers in Applied Data Mining
A data-centric approach to feed search in blogs
International Journal of Web Engineering and Technology
International Journal of Advanced Pervasive and Ubiquitous Computing
Probabilistic Models for Social Media Mining
International Journal of Information Technology and Web Engineering
Finding keywords in blogs: Efficient keyword extraction in blog mining via user behaviors
Expert Systems with Applications: An International Journal
Hi-index | 12.07 |
Weblogs, or blogs, have rapidly gained in popularity over the past few years. In particular, the growth of business blogs that are written by or provide commentary on businesses and companies opens up new opportunities for developing blog-specific search and mining techniques. In this paper, we propose probabilistic models for blog search and mining using two machine learning techniques, latent semantic analysis (LSA) and probabilistic latent semantic analysis (PLSA). We implement the models in our database of business blogs, BizBlogs07, with the aim of achieving higher precision and recall. The probabilistic model is able to segment the business blogs into separate topic areas, which is useful for keywords detection on the blogosphere. Various term-weighting schemes and factor values were also studied in detail, which reveal interesting patterns in our database of business blogs. Our multi-functional business blog system is indeed found to be very different from existing blog search engines, as it aims to provide better relevance and precision of the search.