Unsupervised learning by probabilistic latent semantic analysis
Machine Learning
A Unified Framework for Clustering Heterogeneous Web Objects
WISE '02 Proceedings of the 3rd International Conference on Web Information Systems Engineering
Proceedings of the 13th international conference on World Wide Web
A maximum entropy web recommendation system: combining collaborative and content features
Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
Learn from web search logs to organize search results
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
A fuzzy bi-clustering approach to correlate web users and pages
International Journal of Knowledge and Web Intelligence
Co-clustering analysis of weblogs using bipartite spectral projection approach
KES'10 Proceedings of the 14th international conference on Knowledge-based and intelligent information and engineering systems: Part III
Extraction of Target User Group from Web Usage Data Using Evolutionary Biclustering Approach
International Journal of Applied Metaheuristic Computing
A Discrete Artificial Bees Colony Inspired Biclustering Algorithm
International Journal of Swarm Intelligence Research
Hi-index | 0.00 |
Web clustering is an approach for aggregating web objects into various groups according to underlying relationships among them. Finding co-clusters of web objects in semantic space is an interesting topic in the context of web usage mining, which is able to capture the underlying user navigational interest and content preference simultaneously. In this paper we will present a novel web co-clustering algorithm named Co-Clustering in Semantic space (COCS) to simultaneously partition web users and pages via a latent semantic analysis approach. In COCS, we first, train the latent semantic space of weblog data by using Probabilistic Latent Semantic Analysis (PLSA) model, and then, project all weblog data objects into this semantic space with probability distribution to capture the relationship among web pages and web users, at last, propose a clustering algorithm to generate the co-cluster corresponding to each semantic factor in the latent semantic space via probability inference. The proposed approach is evaluated by experiments performed on real datasets in terms of precision and recall metrics. Experimental results have demonstrated the proposed method can effectively reveal the co-aggregates of web users and pages which are closely related.