Utilizing hyperlink transitivity to improve web page clustering
ADC '03 Proceedings of the 14th Australasian database conference - Volume 17
Connectivity inferences over the web for the analysis of semantic networks
International Journal of Web Engineering and Technology
Web page clustering: a hyperlink-based similarity and matrix-based hierarchical algorithms
APWeb'03 Proceedings of the 5th Asia-Pacific web conference on Web technologies and applications
GIScience'06 Proceedings of the 4th international conference on Geographic Information Science
Connectivity inferences over the web for the analysis of semantic networks
W2GIS'05 Proceedings of the 5th international conference on Web and Wireless Geographical Information Systems
This paper proposes a matrix approach for hierarchical web page clustering, with two algorithms that use hyperlink information among pages. One algorithm clusters web pages without considering cluster overlap; the other takes cluster overlap into account. Both algorithms take advantage of intrinsic relationships among the pages and are independent of the order in which the pages are presented to them. Furthermore, the proposed algorithms do not require a predefined similarity threshold for clustering, and they are easy to implement for web applications. Preliminary evaluations show the effectiveness of the proposed algorithms, as well as a promising application.