Discovery of Web Communities Based on the Co-Occurence of References

Authors:
Tsuyoshi Murata
Affiliations:
-
Venue:
DS '00 Proceedings of the Third International Conference on Discovery Science
Year:
2000

Citing 11
Cited 5

Siteseer: personalized navigation for the Web

Communications of the ACM
Enhanced hypertext categorization using hyperlinks

SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Syntactic clustering of the Web

Selected papers from the sixth international conference on World Wide Web
Customizable multi-engine search tool with clustering

Selected papers from the sixth international conference on World Wide Web
WebCutter: a system for dynamic and tailorable site mapping

Selected papers from the sixth international conference on World Wide Web
The anatomy of a large-scale hypertextual Web search engine

WWW7 Proceedings of the seventh international conference on World Wide Web 7
Readings in information visualization: using vision to think

Readings in information visualization: using vision to think
Trawling the Web for emerging cyber-communities

WWW '99 Proceedings of the eighth international conference on World Wide Web
Modern Information Retrieval

Modern Information Retrieval
Machine Discovery Based on the Co-occurrence of References in a Search Engine

DS '99 Proceedings of the Second International Conference on Discovery Science
The web as a graph: measurements, models, and methods

COCOON'99 Proceedings of the 5th annual international conference on Computing and combinatorics

An Approach to Microscopic Clustering of Terms and Documents

PRICAI '02 Proceedings of the 7th Pacific Rim International Conference on Artificial Intelligence: Trends in Artificial Intelligence
A Method for Discovering Purified Web Communities

DS '01 Proceedings of the 4th International Conference on Discovery Science
Implicit Groups of Web Pages as Constrained Top N Concepts

WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 03
Subject-based extraction of a latent blog community

Information Sciences: an International Journal
Extraction of structural information from the web

FSKD'05 Proceedings of the Second international conference on Fuzzy Systems and Knowledge Discovery - Volume Part II

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper proposes a method of discovering Web communities. A complete bipartite graph Ki,j of Web pages can be regarded as a community sharing a common interest. Discovery of such community is expected to assist users' information retrieval from the Web. The method proposed in this paper is based on the assumption that hyperlinks to related Web pages often co-occur. Relations of Web pages are detected by the co-occurrence of hyperlinks on the pages which are acquired from a search engine by backlink search. In order to find a new member of a Web community, all the hyperlinks contained in the acquired pages are extracted. Then a page which is pointed by the most frequent hyperlinks is regarded as a new member of the community. We have build a system which discovers complete bipartite graphs based on the method. Only from a few URLs of initial community members, the system succeeds in discovering several genres of Web communities without analyzing the contents of Web pages.