Calculating communities by link analysis of URLs

  • Authors:
  • Gerhard Heyer;Uwe Quasthoff

  • Affiliations:
  • Natural Language Processing Department, Leipzig University Computer Science Institute, Leipzig;Natural Language Processing Department, Leipzig University Computer Science Institute, Leipzig

  • Venue:
  • IICS'04 Proceedings of the 4th international conference on Innovative Internet Community Systems
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

Collocation analysis finds semantic associations of concepts using large text corpora. If the same procedure is applied to sets of outgoing links of web pages, we can find semantically related web domains to a large extent. The structure of the semantic clusters shows all properties of small worlds. The algorithm is known to work for large parts of the web like the German internet. As a sample application we present a surf guide for the German web.