Communities in Large Networks: Identification and Ranking

  • Authors:
  • Martin Olsen

  • Affiliations:
  • Department of Computer Science, University of Aarhus,

  • Venue:
  • Algorithms and Models for the Web-Graph
  • Year:
  • 2007

Quantified Score

Hi-index 0.02

Visualization

Abstract

We study the problem of identifying and ranking the members of a community in a very large network with link analysis only, given a set of representatives of the community. We define the concept of a communityjustified by a formal analysis of a simple model of the evolution of a directed graph. We show that the problem of deciding whether a non trivial community exists is NP complete. Nevertheless, experiments show that a very simple greedy approach can identify members of a community in the Danish part of the web graph with time complexity only dependent on the size of the found community and its immediate surroundings. The members are ranked with a "local" variant of the PageRank algorithm. Results are reported from successful experiments on identifying and ranking Danish Computer Science sites and Danish Chess pages using only a few representatives.