Finding cohesive clusters for analyzing knowledge communities

  • Authors:
  • Vasileios Kandylas;S. Phineas Upham;Lyle H. Ungar

  • Affiliations:
  • University of Pennsylvania, Department of Computer and Information Science, 19104, Philadelphia, PA, USA;University of Pennsylvania, Wharton School, Philadelphia, PA, USA;University of Pennsylvania, Department of Computer and Information Science, 19104, Philadelphia, PA, USA

  • Venue:
  • Knowledge and Information Systems
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Documents and authors can be clustered into “knowledge communities” based on the overlap in the papers they cite. We introduce a new clustering algorithm, Streemer, which finds cohesive foreground clusters embedded in a diffuse background, and use it to identify knowledge communities as foreground clusters of papers which share common citations. To analyze the evolution of these communities over time, we build predictive models with features based on the citation structure, the vocabulary of the papers, and the affiliations and prestige of the authors. Findings include that scientific knowledge communities tend to grow more rapidly if their publications build on diverse information and if they use a narrow vocabulary.