Mining Eclipse Developer Contributions via Author-Topic Models

  • Authors:
  • Erik Linstead;Paul Rigor;Sushil Bajracharya;Cristina Lopes;Pierre Baldi

  • Affiliations:
  • University of California, Irvine, USA;University of California, Irvine, USA;University of California, Irvine, USA;University of California, Irvine, USA;University of California, Irvine, USA

  • Venue:
  • MSR '07 Proceedings of the Fourth International Workshop on Mining Software Repositories
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present the results of applying statistical author-topic models to a subset of the Eclipse 3.0 source code consisting of 2,119 source files and 700,000 lines of code from 59 developers. This technique provides an intuitive and automated framework with which to mine developer contributions and competencies from a given code base while simultaneously extracting software function in the form of topics. In addition to serving as a convenient summary for program function and developer activities, our study shows that topic models provide a meaningful, effective, and statistical basis for developer similarity analysis.