The Maven repository dataset of metrics, changes, and dependencies

  • Authors:
  • Steven Raemaekers;Arie van Deursen;Joost Visser

  • Affiliations:
  • Software Improvement Group, Netherlands / TU Delft, Netherlands;TU Delft, Netherlands;Software Improvement Group, Netherlands

  • Venue:
  • Proceedings of the 10th Working Conference on Mining Software Repositories
  • Year:
  • 2013

Abstract

We present the Maven Dependency Dataset (MDD), containing metrics, changes and dependencies of 148,253 jar files. Metrics and changes have been calculated at the level of individual methods, classes and packages of multiple library versions. A complete call graph is also presented which includes call, inheritance, containment and historical relationships between all units of the entire repository. In this paper, we describe our dataset and the methodology used to obtain it. We present different conceptual views of MDD and we also describe limitations and data quality issues that researchers using this data should be aware of.