A Probabilistic Based Approach towards Software System Clustering

  • Authors:
  • Anna Corazza;Sergio Di Martino;Giuseppe Scanniello

  • Affiliations:
  • -;-;-

  • Venue:
  • CSMR '10 Proceedings of the 2010 14th European Conference on Software Maintenance and Reengineering
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we present a clustering based approach to partition software systems into meaningful subsystems. In particular, the approach uses lexical information extracted from four zones in Java classes, which may provide a different contribution towards software systems partitioning. To automatically weigh these zones, we introduced a probabilistic model, and applied the Expectation-Maximization (EM) algorithm. To group classes according to the considered lexical information, we customized the well-known K-Medoids algorithm. To assess the approach and the implemented supporting system, we have conducted a case study on six open source software systems.