Identifying Reasons for Software Changes Using Historic Databases

  • Authors:
  • Audris Mockus;Lawrence G. Votta

  • Affiliations:
  • -;-

  • Venue:
  • ICSM '00 Proceedings of the International Conference on Software Maintenance (ICSM'00)
  • Year:
  • 2000

Quantified Score

Hi-index 0.00

Visualization

Abstract

Large-scale software products must constantly change in order to adapt to a changing environment. Studies of historic data from legacy software systems have identified three specific causes of this change: adding new features; correcting faults; and restructuring code to accommodate future changes. Our hypothesis is that a textual description field of a change is essential to understanding why that change was performed. In addition, we expect that difficulty, size, and interval would vary strongly across different types of changes. To test these hypotheses we have designed a program, which automatically classifies maintenance activity based on a textual description of changes. Developer surveys showed that the automatic classification was in agreement with developer opinions. Tests of the classifier on a different product found that size and interval for different types of changes did not vary across two products. We have found strong relationships between the type and size of a change and the time required to carry it out. We also discovered a relatively large amount of perfective changes in the system we examined. From the study we have arrived at several suggestions on how to make version control data useful in diagnosing the state of a software project, without significantly increasing the overhead for the developer using the change management system.