Genetic granular classifiers in modeling software quality

  • Authors:
  • Witold Pedrycz;Giancarlo Succi

  • Affiliations:
  • Department of Electrical and Computer Engineering, University of Alberta, Edmonton, AB, Canada T6G 2G7 and Systems Research Institute of Polish Academy of Sciences, Warsaw, Poland;Department of Computer Science, University of Bozen, I-39100 Bozen, Italy

  • Venue:
  • Journal of Systems and Software
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

Hyperbox classifiers are one of the most appealing and intuitively transparent classification schemes. As the name itself stipulates, these classifiers are based on a collection of hyperboxes--generic and highly interpretable geometric descriptors of data belonging to a given class. The hyperboxes translate into conditional statements (rules) of the form ''if feature"1 is in [a,b] and feature"2 is in [d,f] and ... and feature"n is in [w,z] then class @w'' where the intervals ([a,b],...,[w,z]) are the respective edges of the hyperbox. The proposed design process of hyperboxes comprises of two main phases. In the first phase, a collection of ''seeds'' of the hyperboxes is formed through data clustering (realized by means of the Fuzzy C-Means algorithm, FCM). In the second phase, the hyperboxes are ''grown'' (expanded) by applying mechanisms of genetic optimization (and genetic algorithm, in particular). We reveal how the underlying geometry of the hyperboxes supports an immediate interpretation of software data concerning software maintenance and dealing with rules describing a number of changes made to software modules and their linkages with various software measures (such as size of code, McCabe cyclomatic complexity, number of comments, number of characters, etc.).