An improved methodology on information distillation by mining program source code

  • Authors:
  • Y. Kanellopoulos;C. Makris;C. Tjortjis

  • Affiliations:
  • School of Informatics, University of Manchester, P.O. Box 88, Manchester M60 1QD, UK and Computer Engineering and Informatics Department, University of Patras, Greece;Computer Engineering and Informatics Department, University of Patras, Greece;School of Informatics, University of Manchester, P.O. Box 88, Manchester M60 1QD, UK

  • Venue:
  • Data & Knowledge Engineering
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents a methodology for knowledge acquisition from source code. We use data mining to support semi-automated software maintenance and comprehension and provide practical insights into systems specifics, assuming one has limited prior familiarity with these systems. We propose a methodology and an associated model for extracting information from object oriented code by applying clustering and association rules mining. K-means clustering produces system overviews and deductions, which support further employment of an improved version of MMS Apriori that identifies hidden relationships between classes, methods and member data. The methodology is evaluated on an industrial case study, results are discussed and conclusions are drawn.