Modeling the search landscape of metaheuristic software clustering algorithms

Authors:
Brian S. Mitchell;Spiros Mancoridis
Affiliations:
Department of Computer Science, Drexel University, Philadelphia, PA;Department of Computer Science, Drexel University, Philadelphia, PA
Venue:
GECCO'03 Proceedings of the 2003 international conference on Genetic and evolutionary computation: PartII
Year:
2003

Citing 14
Cited 5

Using Neural Networks to Modularize Software

Machine Learning - Special issue on structured connectionist systems
Reverse engineering

Practical reusable UNIX software
Assessing modular structure of legacy code based on mathematical concept analysis

ICSE '97 Proceedings of the 19th international conference on Software engineering
Identifying objects using cluster and concept analysis

Proceedings of the 21st international conference on Software engineering
Extracting and Restructuring the Design of Large Systems

IEEE Software
A Technique for Drawing Directed Graphs

IEEE Transactions on Software Engineering
Using Heuristic Search Techniques To Extract Design Abstractions From Source Code

GECCO '02 Proceedings of the Genetic and Evolutionary Computation Conference
Chava: Reverse Engineering and Tracking of Java Applets

WCRE '99 Proceedings of the Sixth Working Conference on Reverse Engineering
CRAFT: A Framework for Evaluating Software Clustering Results in the Absence of Benchmark Decompositions

WCRE '01 Proceedings of the Eighth Working Conference on Reverse Engineering (WCRE'01)
Comparing the Decompositions Produced by Software Clustering Algorithms using Similarity Measurements

ICSM '01 Proceedings of the IEEE International Conference on Software Maintenance (ICSM'01)
Bunch: A Clustering Tool for the Recovery and Maintenance of Software System Structures

ICSM '99 Proceedings of the IEEE International Conference on Software Maintenance
A Comparison of Graphs of Concept for Reverse Engineering

IWPC '00 Proceedings of the 8th International Workshop on Program Comprehension
Using Automatic Clustering to Produce High-Level System Organizations of Source Code

IWPC '98 Proceedings of the 6th International Workshop on Program Comprehension
A heuristic search approach to solving the software clustering problem

A heuristic search approach to solving the software clustering problem

Spectral and meta-heuristic algorithms for software clustering

Journal of Systems and Software - Special issue: Software reverse engineering
On the Automatic Modularization of Software Systems Using the Bunch Tool

IEEE Transactions on Software Engineering
Improving the build architecture of legacy c/c++ software systems

FASE'05 Proceedings of the 8th international conference, held as part of the joint European Conference on Theory and Practice of Software conference on Fundamental Approaches to Software Engineering
Survey: A survey on search-based software design

Computer Science Review
Search-based software engineering: Trends, techniques and applications

ACM Computing Surveys (CSUR)

Quantified Score

Hi-index	0.01

Visualization

Abstract

Software clustering techniques are useful for extracting architectural information about a system directly from its source code structure. This paper starts by examining the Bunch clustering system, which uses metaheuristic search techniques to perform clustering. Bunch produces a subsystem decomposition by partitioning a graph formed from the entities (e.g., modules) and relations (e.g., function calls) in the source code, and then uses a fitness function to evaluate the quality of the graph partition. Finding the best graph partition has been shown to be a NP-hard problem, thus Bunch attempts to find a sub-optimal result that is "good enough" using search algorithms. Since the validation of software clustering results often is overlooked, we propose an evaluation technique based on the search landscape of the graph being clustered. By gaining insight into the search landscape, we can determine the quality of a typical clustering result. This paper defines how the search landscape is modeled and how it can be used for evaluation. A case study that examines a number of open source systems is presented.