Measuring Code Quality to Improve Specification Mining

Authors:
Claire Le Goues;Westley Weimer
Affiliations:
University of Virginia, Charlottesville;University of Virginia, Charlottesville
Venue:
IEEE Transactions on Software Engineering
Year:
2012

Citing 0
Cited 2

Using automated program repair for evaluating the effectiveness of fault localization techniques

Proceedings of the 2013 International Symposium on Software Testing and Analysis
Current challenges in automatic software repair

Software Quality Control

Quantified Score

Hi-index	0.00

Visualization

Abstract

Formal specifications can help with program testing, optimization, refactoring, documentation, and, most importantly, debugging and repair. However, they are difficult to write manually, and automatic mining techniques suffer from 90-99 percent false positive rates. To address this problem, we propose to augment a temporal-property miner by incorporating code quality metrics. We measure code quality by extracting additional information from the software engineering process and using information from code that is more likely to be correct, as well as code that is less likely to be correct. When used as a preprocessing step for an existing specification miner, our technique identifies which input is most indicative of correct program behavior, which allows off-the-shelf techniques to learn the same number of specifications using only 45 percent of their original input. As a novel inference technique, our approach has few false positives in practice (63 percent when balancing precision and recall, 3 percent when focused on precision), while still finding useful specifications (e.g., those that find many bugs) on over 1.5 million lines of code.