Software vulnerability prediction using text analysis techniques

  • Authors:
  • Aram Hovsepyan;Riccardo Scandariato;Wouter Joosen;James Walden

  • Affiliations:
  • Katholieke Universiteit Leuven, Leuven, Belgium;Katholieke Universiteit Leuven, Leuven, Belgium;Katholieke Universiteit Leuven, Leuven, Belgium;Northern Kentucky University, Highland Heights, KY, USA

  • Venue:
  • Proceedings of the 4th international workshop on Security measurements and metrics
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Early identification of software vulnerabilities is essential in software engineering and can help reduce not only costs, but also prevent loss of reputation and damaging litigations for a software firm. Techniques and tools for software vulnerability prediction are thus invaluable. Most of the existing techniques rely on using component characteristic(s) (like code complexity, code churn) for the vulnerability prediction. In this position paper, we present a novel approach for vulnerability prediction that leverages on the analysis of raw source code as text, instead of using "cooked" features. Our initial results seem to be very promising as the prediction model achieves an average accuracy of 0.87, precision of 0.85 and recall of 0.88 on 18 versions of a large mobile application.