On the Equivalence of Information Retrieval Methods for Automated Traceability Link Recovery

  • Authors:
  • Rocco Oliveto;Malcom Gethers;Denys Poshyvanyk;Andrea De Lucia

  • Affiliations:
  • -;-;-;-

  • Venue:
  • ICPC '10 Proceedings of the 2010 IEEE 18th International Conference on Program Comprehension
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present an empirical study to statistically analyze the equivalence of several traceability recovery methods based on Information Retrieval (IR) techniques. The analysis is based on Principal Component Analysis and on the analysis of the overlap of the set of candidate links provided by each method. The studied techniques are the Jensen-Shannon (JS) method, Vector Space Model (VSM), Latent Semantic Indexing (LSI), and Latent Dirichlet Allocation (LDA). The results show that while JS, VSM, and LSI are almost equivalent, LDA is able to capture a dimension unique to the set of techniques which we considered.