A comprehensive characterization of NLP techniques for identifying equivalent requirements

  • Authors:
  • Davide Falessi;Giovanni Cantone;Gerardo Canfora

  • Affiliations:
  • Simula Research Laboratory (Norway) and University of Rome TorVergata, Rome, Italy;University of Rome TorVergata, Rome, Italy;RCOST -- Research Centre on Software Technology and University of Sannio, Benevento, Italy

  • Venue:
  • Proceedings of the 2010 ACM-IEEE International Symposium on Empirical Software Engineering and Measurement
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Though very important in software engineering, linking artifacts of the same type (clone detection) or of different types (traceability recovery) is extremely tedious, error-prone and requires significant effort. Past research focused on supporting analysts with mechanisms based on Natural Language Processing (NLP) to identify candidate links. Because a plethora of NLP techniques exists, and their performances vary among contexts, it is important to characterize them according to the provided level of support. The aim of this paper is to characterize a comprehensive set of NLP techniques according to the provided level of support to human analysts in detecting equivalent requirements. The characterization consists on a case study, featuring real requirements, in the context of an Italian company in the defense and aerospace domain. The major result from the case study is that simple NLP are more precise than complex ones.