Refining the systematic literature review process--two participant-observer case studies

  • Authors:
  • Barbara A. Kitchenham;Pearl Brereton;Mark Turner;Mahmood K. Niazi;Stephen Linkman;Rialette Pretorius;David Budgen

  • Affiliations:
  • School of Computing and Mathematics, Keele University, Stoke-on-Trent, UK ST5 5BG;School of Computing and Mathematics, Keele University, Stoke-on-Trent, UK ST5 5BG;School of Computing and Mathematics, Keele University, Stoke-on-Trent, UK ST5 5BG;School of Computing and Mathematics, Keele University, Stoke-on-Trent, UK ST5 5BG;School of Computing and Mathematics, Keele University, Stoke-on-Trent, UK ST5 5BG;School of Engineering and Computing Sciences, Durham University, Durham, UK DH1 3LE;School of Engineering and Computing Sciences, Durham University, Durham, UK DH1 3LE

  • Venue:
  • Empirical Software Engineering
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Systematic literature reviews (SLRs) are a major tool for supporting evidence-based software engineering. Adapting the procedures involved in such a review to meet the needs of software engineering and its literature remains an ongoing process. As part of this process of refinement, we undertook two case studies which aimed 1) to compare the use of targeted manual searches with broad automated searches and 2) to compare different methods of reaching a consensus on quality. For Case 1, we compared a tertiary study of systematic literature reviews published between January 1, 2004 and June 30, 2007 which used a manual search of selected journals and conferences and a replication of that study based on a broad automated search. We found that broad automated searches find more studies than manual restricted searches, but they may be of poor quality. Researchers undertaking SLRs may be justified in using targeted manual searches if they intend to omit low quality papers, or they are assessing research trends in research methodologies. For Case 2, we analyzed the process used to evaluate the quality of SLRs. We conclude that if quality evaluation of primary studies is a critical component of a specific SLR, assessments should be based on three independent evaluators incorporating at least two rounds of discussion.