Search-based duplicate defect detection: an industrial experience

Authors:
Mehdi Amoui;Nilam Kaushik;Abraham Al-Dabbagh;Ladan Tahvildari;Shimin Li;Weining Liu
Affiliations:
University of Waterloo, Canada;University of Waterloo, Canada;University of Waterloo, Canada;University of Waterloo, Canada;BlackBerry, Canada;BlackBerry, Canada
Venue:
Proceedings of the 10th Working Conference on Mining Software Repositories
Year:
2013

Citing 15
Cited 0

A study of smoothing methods for language models applied to Ad Hoc information retrieval

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Detection of Duplicate Defect Reports Using Natural Language Processing

ICSE '07 Proceedings of the 29th international conference on Software Engineering
An approach to detecting duplicate bug reports using natural language and execution information

Proceedings of the 30th international conference on Software engineering
Introduction to Information Retrieval

Introduction to Information Retrieval
Towards the next generation of bug tracking systems

VLHCC '08 Proceedings of the 2008 IEEE Symposium on Visual Languages and Human-Centric Computing
DebugAdvisor: a recommender system for debugging

Proceedings of the the 7th joint meeting of the European software engineering conference and the ACM SIGSOFT symposium on The foundations of software engineering
A discriminative model approach for accurate duplicate bug report retrieval

Proceedings of the 32nd ACM/IEEE International Conference on Software Engineering - Volume 1
The relationship between search based software engineering and predictive modeling

Proceedings of the 6th International Conference on Predictive Models in Software Engineering
An Initial Study on the Bug Report Duplication Problem

CSMR '10 Proceedings of the 2010 14th European Conference on Software Maintenance and Reengineering
Search based software engineering: techniques, taxonomy, tutorial

Empirical Software Engineering and Verification
Towards more accurate retrieval of duplicate bug reports

ASE '11 Proceedings of the 2011 26th IEEE/ACM International Conference on Automated Software Engineering
A Comparative Study of the Performance of IR Models on Duplicate Bug Detection

CSMR '12 Proceedings of the 2012 16th European Conference on Software Maintenance and Reengineering
Improved Duplicate Bug Report Identification

CSMR '12 Proceedings of the 2012 16th European Conference on Software Maintenance and Reengineering
Evaluating prediction systems in software project estimation

Information and Software Technology
Duplicate bug report detection with a combination of information retrieval and topic modeling

Proceedings of the 27th IEEE/ACM International Conference on Automated Software Engineering

Quantified Score

Hi-index	0.00

Visualization

Abstract

Duplicate defects put extra overheads on software organizations, as the cost and effort of managing duplicate defects are mainly redundant. Due to the use of natural language and various ways to describe a defect, it is usually hard to investigate duplicate defects automatically. This problem is more severe in large software organizations with huge defect repositories and massive number of defect reporters. Ideally, an efficient tool should prevent duplicate reports from reaching developers by automatically detecting and/or filtering duplicates. It also should be able to offer defect triagers a list of top-N similar bug reports and allow them to compare the similarity of incoming bug reports with the suggested duplicates. This demand has motivated us to design and develop a search-based duplicate bug detection framework at BlackBerry. The approach follows a generalized process model to evaluate and tune the performance of the system in a systematic way. We have applied the framework on software projects at BlackBerry, in addition to the Mozilla defect repository. The experimental results exhibit the performance of the developed framework and highlight the high impact of parameter tuning on its performance.