The analysis of bug reports is an important subfield within the mining software repositories community. It explores the rich data available in defect tracking systems to uncover interesting and actionable information about the bug triaging process. Although bug data is readily accessible from systems such as Bugzilla and JIRA, a common database schema and a curated dataset could significantly enhance future research by making replication easier. Consequently, in this paper we propose the Eclipse and Mozilla Defect Tracking Dataset, a representative database of bug data, filtered to contain only genuine defects (i.e., no feature requests) and designed to cover the whole bug-triage life cycle (i.e., it stores all intermediate actions). We have used this dataset ourselves for predicting bug severity, for studying bug-fixing time, and for identifying erroneously assigned components.
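To make the two design goals concrete (only genuine defects, and every intermediate action retained), the following is a minimal sketch and not the dataset's actual schema. The table names `bug` and `bug_event`, all column names, and the helper functions are hypothetical. It assumes that feature requests are marked with the severity value `enhancement`, as is the convention in Bugzilla.

```python
import sqlite3

# Hypothetical schema: table and column names are illustrative assumptions,
# not the schema of the published dataset.
SCHEMA = """
CREATE TABLE bug (
    bug_id    INTEGER PRIMARY KEY,
    product   TEXT NOT NULL,
    component TEXT NOT NULL,
    severity  TEXT NOT NULL
);
CREATE TABLE bug_event (
    event_id   INTEGER PRIMARY KEY AUTOINCREMENT,
    bug_id     INTEGER NOT NULL REFERENCES bug(bug_id),
    field      TEXT NOT NULL,  -- which attribute changed
    old_value  TEXT,
    new_value  TEXT,
    changed_at TEXT NOT NULL   -- ISO 8601 timestamp
);
"""


def load_defects(conn, reports):
    """Insert reports, skipping feature requests (severity 'enhancement')."""
    kept = 0
    for r in reports:
        if r["severity"] == "enhancement":
            continue  # assumed marker for a feature request
        conn.execute(
            "INSERT INTO bug VALUES (?, ?, ?, ?)",
            (r["bug_id"], r["product"], r["component"], r["severity"]),
        )
        kept += 1
    return kept


def record_event(conn, bug_id, field, old, new, changed_at):
    """Store one intermediate action instead of overwriting the bug row."""
    conn.execute(
        "INSERT INTO bug_event (bug_id, field, old_value, new_value, changed_at) "
        "VALUES (?, ?, ?, ?, ?)",
        (bug_id, field, old, new, changed_at),
    )


def history(conn, bug_id, field):
    """Return (old, new) pairs for one field of one bug, oldest first."""
    return conn.execute(
        "SELECT old_value, new_value FROM bug_event "
        "WHERE bug_id = ? AND field = ? ORDER BY changed_at, event_id",
        (bug_id, field),
    ).fetchall()


conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)

# Made-up sample reports, for illustration only.
reports = [
    {"bug_id": 1, "product": "Platform", "component": "UI", "severity": "major"},
    {"bug_id": 2, "product": "Platform", "component": "UI", "severity": "enhancement"},
]
kept = load_defects(conn, reports)
record_event(conn, 1, "component", "UI", "Core", "2010-01-05T10:00:00")
print(kept, history(conn, 1, "component"))
```

Keeping changes in a separate event table, rather than storing only the final state of each report, is what would let a study recover, for example, every component reassignment of a bug.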