Learning to classify bug reports into components

  • Authors:
  • Ashish Sureka

  • Affiliations:
  • Indraprastha Institute of Information Technology (IIIT-D), New Delhi, India

  • Venue:
  • TOOLS'12 Proceedings of the 50th international conference on Objects, Models, Components, Patterns
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Bug reports in widely used defect tracking systems contains standard and mandatory fields like product name, component name, version number and operating system. Such fields provide important information required by developers during bug fixing. Previous research shows that bug reporters often assign incorrect values for such fields which cause problems and delays in bug fixing. We conduct an empirical study on the issue of incorrect component assignments or component reassignments in bug reports. We perform a case study on open-source Eclipse and Mozilla projects and report results on various aspects such as the percentage of reassignments, distribution across number of assignments until closure of a bug and time difference between creation and reassignment event. We perform a series of experiments using a machine learning framework for two prediction tasks: categorizing a given bug report into a pre-defined list of components and predicting whether a given bug report will be reassigned. Experimental results demonstrate correlation between terms present in bug reports (textual documents) and components which can be used as linguistic indicators for the task of component prediction. We study component reassignment graphs and reassignment probabilities and investigate their usefulness for the task of component reassignment prediction.