Preliminary guidelines for empirical research in software engineering
IEEE Transactions on Software Engineering
Understanding source code evolution using abstract syntax tree matching
MSR '05 Proceedings of the 2005 international workshop on Mining software repositories
MSR '05 Proceedings of the 2005 international workshop on Mining software repositories
Mining student CVS repositories for performance indicators
MSR '05 Proceedings of the 2005 international workshop on Mining software repositories
Developer identification methods for integrated data from various sources
MSR '05 Proceedings of the 2005 international workshop on Mining software repositories
On the success of empirical studies in the international conference on software engineering
Proceedings of the 28th international conference on Software engineering
The evolution radar: visualizing integrated logical coupling information
Proceedings of the 2006 international workshop on Mining software repositories
Mining sequences of changed-files from version histories
Proceedings of the 2006 international workshop on Mining software repositories
Program element matching for multi-version program analyses
Proceedings of the 2006 international workshop on Mining software repositories
Fine grained indexing of software repositories to support impact analysis
Proceedings of the 2006 international workshop on Mining software repositories
Proceedings of the 2006 international workshop on Mining software repositories
Geographic location of developers at SourceForge
Proceedings of the 2006 international workshop on Mining software repositories
Open Borders? Immigration in Open Source Projects
MSR '07 Proceedings of the Fourth International Workshop on Mining Software Repositories
How Long Will It Take to Fix This Bug?
MSR '07 Proceedings of the Fourth International Workshop on Mining Software Repositories
Determining Implementation Expertise from Bug Reports
MSR '07 Proceedings of the Fourth International Workshop on Mining Software Repositories
Defect Data Analysis Based on Extended Association Rule Mining
MSR '07 Proceedings of the Fourth International Workshop on Mining Software Repositories
MSR '07 Proceedings of the Fourth International Workshop on Mining Software Repositories
Correlating Social Interactions to Release History during Software Evolution
MSR '07 Proceedings of the Fourth International Workshop on Mining Software Repositories
Visual Data Mining in Software Archives to Detect How Developers Work Together
MSR '07 Proceedings of the Fourth International Workshop on Mining Software Repositories
Mining a Change-Based Software Repository
MSR '07 Proceedings of the Fourth International Workshop on Mining Software Repositories
MSR '07 Proceedings of the Fourth International Workshop on Mining Software Repositories
Journal of Software Maintenance and Evolution: Research and Practice
Improving change descriptions with change contexts
Proceedings of the 2008 international working conference on Mining software repositories
Proceedings of the 2008 international working conference on Mining software repositories
What do large commits tell us?: a taxonomical study of large commits
Proceedings of the 2008 international working conference on Mining software repositories
The promises and perils of mining git
MSR '09 Proceedings of the 2009 6th IEEE International Working Conference on Mining Software Repositories
Tracking concept drift of software projects using defect prediction quality
MSR '09 Proceedings of the 2009 6th IEEE International Working Conference on Mining Software Repositories
From work to word: How do software developers describe their work?
MSR '09 Proceedings of the 2009 6th IEEE International Working Conference on Mining Software Repositories
Assigning bug reports using a vocabulary-based expertise model of developers
MSR '09 Proceedings of the 2009 6th IEEE International Working Conference on Mining Software Repositories
Mining the history of synchronous changes to refine code ownership
MSR '09 Proceedings of the 2009 6th IEEE International Working Conference on Mining Software Repositories
IEEE Transactions on Software Engineering
A study of language usage evolution in open source software
Proceedings of the 8th Working Conference on Mining Software Repositories
An exploratory study of identifier renamings
Proceedings of the 8th Working Conference on Mining Software Repositories
Proceedings of the 8th Working Conference on Mining Software Repositories
Comparing fine-grained source code changes and code churn for bug prediction
Proceedings of the 8th Working Conference on Mining Software Repositories
Security versus performance bugs: a case study on Firefox
Proceedings of the 8th Working Conference on Mining Software Repositories
Entering the circle of trust: developer initiation as committers in open-source projects
Proceedings of the 8th Working Conference on Mining Software Repositories
Social interactions around cross-system bug fixings: the case of FreeBSD and OpenBSD
Proceedings of the 8th Working Conference on Mining Software Repositories
Do time of day and developer experience affect commit bugginess?
Proceedings of the 8th Working Conference on Mining Software Repositories
Automated topic naming to support cross-project analysis of software maintenance activities
Proceedings of the 8th Working Conference on Mining Software Repositories
Modeling the evolution of topics in source code histories
Proceedings of the 8th Working Conference on Mining Software Repositories
Software bertillonage: finding the provenance of an entity
Proceedings of the 8th Working Conference on Mining Software Repositories
Supporting software history exploration
Proceedings of the 8th Working Conference on Mining Software Repositories
Hi-index | 0.00 |
The Mining Software Repositories (MSR) research community has grown significantly since the first MSR workshop was held in 2004. As the community continues to broaden its scope and deepens its expertise, it is worthwhile to reflect on the best practices that our community has developed over the past decade of research. We identify these best practices by surveying past MSR conferences and workshops. To that end, we review all 117 full papers published in the MSR proceedings between 2004 and 2012. We extract 268 comments from these papers, and categorize them using a grounded theory methodology. From this evaluation, four high-level themes were identified: data acquisition and preparation, synthesis, analysis, and sharing/replication. Within each theme we identify several common recommendations, and also examine how these recommendations have evolved over the past decade. In an effort to make this survey a living artifact, we also provide a public forum that contains the extracted recommendations in the hopes that the MSR community can engage in a continuing discussion on our evolving best practices.