A guided tour to approximate string matching
ACM Computing Surveys (CSUR)
Two case studies of open source software development: Apache and Mozilla
ACM Transactions on Software Engineering and Methodology (TOSEM)
Empirical Software Engineering
Guest Editor's Introduction: Special Issue on Mining Software Repositories
IEEE Transactions on Software Engineering
Developer identification methods for integrated data from various sources
MSR '05 Proceedings of the 2005 international workshop on Mining software repositories
Proceedings of the 2006 international workshop on Mining software repositories
Evolution of the core team of developers in libre software projects
MSR '09 Proceedings of the 2009 6th IEEE International Working Conference on Mining Software Repositories
Maispion: a tool for analysing and visualising open source software developer communities
IWST '09 Proceedings of the International Workshop on Smalltalk Technologies
A framework for analysing and visualising open source software ecosystems
Proceedings of the Joint ERCIM Workshop on Software Evolution (EVOL) and International Workshop on Principles of Software Evolution (IWPSE)
Process Mining Software Repositories
CSMR '11 Proceedings of the 2011 15th European Conference on Software Maintenance and Reengineering
A historical dataset for the gnome ecosystem
Proceedings of the 10th Working Conference on Mining Software Repositories
A historical dataset of software engineering conferences
Proceedings of the 10th Working Conference on Mining Software Repositories
How social Q&A sites are changing knowledge sharing in open source software communities
Proceedings of the 17th ACM conference on Computer supported cooperative work & social computing
Hi-index | 0.00 |
Software repository mining research extracts and analyses data originating from multiple software repositories to understand the historical development of software systems, and to propose better ways to evolve such systems in the future. Of particular interest is the study of the activities and interactions between the persons involved in the software development process. The main challenge with such studies lies in the ability to determine the identities (e.g., logins or e-mail accounts) in software repositories that represent the same physical person. To achieve this, different identity merge algorithms have been proposed in the past. This article provides an objective comparison of identity merge algorithms, including some improvements over existing algorithms. The results are validated on a selection of large ongoing open source software projects.