A translation approach to portable ontology specifications
Knowledge Acquisition - Special issue: Current issues in knowledge modeling
Building Knowledge through Families of Experiments
IEEE Transactions on Software Engineering
Object-Oriented Metrics in Practice
Object-Oriented Metrics in Practice
Facilitating software evolution research with kenyon
Proceedings of the 10th European software engineering conference held jointly with 13th ACM SIGSOFT international symposium on Foundations of software engineering
MSR '05 Proceedings of the 2005 international workshop on Mining software repositories
Analysis of signature change patterns
MSR '05 Proceedings of the 2005 international workshop on Mining software repositories
Proceedings of the 2006 international workshop on Mining software repositories
Sourcerer: a search engine for open source code supporting structure-based search
Companion to the 21st ACM SIGPLAN symposium on Object-oriented programming systems, languages, and applications
Predicting Faults from Cached History
ICSE '07 Proceedings of the 29th international conference on Software Engineering
Determining Implementation Expertise from Bug Reports
MSR '07 Proceedings of the Fourth International Workshop on Mining Software Repositories
MSR '07 Proceedings of the Fourth International Workshop on Mining Software Repositories
The role of replications in Empirical Software Engineering
Empirical Software Engineering
Branching and merging in the repository
Proceedings of the 2008 international working conference on Mining software repositories
Proceedings of the 2008 international working conference on Mining software repositories
Change Analysis with Evolizer and ChangeDistiller
IEEE Software
MSR '09 Proceedings of the 2009 6th IEEE International Working Conference on Mining Software Repositories
A platform for software engineering research
MSR '09 Proceedings of the 2009 6th IEEE International Working Conference on Mining Software Repositories
Assigning bug reports using a vocabulary-based expertise model of developers
MSR '09 Proceedings of the 2009 6th IEEE International Working Conference on Mining Software Repositories
Fostering synergies: how semantic web technology could influence software repositories
Proceedings of 2010 ICSE Workshop on Search-driven Development: Users, Infrastructure, Tools and Evaluation
Semantic web enabled software analysis
Web Semantics: Science, Services and Agents on the World Wide Web
An exploratory study of identifier renamings
Proceedings of the 8th Working Conference on Mining Software Repositories
Finding software license violations through binary code clone detection
Proceedings of the 8th Working Conference on Mining Software Repositories
Comparing fine-grained source code changes and code churn for bug prediction
Proceedings of the 8th Working Conference on Mining Software Repositories
Do time of day and developer experience affect commit bugginess?
Proceedings of the 8th Working Conference on Mining Software Repositories
SOFAS: A Lightweight Architecture for Software Analysis as a Service
WICSA '11 Proceedings of the 2011 Ninth Working IEEE/IFIP Conference on Software Architecture
Using the gini coefficient for bug prediction in eclipse
Proceedings of the 12th International Workshop on Principles of Software Evolution and the 7th annual ERCIM Workshop on Software Evolution
Hi-index | 0.00 |
The replication of studies in mining software repositories (MSR) is essential to compare different mining techniques or assess their findings across many projects. However, it has been shown that very few of these studies can be easily replicated. Their replication is just as fundamental as the studies themselves and is one of the main threats to validity that they suffer from. In this paper, we show how we can alleviate this problem with our SOFAS framework. SOFAS is a platform that enables a systematic and repeatable analysis of software projects by providing extensible and composable analysis workflows. These workflows can be applied on a multitude of software projects, facilitating the replication and scaling of mining studies. In this paper, we show how and to which degree replication can be achieved. We investigated the mining studies of MSR from 2004 to 2011 and found that from 88 studies published in the MSR proceedings so far, we can fully replicate 25 empirical studies. Additionally, we can replicate 27 additional studies to a large extent. These studies account for 30% and 32%, respectively, of the mining studies published. To support our claim we describe in detail one large study that we replicated and discuss how replication with SOFAS works for the other studies investigated. To discuss the potential of our platform we also characterise how studies can be easily enriched to deliver even more comprehensive answers by extending the analysis workflows provided by the platform.