How to measure a large open-source distributed system: Research Articles

Authors:
Douglas Thain;Todd Tannenbaum;Miron Livny
Affiliations:
Computer Science and Engineering Department, University of Notre Dame, 384 Fitzpatrick Hall, Notre Dame, IN 46556, U.S.A.;Computer Sciences Department, University of Wisconsin–Madison, 1210 West Dayton Street, Madison, WI 53706, U.S.A.;Computer Sciences Department, University of Wisconsin–Madison, 1210 West Dayton Street, Madison, WI 53706, U.S.A.
Venue:
Concurrency and Computation: Practice & Experience
Year:
2006

Citing 0
Cited 6

A performance study of grid workflow engines

GRID '08 Proceedings of the 2008 9th IEEE/ACM International Conference on Grid Computing
OddCI: on-demand distributed computing infrastructure

Proceedings of the 2nd Workshop on Many-Task Computing on Grids and Supercomputers
Kestrel: an XMPP-based framework for many task computing applications

Proceedings of the 2nd Workshop on Many-Task Computing on Grids and Supercomputers
Scaling virtual organization clusters over a wide area network using the Kestrel workload management system

Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
Incentives and integration in scientific software production

Proceedings of the 2013 conference on Computer supported cooperative work
Using Kestrel and XMPP to Support the STAR Experiment in the Cloud

Journal of Grid Computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

How can we measure the impact of an open-source software package over time? When a system has no price, no purchase contracts and no buyers or sellers it can be difficult to judge its impact on the world. To explore this issue, we have instrumented the Condor distributed batch system in a variety of ways and observed its growth to over 50 000 CPUs at over 1000 sites over five years. Instrumentation methods include automatic updates by e-mail and user datagram protocol (UDP), annotated download records and a voluntary user survey. Each of these metrics has various strengths and weaknesses that we are able to compare and contrast. We also explore the ethical and legal issues surrounding automatic data collection. Surprisingly, we discover that objections to automatic data collection are higher among people that are not using the Condor software. We conclude with some practical advice for further research into the measurement of software systems. Copyright © 2006 John Wiley & Sons, Ltd.