The Grid Workloads Archive

Authors:
Alexandru Iosup;Hui Li;Mathieu Jan;Shanny Anoep;Catalin Dumitrescu;Lex Wolters;Dick H. J. Epema
Affiliations:
Faculty of Electrical Engineering, Mathematics, and Computer Science, Delft University of Technology, The Netherlands;LIACS, University of Leiden, The Netherlands;Faculty of Electrical Engineering, Mathematics, and Computer Science, Delft University of Technology, The Netherlands;Faculty of Electrical Engineering, Mathematics, and Computer Science, Delft University of Technology, The Netherlands;Faculty of Electrical Engineering, Mathematics, and Computer Science, Delft University of Technology, The Netherlands;LIACS, University of Leiden, The Netherlands;Faculty of Electrical Engineering, Mathematics, and Computer Science, Delft University of Technology, The Netherlands
Venue:
Future Generation Computer Systems
Year:
2008

Abstract

Although large grids currently support the work of thousands of scientists, very little is known about how they are actually used. Because of strict organizational permissions, few or no traces of grid workloads are available to grid researchers and practitioners. To address this problem, we present the Grid Workloads Archive (GWA), which is both a workload data exchange and a meeting point for the grid community. We define the requirements for building a workload archive and describe the approach taken to meet these requirements with the GWA. We introduce a format for sharing grid workload information, along with tools associated with this format. Using these tools, we collect and analyze data from nine well-known grid environments, which together contain more than 2,000 users submitting more than 7 million jobs over a period of more than 13 operational years, with working environments spanning over 130 sites and comprising 10,000 resources. We show evidence that grid workloads differ markedly from those encountered in other large-scale environments, and in particular from the workloads of parallel production environments: they comprise almost exclusively single-node jobs, and jobs arrive in "bags-of-tasks". Finally, we present the immediate applications of the GWA and its content in several critical areas of grid research and practice: research in grid resource management, and grid design, operation, and maintenance.