IEEE Transactions on Parallel and Distributed Systems
The distributed ASCI Supercomputer project
ACM SIGOPS Operating Systems Review
Job Characteristics of a Production Parallel Scientivic Workload on the NASA Ames iPSC/860
IPPS '95 Proceedings of the Workshop on Job Scheduling Strategies for Parallel Processing
Workload Evolution on the Cornell Theory Center IBM SP2
IPPS '96 Proceedings of the Workshop on Job Scheduling Strategies for Parallel Processing
Memory Usage in the LANL CM-5 Workload
IPPS '97 Proceedings of the Job Scheduling Strategies for Parallel Processing
Scheduling Distributed Applications: the SimGrid Simulation Framework
CCGRID '03 Proceedings of the 3st International Symposium on Cluster Computing and the Grid
A Comparison of Workload Traces from Two Production Parallel Machines
FRONTIERS '96 Proceedings of the 6th Symposium on the Frontiers of Massively Parallel Computation
Resource Co-Allocation in Computational Grids
HPDC '99 Proceedings of the 8th IEEE International Symposium on High Performance Distributed Computing
Scheduling with Advanced Reservations
IPDPS '00 Proceedings of the 14th International Symposium on Parallel and Distributed Processing
Utilization and Predictability in Scheduling the IBM SP2 with Backfilling
IPPS '98 Proceedings of the 12th. International Parallel Processing Symposium on International Parallel Processing Symposium
The workload on parallel supercomputers: modeling the characteristics of rigid jobs
Journal of Parallel and Distributed Computing
The Grid2003 Production Grid: Principles and Practice
HPDC '04 Proceedings of the 13th IEEE International Symposium on High Performance Distributed Computing
Distributed computing in practice: the Condor experience: Research Articles
Concurrency and Computation: Practice & Experience - Grid Performance
Predicting job start times on clusters
CCGRID '04 Proceedings of the 2004 IEEE International Symposium on Cluster Computing and the Grid
CRAWDAD: a community resource for archiving wireless data at Dartmouth
ACM SIGCOMM Computer Communication Review
GRENCHMARK: A Framework for Analyzing, Testing, and Comparing Grids
CCGRID '06 Proceedings of the Sixth IEEE International Symposium on Cluster Computing and the Grid
A batch scheduler with high level components
CCGRID '05 Proceedings of the Fifth IEEE International Symposium on Cluster Computing and the Grid (CCGrid'05) - Volume 2 - Volume 02
GangSim: a simulator for grid scheduling studies
CCGRID '05 Proceedings of the Fifth IEEE International Symposium on Cluster Computing and the Grid (CCGrid'05) - Volume 2 - Volume 02
Provisioning and Scheduling Resources for World-Wide Data-Sharing Services
E-SCIENCE '06 Proceedings of the Second IEEE International Conference on e-Science and Grid Computing
Job Failure Analysis and Its Implications in a Large-Scale Production Grid
E-SCIENCE '06 Proceedings of the Second IEEE International Conference on e-Science and Grid Computing
Advanced resource connector middleware for lightweight computational Grids
Future Generation Computer Systems - Special section: Information engineering and enterprise architecture in distributed computing environments
Characterizing resource availability in enterprise desktop grids
Future Generation Computer Systems
Analysis and Synthesis of Pseudo-Periodic Job Arrivals in Grids: A Matching Pursuit Approach
CCGRID '07 Proceedings of the Seventh IEEE International Symposium on Cluster Computing and the Grid
Build-and-Test Workloads for Grid Middleware: Problem, Analysis, and Applications
CCGRID '07 Proceedings of the Seventh IEEE International Symposium on Cluster Computing and the Grid
Traffic data repository at the WIDE project
ATEC '00 Proceedings of the annual conference on USENIX Annual Technical Conference
Grid'5000: A Large Scale And Highly Reconfigurable Experimental Grid Testbed
International Journal of High Performance Computing Applications
Mining performance data for metascheduling decision support in the grid
Future Generation Computer Systems - Special section: Data mining in grid computing environments
Inter-operating grids through delegated matchmaking
Proceedings of the 2007 ACM/IEEE conference on Supercomputing
Long range dependent job arrival process and its implications in grid environments
Proceedings of the first international conference on Networks for grid applications
On the dynamic resource availability in grids
GRID '07 Proceedings of the 8th IEEE/ACM International Conference on Grid Computing
How are Real Grids Used? The Analysis of Four Grid Traces and Its Implications
GRID '06 Proceedings of the 7th IEEE/ACM International Conference on Grid Computing
Measuring the Performance and Reliability of Production Computational Grids
GRID '06 Proceedings of the 7th IEEE/ACM International Conference on Grid Computing
Modeling job arrivals in a data-intensive grid
JSSPP'06 Proceedings of the 12th international conference on Job scheduling strategies for parallel processing
On grid performance evaluation using synthetic workloads
JSSPP'06 Proceedings of the 12th international conference on Job scheduling strategies for parallel processing
The design and implementation of the KOALA co-allocating grid scheduler
EGC'05 Proceedings of the 2005 European conference on Advances in Grid Computing
The characteristics and performance of groups of jobs in grids
Euro-Par'07 Proceedings of the 13th international Euro-Par conference on Parallel Processing
The performance of bags-of-tasks in large-scale distributed systems
HPDC '08 Proceedings of the 17th international symposium on High performance distributed computing
Inter-operating grids through Delegated MatchMaking
Scientific Programming - Large-Scale Programming Tools and Environments
A Simulation Framework for Studying Economic Resource Management in Grids
ICCS '08 Proceedings of the 8th international conference on Computational Science, Part I
Trace-based evaluation of job runtime and queue wait time predictions in grids
Proceedings of the 18th ACM international symposium on High performance distributed computing
An experimental system for grid meta-broker evaluation
Proceedings of the 1st ACM workshop on Large-Scale system and application performance
A unified format for traces of peer-to-peer systems
Proceedings of the 1st ACM workshop on Large-Scale system and application performance
GMAC '09 Proceedings of the 6th international conference industry session on Grids meets autonomic computing
A performance study of grid workflow engines
GRID '08 Proceedings of the 2008 9th IEEE/ACM International Conference on Grid Computing
A model to predict the optimal performance of the Hierarchical Data Grid
Future Generation Computer Systems
Grid broker selection strategies using aggregated resource information
Future Generation Computer Systems
GMBS: A new middleware service for making grids interoperable
Future Generation Computer Systems
Adaptive grid resource selection based on job history analysis using Plackett-Burman designs
APNOMS'09 Proceedings of the 12th Asia-Pacific network operations and management conference on Management enabling the future internet for changing business and new computing services
Performance analysis of available bandwidth estimation tools for grid networks
The Journal of Supercomputing
The Failure Trace Archive: Enabling Comparative Analysis of Failures in Diverse Distributed Systems
CCGRID '10 Proceedings of the 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing
Discovering Piecewise Linear Models of Grid Workload
CCGRID '10 Proceedings of the 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing
Identification, Modelling and Prediction of Non-periodic Bursts in Workloads
CCGRID '10 Proceedings of the 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing
Predicting the Quality of Service of a Peer-to-Peer Desktop Grid
CCGRID '10 Proceedings of the 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing
Performance analysis of dynamic workflow scheduling in multicluster grids
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
High occupancy resource allocation for grid and cloud systems, a study with DRIVE
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
Multicriteria, multi-user scheduling in grids with advance reservation
Journal of Scheduling
The importance of complete data sets for job scheduling simulations
JSSPP'10 Proceedings of the 15th international conference on Job scheduling strategies for parallel processing
CASP: a community-aware scheduling protocol
International Journal of Grid and Utility Computing
Cloud resource usage: extreme distributions invalidating traditional capacity planning models
Proceedings of the 2nd international workshop on Scientific cloud computing
GroudSim: an event-based simulation framework for computational grids and clouds
Euro-Par 2010 Proceedings of the 2010 conference on Parallel processing
ServiceWave'11 Proceedings of the 4th European conference on Towards a service-based internet
A similarity measure for time, frequency, and dependencies in large-scale workloads
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
SLA-based resource provisioning for heterogeneous workloads in a virtualized cloud datacenter
ICA3PP'11 Proceedings of the 11th international conference on Algorithms and architectures for parallel processing - Volume Part I
A grid broker pricing mechanism for temporal and budget guarantees
EPEW'11 Proceedings of the 8th European conference on Computer Performance Engineering
Cloud Resource Usage--Heavy Tailed Distributions Invalidating Traditional Capacity Planning Models
Journal of Grid Computing
PonD: dynamic creation of HTC pool on demand using a decentralized resource discovery system
Proceedings of the 21st international symposium on High-Performance Parallel and Distributed Computing
SpeQuloS: a QoS service for BoT applications using best effort distributed computing infrastructures
Proceedings of the 21st international symposium on High-Performance Parallel and Distributed Computing
The XtreemOS Resource Selection Service
ACM Transactions on Autonomous and Adaptive Systems (TAAS) - Special Section: Extended Version of SASO 2011 Best Paper
Decentralized scalable fairshare scheduling
Future Generation Computer Systems
ATLAS grid workload on NDGF resources: analysis, modeling, and workload generation
SC '12 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
Achieving high job execution reliability using underutilized resources in a computational economy
Future Generation Computer Systems
Enhancing performance of failure-prone clusters by adaptive provisioning of cloud resources
The Journal of Supercomputing
Characterizing spot price dynamics in public cloud environments
Future Generation Computer Systems
Double auction-inspired meta-scheduling of parallel applications on global grids
Journal of Parallel and Distributed Computing
Euro-Par'12 Proceedings of the 18th international conference on Parallel processing workshops
State-based predictions with self-correction on Enterprise Desktop Grid environments
Journal of Parallel and Distributed Computing
Journal of Parallel and Distributed Computing
Proceedings of the 11th Annual Workshop on Network and Systems Support for Games
Hierarchical scheduling strategies for parallel tasks and advance reservations in grids
Journal of Scheduling
Deconstructing Amazon EC2 Spot Instance Pricing
ACM Transactions on Economics and Computation
Scheduling jobs in the cloud using on-demand and reserved instances
Euro-Par'13 Proceedings of the 19th international conference on Parallel Processing
Toward fine-grained online task characteristics estimation in scientific workflows
WORKS '13 Proceedings of the 8th Workshop on Workflows in Support of Large-Scale Science
Benchmarking Peer-to-Peer Systems
SpeQuloS: a QoS service for hybrid and elastic computing infrastructures
Cluster Computing
Hi-index | 0.00 |
While large grids are currently supporting the work of thousands of scientists, very little is known about their actual use. Due to strict organizational permissions, there are few or no traces of grid workloads available to the grid researcher and practitioner. To address this problem, in this work we present the Grid Workloads Archive (GWA), which is at the same time a workload data exchange and a meeting point for the grid community. We define the requirements for building a workload archive, and describe the approach taken to meet these requirements with the GWA. We introduce a format for sharing grid workload information, and tools associated with this format. Using these tools, we collect and analyze data from nine well-known grid environments, with a total content of more than 2000 users submitting more than 7 million jobs over a period of over 13 operational years, and with working environments spanning over 130 sites comprising 10000 resources. We show evidence that grid workloads are very different from those encountered in other large-scale environments, and in particular from the workloads of parallel production environments: they comprise almost exclusively single-node jobs, and jobs arrive in ''bags-of-tasks''. Finally, we present the immediate applications of the GWA and of its content in several critical grid research and practical areas: research in grid resource management, and grid design, operation, and maintenance.