Ten lectures on wavelets
On the self-similar nature of Ethernet traffic (extended version)
IEEE/ACM Transactions on Networking (TON)
Fast, approximate synthesis of fractional Gaussian noise for generating self-similar network traffic
ACM SIGCOMM Computer Communication Review
Data networks as cascades: investigating the multifractal nature of Internet WAN traffic
Proceedings of the ACM SIGCOMM '98 conference on Applications, technologies, architectures, and protocols for computer communication
The impact of job arrival patterns on parallel scheduling
ACM SIGMETRICS Performance Evaluation Review
Workload Modeling for Performance Evaluation
Performance Evaluation of Complex Systems: Techniques and Tools, Performance 2002, Tutorial Lectures
The workload on parallel supercomputers: modeling the characteristics of rigid jobs
Journal of Parallel and Distributed Computing
Fractal-Based Point Processes
A comprehensive model of the supercomputer workload
WWC '01 Proceedings of the Workload Characterization, 2001. WWC-4. 2001 IEEE International Workshop
Introduction to Probability Models, Ninth Edition
Introduction to Probability Models, Ninth Edition
Analysis and modeling of job arrivals in a production grid
ACM SIGMETRICS Performance Evaluation Review
How are Real Grids Used? The Analysis of Four Grid Traces and Its Implications
GRID '06 Proceedings of the 7th IEEE/ACM International Conference on Grid Computing
Parallel computer workload modeling with markov chains
JSSPP'04 Proceedings of the 10th international conference on Job Scheduling Strategies for Parallel Processing
Workload characteristics of a multi-cluster supercomputer
JSSPP'04 Proceedings of the 10th international conference on Job Scheduling Strategies for Parallel Processing
Workload analysis of a cluster in a grid environment
JSSPP'05 Proceedings of the 11th international conference on Job Scheduling Strategies for Parallel Processing
Wavelet analysis of long-range-dependent traffic
IEEE Transactions on Information Theory
A wavelet-based joint estimator of the parameters of long-range dependence
IEEE Transactions on Information Theory
A multifractal wavelet model with application to network traffic
IEEE Transactions on Information Theory
On non-scale-invariant infinitely divisible cascades
IEEE Transactions on Information Theory
A model to predict the optimal performance of the Hierarchical Data Grid
Future Generation Computer Systems
A Realistic Integrated Model of Parallel System Workloads
CCGRID '10 Proceedings of the 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing
Identification, Modelling and Prediction of Non-periodic Bursts in Workloads
CCGRID '10 Proceedings of the 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing
Analysis and evaluation of grid scheduling algorithms using real workload traces
Proceedings of the International Conference on Management of Emergent Digital EcoSystems
Cloud resource usage: extreme distributions invalidating traditional capacity planning models
Proceedings of the 2nd international workshop on Scientific cloud computing
Towards a profound analysis of bags-of-tasks in parallel systems and their performance impact
Proceedings of the 20th international symposium on High performance distributed computing
A similarity measure for time, frequency, and dependencies in large-scale workloads
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
Cloud Resource Usage--Heavy Tailed Distributions Invalidating Traditional Capacity Planning Models
Journal of Grid Computing
Concurrency and Computation: Practice & Experience
Hi-index | 0.00 |
This paper presents a comprehensive statistical analysis of a variety of workloads collected on production clusters and Grids. The applications are mostly computational-intensive and each task requires single CPU for processing data, which dominate the workloads on current production Grid systems. Trace data obtained on a parallel supercomputer is also included for comparison studies. The statistical properties of workloads are investigated at different levels, including the Virtual Organization (VO) and user behavior. The aggregation procedure and scaling analysis are applied to job arrivals, leading to the identifications of several basic patterns, namely pseudo-periodicity, long range dependence (LRD), and multifractals. It is shown that statistical measures based on interarrivals are of limited usefulness and count based measures should be trusted when it comes to correlations. Other job characteristics like run time and memory consumption are also studied. A "bag-of-tasks" behavior is empirically evidenced, strongly indicating temporal locality. The nature of such dynamics in the Grid workloads is discussed. This study has important implications on workload modeling and performance predictions, and points out the need of comprehensive performance evaluation studies given the workload characteristics.