Theory of linear and integer programming
Theory of linear and integer programming
The Markov-modulated Poisson process (MMPP) cookbook
Performance Evaluation
On the self-similar nature of Ethernet traffic (extended version)
IEEE/ACM Transactions on Networking (TON)
An EM algorithm for estimation in Markov-modulated Poisson processes
Computational Statistics & Data Analysis
The elusive goal of workload characterization
ACM SIGMETRICS Performance Evaluation Review
The impact of job arrival patterns on parallel scheduling
ACM SIGMETRICS Performance Evaluation Review
Workload Modeling for Performance Evaluation
Performance Evaluation of Complex Systems: Techniques and Tools, Performance 2002, Tutorial Lectures
Aggregate matrix-analytic techniques and their applications
Aggregate matrix-analytic techniques and their applications
The workload on parallel supercomputers: modeling the characteristics of rigid jobs
Journal of Parallel and Distributed Computing
Grid resource management: state of the art and future trends
Grid resource management: state of the art and future trends
DI-GRUBER: A Distributed Approach to Grid Resource Brokering
SC '05 Proceedings of the 2005 ACM/IEEE conference on Supercomputing
Nonlinear Time Series Analysis
Nonlinear Time Series Analysis
A comprehensive model of the supercomputer workload
WWC '01 Proceedings of the Workload Characterization, 2001. WWC-4. 2001 IEEE International Workshop
A Unifying Framework for Detecting Outliers and Change Points from Time Series
IEEE Transactions on Knowledge and Data Engineering
Parallel computer workload modeling with markov chains
JSSPP'04 Proceedings of the 10th international conference on Job Scheduling Strategies for Parallel Processing
Workload characteristics of a multi-cluster supercomputer
JSSPP'04 Proceedings of the 10th international conference on Job Scheduling Strategies for Parallel Processing
Workload analysis of a cluster in a grid environment
JSSPP'05 Proceedings of the 11th international conference on Job Scheduling Strategies for Parallel Processing
Future Generation Computer Systems
The performance of bags-of-tasks in large-scale distributed systems
HPDC '08 Proceedings of the 17th international symposium on High performance distributed computing
International Journal of Parallel, Emergent and Distributed Systems - Papers from the Workshop on Dependable Parallel and Network-Centric Systems
A performance study of grid workflow engines
GRID '08 Proceedings of the 2008 9th IEEE/ACM International Conference on Grid Computing
A model to predict the optimal performance of the Hierarchical Data Grid
Future Generation Computer Systems
Energy aware scheduling on desktop grid environment with static performance prediction
SpringSim '09 Proceedings of the 2009 Spring Simulation Multiconference
Job demand models for optical grid research
ONDM'07 Proceedings of the 11th international IFIP TC6 conference on Optical network design and modeling
Predicting cost amortization for query services
Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
Cloud resource usage: extreme distributions invalidating traditional capacity planning models
Proceedings of the 2nd international workshop on Scientific cloud computing
Cloud Resource Usage--Heavy Tailed Distributions Invalidating Traditional Capacity Planning Models
Journal of Grid Computing
Job allocation strategies for energy-aware and efficient Grid infrastructures
Journal of Systems and Software
Hi-index | 0.00 |
In this paper we present an initial analysis of job arrivals in a production data-intensive Grid and investigate several traffic models to characterize the interarrival time processes. Our analysis focuses on the heavy-tail behavior and autocorrelation structures, and the modeling is carried out at three different levels: Grid, Virtual Organization (VO), and region. A set of m-state Markov modulated Poisson processes (MMPP) is investigated, while Poisson processes and hyperexponential renewal processes are evaluated for comparison studies. We apply the transportation distance metric from dynamical systems theory to further characterize the differences between the data trace and the simulated time series, and estimate errors by bootstrapping. The experimental results show that MMPPs with a certain number of states are successful to a certain extent in simulating the job traffic at different levels, fitting both the interarrival time distribution and the autocorrelation function. However, MMPPs are not able to match the autocorrelations for certain VOs, in which strong deterministic semi-periodic patterns are observed. These patterns are further characterized using different representations. Future work is needed to model both deterministic and stochastic components in order to better capture the correlation structure in the series.