Non-periodic bursts are prevalent in the workloads of large-scale applications. Existing workload models predict such bursts poorly because they focus mainly on repeatable base functions. We begin by showing why workload models need to include bursts, investigating their detrimental effects in a petabyte-scale distributed data management system. This work then makes three contributions. First, we analyse the accuracy of five existing prediction models on workloads of data and computational grids, as well as on derived synthetic workloads. Second, we introduce a novel averages-based model to predict bursts in arbitrary workloads. Third, we present a novel metric, mean absolute estimated distance, to assess the prediction accuracy of the model. Using our model and metric, we show that burst behaviour in workloads can be identified, quantified and predicted independently of the underlying base functions. Furthermore, our model and metric are applicable to arbitrary kinds of burst prediction for time series.