Improving scheduling decisions by using knowledge about parallel applications resource usage

Authors:
Luciano José Senger;Rodrigo Fernandes de Mello;Marcos José Santana;Regina Helena Carlucci Santana;Laurence Tianruo Yang
Affiliations:
Departamento de Informática, Universidade Estadual de Ponta Grossa, Ponta Grossa, PR;Instituto de Ciências Matemáticas e de, Computação, Universidade de São Paulo, São Carlos, SP;Instituto de Ciências Matemáticas e de, Computação, Universidade de São Paulo, São Carlos, SP;Instituto de Ciências Matemáticas e de, Computação, Universidade de São Paulo, São Carlos, SP;Department of Computer Science, St. Francis Xavier University, Antigonish, NS, Canada
Venue:
HPCC'05 Proceedings of the First international conference on High Performance Computing and Communications
Year:
2005

Citing 15
Cited 3

Characterizations of parallelism in applications and their use in scheduling

SIGMETRICS '89 Proceedings of the 1989 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Instance-Based Learning Algorithms

Machine Learning
Parallel architectures

ACM Computing Surveys (CSUR)
Using parallel program characteristics in dynamic processor allocation policies

Performance Evaluation
Exploiting process lifetime distributions for dynamic load balancing

ACM Transactions on Computer Systems (TOCS)
Processor allocation in multiprogrammed distributed-memory parallel computer systems

Journal of Parallel and Distributed Computing
Utilization, Predictability, Workloads, and User Runtime Estimates in Scheduling the IBM SP2 with Backfilling

IEEE Transactions on Parallel and Distributed Systems
Implicit coscheduling: coordinated scheduling with implicit information in distributed systems

ACM Transactions on Computer Systems (TOCS)
Understanding the Linux Kernel

Understanding the Linux Kernel
A Case for NOW (Networks of Workstations)

IEEE Micro
A Historical Application Profiler for Use by Parallel Schedulers

IPPS '97 Proceedings of the Job Scheduling Strategies for Parallel Processing
Predicting Application Run Times Using Historical Information

IPPS/SPDP '98 Proceedings of the Workshop on Job Scheduling Strategies for Parallel Processing
Metrics for Parallel Job Scheduling and Their Convergence

JSSPP '01 Revised Papers from the 7th International Workshop on Job Scheduling Strategies for Parallel Processing
An Experimental Study of Load Balancing Performance

An Experimental Study of Load Balancing Performance
A New Migration Model based on the Evaluation of Processes Load and Lifetime on Heterogeneous Computing Environments

SBAC-PAD '04 Proceedings of the 16th Symposium on Computer Architecture and High Performance Computing

Prediction of dynamical, nonlinear, and unstable process behavior

The Journal of Supercomputing
Process scheduling using ant colony optimization techniques

ISPA'06 Proceedings of the 4th international conference on Parallel and Distributed Processing and Applications
An effective and robust two-phase resource allocation scheme for interdependent tasks in mobile ad hoc computational Grids

Journal of Parallel and Distributed Computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper presents a process scheduling algorithm that uses information about the capacity of the processing elements over the communication network and parallel applications in order to allocate resources on heterogeneous and distributed environments. The information about the applications is composed by the resources usage behavior (percentage values related to CPU’s utilization, network send and network receive) and by the prediction of the execution time of tasks that make up a parallel distribution. The knowledge about the resources usage is obtained by means of the Art2A self-organizing artificial neural network and by a specific labeling algorithm; the knowledge about the execution time is obtained through the learning techniques based on instances. The knowledge about the application execution features, combined with the information about the computing capacity of the resources available in the environment, are used as an entry to improve the decisions of the proposed scheduling algorithm. Such algorithm uses genetic algorithm techniques to find out the most appropriate computing resources subset to support the applications. The proposed algorithm is evaluated through simulation by using a model parameterized with the features obtained from a real distributed scenario. The results obtained by the evaluation show that the scheduling that uses the genetic search allows a better allocation of computing resources on environments composed of tens of computers on which the parallel applications are composed by tens of tasks.