Stretch optimization for virtual screening on multi-user pilot-agent platforms on grid/cloud

Authors:
Bui The Quang;Nguyen Hong Quang;Emmanuel Medernach;Vincent Breton;Pham Quoc Long
Affiliations:
Vietnam National University, Hanoi, Vietnam;Vietnam National University, Hanoi, Vietnam;Laboratoire de Physique Corpusculaire, Aubière, France;Laboratoire de Physique Corpusculaire, Aubière, France;Institute of Natural Products Chemistry, Hanoi, Vietnam
Venue:
Proceedings of the Fourth Symposium on Information and Communication Technology
Year:
2013



Abstract

Virtual screening has proven very effective on grid infrastructures, where large-scale deployments have led to the identification of active inhibitors for biological targets of interest against malaria, SARS or diabetes. Operating a dedicated virtual screening platform on grid resources requires optimizing the scheduling policy. Scheduling can be done at two levels: the site level and the platform level. Site scheduling is done at each site independently; each site is autonomous in its choice of job scheduling and allocates time slots to different groups of users. Platform scheduling is done at the group level: inside a time slot, jobs from many users are allocated. Pilot agents are sent to sites and act as containers for the actual users' jobs. They pick up users' jobs from a central queue, where the second stage of scheduling is done. In this paper, we focus on a pilot-agent platform shared by many virtual screening users. Such a platform needs a suitable scheduling algorithm to ensure a certain fairness between users. We have studied the scheduling of users' jobs inside the central queue and examined the relevance and impact of different scheduling policies (FIFO, SPT, LPT and Round Robin) on the user experience. The optimization criterion used in our research is the stretch, a measure of user experience on the platform. In a first step, we simulated the operation of virtual screening applications on the pilot-agent platform in order to compare the scheduling policies. In simulation, the Shortest Processing Time (SPT) algorithm was shown to significantly improve scheduling performance. In a second step, the SPT and Longest Processing Time (LPT) scheduling policies were implemented on a DIRAC pilot-agent platform at IFI in Hanoi and tested on the EGI Biomed Virtual Organization. Experimental results are in good agreement with simulation and confirm that the SPT algorithm significantly improves user experience. The relevance of our conclusions also extends to cloud computing: cloud infrastructures are also characterized by limited machine availability.
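
To make the comparison of queue policies concrete, the following is a minimal sketch and not the authors' simulator. It assumes the usual definition of stretch from the scheduling literature, (completion time - release time) / processing time; the abstract itself does not spell out the formula. The names Job, simulate and mean_stretch, the non-preemptive model, and the three-job workload are hypothetical and chosen for illustration only. Round Robin is left out to keep the sketch short.

```python
import heapq
from dataclasses import dataclass


@dataclass(frozen=True)
class Job:
    name: str
    release: float  # time the job enters the central queue
    length: float   # processing time on one pilot agent


# Priority keys for the central queue (smaller key is served first).
POLICIES = {
    "FIFO": lambda j: j.release,
    "SPT": lambda j: j.length,
    "LPT": lambda j: -j.length,
}


def simulate(jobs, policy, agents=1):
    """Non-preemptive queue simulation; returns {job name: stretch}.

    Assumption: stretch = (finish - release) / length.
    """
    key = POLICIES[policy]
    pending = sorted(jobs, key=lambda j: j.release)
    free_at = [0.0] * agents  # min-heap of times at which agents become idle
    queue, stretch, i = [], {}, 0
    while i < len(pending) or queue:
        now = heapq.heappop(free_at)
        # An idle agent with an empty queue waits for the next release.
        if not queue and pending[i].release > now:
            now = pending[i].release
        # Admit every job released up to the current time.
        while i < len(pending) and pending[i].release <= now:
            queue.append(pending[i])
            i += 1
        job = min(queue, key=key)  # ties resolved in release order
        queue.remove(job)
        finish = now + job.length
        heapq.heappush(free_at, finish)
        stretch[job.name] = (finish - job.release) / job.length
    return stretch


def mean_stretch(jobs, policy, agents=1):
    values = simulate(jobs, policy, agents).values()
    return sum(values) / len(values)


if __name__ == "__main__":
    # Hypothetical workload: one long job and two short ones, all released at t=0.
    demo = [Job("A", 0, 10), Job("B", 0, 1), Job("C", 0, 2)]
    for name in POLICIES:
        print(name, round(mean_stretch(demo, name), 2))
```

On this toy workload with a single agent, SPT gives a mean stretch of about 1.27, against about 6.17 for FIFO and 6.67 for LPT: short jobs no longer wait behind the long one. This illustrates the direction of the effect only and says nothing about the magnitudes reported in the paper.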