Applying reinforcement learning to scheduling strategies in an actual grid environment
International Journal of High Performance Systems Architecture
One of the challenges in Grid computing research is to provide a means to automatically submit, manage, and monitor applications whose main characteristic is to be composed of a large number of tasks. The large number of explicit tasks, generally placed on a centralized job queue, can cause several problems: (1) they can quickly exhaust the memory of the submission machine; (2) they can deteriorate the response time of the submission machine, since managing the remote execution of each task demands too many open ports; (3) they may cause network traffic congestion if all tasks try to transfer input and/or output files across the network at the same time; (4) they make it impossible for the user to follow execution progress without an automatic tool or interface; (5) they may depend on fault-tolerance mechanisms implemented at the application level to ensure that all tasks terminate successfully. In this work we present and validate a novel architectural model, GRAND (Grid Robust ApplicatioN Deployment), whose main objective is to deal with the submission of a large number of tasks. Copyright © 2006 John Wiley & Sons, Ltd.