A task routing approach to large-scale scheduling

Authors:
Javier Celaya;Unai Arronategui
Affiliations:
-;-
Venue:
Future Generation Computer Systems
Year:
2013

Citing 43
Cited 0

Distributed Scheduling of Tasks with Deadlines and Resource Requirements

IEEE Transactions on Computers
A worldwide flock of Condors: load sharing among workstation clusters

Future Generation Computer Systems - Special issue: resource management in distributed systems
Scheduling Algorithms for Multiprogramming in a Hard-Real-Time Environment

Journal of the ACM (JACM)
Data clustering: a review

ACM Computing Surveys (CSUR)
A scalable content-addressable network

Proceedings of the 2001 conference on Applications, technologies, architectures, and protocols for computer communications
Condor: a distributed job scheduler

Beowulf cluster computing with Linux
SETI@home: an experiment in public-resource computing

Communications of the ACM
An automatic design optimization tool and its application to computational fluid dynamics

Proceedings of the 2001 ACM/IEEE conference on Supercomputing
Directed diffusion for wireless sensor networking

IEEE/ACM Transactions on Networking (TON)
Chord: a scalable peer-to-peer lookup protocol for internet applications

IEEE/ACM Transactions on Networking (TON)
Astrolabe: A robust and scalable technology for distributed system monitoring, management, and data mining

ACM Transactions on Computer Systems (TOCS)
Nimrod: a tool for performing parametrised simulations using distributed workstations

HPDC '95 Proceedings of the 4th IEEE International Symposium on High Performance Distributed Computing
A Study of Deadline Scheduling for Client-Server Systems on the Computational Grid

HPDC '01 Proceedings of the 10th IEEE International Symposium on High Performance Distributed Computing
P-Grid: a self-organizing structured P2P system

ACM SIGMOD Record
A scalable distributed information management system

Proceedings of the 2004 conference on Applications, technologies, architectures, and protocols for computer communications
BOINC: A System for Public-Resource Computing and Storage

GRID '04 Proceedings of the 5th IEEE/ACM International Workshop on Grid Computing
VBI-Tree: A Peer-to-Peer Framework for Supporting Multi-Dimensional Indexing Schemes

ICDE '06 Proceedings of the 22nd International Conference on Data Engineering
CompuP2P: An Architecture for Internet Computing Using Peer-to-Peer Networks

IEEE Transactions on Parallel and Distributed Systems
Handling churn in a DHT

ATEC '04 Proceedings of the annual conference on USENIX Annual Technical Conference
Modeling End-to-End Delay Using Pareto Distribution

ICIMP '07 Proceedings of the Second International Conference on Internet Monitoring and Protection
MapReduce: simplified data processing on large clusters

Communications of the ACM - 50th anniversary issue: 1958 - 2008
Trade-offs in matching jobs and balancing load for distributed desktop grids

Future Generation Computer Systems
Falkon: a Fast and Light-weight tasK executiON framework

Proceedings of the 2007 ACM/IEEE conference on Supercomputing
Design and implementation trade-offs for wide-area resource discovery

ACM Transactions on Internet Technology (TOIT)
Wide-scale data stream management

ATC'08 USENIX 2008 Annual Technical Conference on Annual Technical Conference
Cloud computing and emerging IT platforms: Vision, hype, and reality for delivering computing as the 5th utility

Future Generation Computer Systems
Secure data aggregation in wireless sensor networks: A comprehensive overview

Computer Networks: The International Journal of Computer and Telecommunications Networking
The Eucalyptus Open-Source Cloud-Computing System

CCGRID '09 Proceedings of the 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid
On Simulation and Design of Parallel-Systems Schedulers: Are We Doing the Right Thing?

IEEE Transactions on Parallel and Distributed Systems
Grid broker selection strategies using aggregated resource information

Future Generation Computer Systems
Decentralized Aggregation Protocols in Peer-to-Peer Networks: A Survey

MACE '09 Proceedings of the 4th IEEE International Workshop on Modelling Autonomic Communications Environments
A view of cloud computing

Communications of the ACM
Cooperative and decentralized workflow scheduling in global grids

Future Generation Computer Systems
Resource Bundles: Using Aggregation for Statistical Large-Scale Resource Discovery and Management

IEEE Transactions on Parallel and Distributed Systems
WaveGrid: a scalable fast-turnaround heterogeneous peer-based desktop grid system

IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
Modelization and Performance Evaluation of the DIET Middleware

ICPP '10 Proceedings of the 2010 39th International Conference on Parallel Processing
Bag-of-Tasks applications scheduling on volunteer desktop grids with adaptive information dissemination

LCN '10 Proceedings of the 2010 IEEE 35th Conference on Local Computer Networks
Parameter Exploration in Science and Engineering Using Many-Task Computing

IEEE Transactions on Parallel and Distributed Systems
Network-aware summarisation for resource discovery in P2P-content networks

Future Generation Computer Systems
A Highly Scalable Decentralized Scheduler of Tasks with Deadlines

GRID '11 Proceedings of the 2011 IEEE/ACM 12th International Conference on Grid Computing
Scalable architecture for allocation of idle CPUs in a p2p network

HPCC'06 Proceedings of the Second international conference on High Performance Computing and Communications
A-GAP: An Adaptive Protocol for Continuous Network Monitoring with Accuracy Objectives

IEEE Transactions on Network and Service Management
Scheduling efficiency of resource information aggregation in grid networks

Future Generation Computer Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

Scheduling many tasks in environments of millions of unreliable nodes is a challenging problem. To our knowledge, no work in the literature has proposed a solution that also supports many policies with very different objectives. In this paper, we present a decentralized scheduling model that overcomes these problems. A hierarchical network overlay supports a scalable resource discovery and allocation scheme. It uses aggregated information to route tasks to the most suitable execution nodes, and is easily extensible to provide very different scheduling policies. For this paper, we implemented a policy that just allocates tasks to idle nodes, a policy that minimizes the global makespan and a policy that fulfills deadline requirements. With thorough simulation tests, we conclude that our model allocates any number of tasks to several million nodes in just a few seconds, with very low overhead and high resilience. Meanwhile, policies with different objectives implemented on our model perform almost as well as their centralized counterpart.