Scheduling many tasks in environments with millions of unreliable nodes is a challenging problem. To our knowledge, no work in the literature has proposed a solution that also supports many policies with very different objectives. In this paper, we present a decentralized scheduling model that addresses both problems. A hierarchical network overlay supports a scalable resource discovery and allocation scheme. The overlay uses aggregated information to route tasks to the most suitable execution nodes, and it is easily extended to provide very different scheduling policies. For this paper, we implemented three policies: one that simply allocates tasks to idle nodes, one that minimizes the global makespan, and one that fulfills deadline requirements. From thorough simulation tests, we conclude that our model allocates any number of tasks to several million nodes in just a few seconds, with very low overhead and high resilience. Moreover, policies with different objectives implemented on our model perform almost as well as their centralized counterparts.
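As a rough illustration of routing with aggregated information, the sketch below walks a task down a small tree toward the subtree that reports the most idle nodes. It is not the model described in the paper: the `Node` class, the `idle_count` summary, and the `route` function are hypothetical names chosen for this example, and a single per-subtree idle count stands in for whatever aggregated information the real overlay maintains.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Node:
    """Hypothetical overlay node; leaves execute tasks, inner nodes only route."""
    name: str
    children: List["Node"] = field(default_factory=list)
    idle: bool = True      # meaningful for leaves only
    idle_count: int = 0    # aggregated summary: idle leaves in this subtree

    def refresh(self) -> int:
        """Recompute the aggregated idle count bottom-up."""
        if not self.children:
            self.idle_count = 1 if self.idle else 0
        else:
            self.idle_count = sum(child.refresh() for child in self.children)
        return self.idle_count


def route(node: Node) -> Optional[str]:
    """Route one task downwards; return the chosen leaf name, or None if full."""
    if node.idle_count == 0:
        return None
    node.idle_count -= 1   # one idle slot in this subtree is now taken
    if not node.children:
        node.idle = False
        return node.name
    # Follow the child whose summary reports the most idle leaves.
    best = max(node.children, key=lambda child: child.idle_count)
    return route(best)


if __name__ == "__main__":
    root = Node("root", [Node("A", [Node("a1"), Node("a2")]),
                         Node("B", [Node("b1")])])
    root.refresh()
    print([route(root) for _ in range(4)])
```

Each routing decision reads only the summaries of a node's direct children, which is the property that lets this style of scheme avoid a global view. A makespan or deadline policy would need a richer summary than an idle count, and that part is not sketched here.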