International Journal of Parallel Programming
In heterogeneous environments, dynamic scheduling algorithms that balance load are essential for improving the performance of scientific applications. At present, many of these algorithms require prior knowledge of the workload, obtained through profiling, which incurs higher overhead as problem sizes and the number of processors increase. In addition, runtime variations in workload may be unpredictable, making profiling tedious and sometimes even obsolete. Dynamic loop scheduling schemes such as Factoring, Fractiling, and Weighted Factoring have therefore been proposed, and they have proven effective in scientific applications such as Monte Carlo simulations, N-body simulations, and radar applications, among others. Adaptive Factoring, a technique that evolved from these schemes and addresses a wider range of irregularities, has recently been proposed. This paper reports on the performance improvements obtained by integrating Adaptive Factoring into a scientific application that involves computational field simulation on unstructured grids. The performance of the application with Adaptive Factoring is compared with that of implementations using other dynamic loop scheduling techniques. The reported experimental results confirm the benefits of Adaptive Factoring and its high potential for successful integration into other scientific applications, especially those characterized by highly irregular behaviour whose performance degradation is primarily due to load imbalance.
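For readers unfamiliar with the family of schemes named above, the following is a minimal sketch of the chunk-size rule usually associated with basic Factoring, assuming the common halving variant in which each batch hands out roughly half of the remaining iterations in equal chunks, one per processor. The function name `factoring_chunks` and its parameters are hypothetical, chosen for illustration. It is not the Adaptive Factoring technique evaluated in the paper, which is not specified in this abstract.

```python
import math


def factoring_chunks(total_iterations, num_processors):
    """Return chunk sizes under a batch-halving factoring rule (illustrative only).

    Assumption: each batch consists of num_processors equal chunks that
    together cover about half of the iterations still unscheduled.
    """
    chunks = []
    remaining = total_iterations
    while remaining > 0:
        # Chunk size for this batch: half the remaining work, split evenly.
        size = max(1, math.ceil(remaining / (2 * num_processors)))
        for _ in range(num_processors):
            if remaining == 0:
                break
            take = min(size, remaining)  # never hand out more than is left
            chunks.append(take)
            remaining -= take
    return chunks


if __name__ == "__main__":
    # 100 loop iterations scheduled across 4 processors.
    print(factoring_chunks(100, 4))
```

Early chunks are large, which keeps scheduling overhead low, while the shrinking later chunks leave small pieces of work available to even out finishing times across processors.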