Automatic generation of DAG parallelism
PLDI '89 Proceedings of the ACM SIGPLAN 1989 Conference on Programming language design and implementation
Determining average program execution times and their variance
PLDI '89 Proceedings of the ACM SIGPLAN 1989 Conference on Programming language design and implementation
Task granularity analysis in logic programs
PLDI '90 Proceedings of the ACM SIGPLAN 1990 conference on Programming language design and implementation
Instruction reordering for fork-join parallelism
PLDI '90 Proceedings of the ACM SIGPLAN 1990 conference on Programming language design and implementation
A foundation for sequentializing parallel code
SPAA '90 Proceedings of the second annual ACM symposium on Parallel algorithms and architectures
PAWS: A Performance Evaluation Tool for Parallel Computing Systems
Computer - Special issue on experimental research in computer architecture
Tiling multidimensional iteration spaces for nonshared memory machines
Proceedings of the 1991 ACM/IEEE conference on Supercomputing
Interprocedural transformations for parallel code generation
Proceedings of the 1991 ACM/IEEE conference on Supercomputing
A fast static scheduling algorithm for DAGs on an unbounded number of processors
Proceedings of the 1991 ACM/IEEE conference on Supercomputing
Automatic partitioning of a program dependence graph into parallel tasks
IBM Journal of Research and Development
A comprehensive approach to parallel data flow analysis
ICS '92 Proceedings of the 6th international conference on Supercomputing
PYRROS: static task scheduling and code generation for message passing multiprocessors
ICS '92 Proceedings of the 6th international conference on Supercomputing
Models of machines and computation for mapping in multicomputers
ACM Computing Surveys (CSUR)
A static parameter based performance prediction tool for parallel programs
ICS '93 Proceedings of the 7th international conference on Supercomputing
Performance prediction of parallel processing systems: the PAMELA methodology
ICS '93 Proceedings of the 7th international conference on Supercomputing
Compiling nested data-parallel programs for shared-memory multiprocessors
ACM Transactions on Programming Languages and Systems (TOPLAS)
Precise compile-time performance prediction for superscalar-based computers
PLDI '94 Proceedings of the ACM SIGPLAN 1994 conference on Programming language design and implementation
Compiling performance models from parallel programs
ICS '94 Proceedings of the 8th international conference on Supercomputing
Compiler transformations for high-performance computing
ACM Computing Surveys (CSUR)
Optimal mapping of sequences of data parallel tasks
PPOPP '95 Proceedings of the fifth ACM SIGPLAN symposium on Principles and practice of parallel programming
Upper time bounds for executing PRAM-programs on the LogP-machine
ICS '95 Proceedings of the 9th international conference on Supercomputing
A partitioning-independent paradigm for nested data parallelism
PACT '95 Proceedings of the IFIP WG10.3 working conference on Parallel architectures and compilation techniques
PACT '95 Proceedings of the IFIP WG10.3 working conference on Parallel architectures and compilation techniques
Execution Time Analysis of Communicating Tasks in Distributed Systems
IEEE Transactions on Computers
Task Clustering and Scheduling for Distributed Memory Parallel Architectures
IEEE Transactions on Parallel and Distributed Systems
IEEE Transactions on Parallel and Distributed Systems
Symbolic analysis for parallelizing compilers
ACM Transactions on Programming Languages and Systems (TOPLAS)
Optimal latency-throughput tradeoffs for data parallel pipelines
Proceedings of the eighth annual ACM symposium on Parallel algorithms and architectures
Run-time compilation for parallel sparse matrix computations
ICS '96 Proceedings of the 10th international conference on Supercomputing
Automatic performance prediction to support cross development of parallel programs
SPDT '96 Proceedings of the SIGMETRICS symposium on Parallel and distributed tools
Clustering task graphs for message passing architectures
ICS '90 Proceedings of the 4th international conference on Supercomputing
Optimal weighted loop fusion for parallel programs
Proceedings of the ninth annual ACM symposium on Parallel algorithms and architectures
Space and time efficient execution of parallel irregular computations
PPOPP '97 Proceedings of the sixth ACM SIGPLAN symposium on Principles and practice of parallel programming
On Parallelization of Static Scheduling Algorithms
IEEE Transactions on Software Engineering
A Framework for Exploiting Task and Data Parallelism on Distributed Memory Multicomputers
IEEE Transactions on Parallel and Distributed Systems
Optimal Scheduling Algorithm for Distributed-Memory Machines
IEEE Transactions on Parallel and Distributed Systems
Scheduling Algorithms for Parallel Gaussian Elimination With Communication Costs
IEEE Transactions on Parallel and Distributed Systems
Improved Compressions of Cube-Connected Cycles Networks
IEEE Transactions on Parallel and Distributed Systems
Analyzing Asynchronous Pipeline Schedules
International Journal of Parallel Programming
Space-time scheduling of instruction-level parallelism on a raw machine
Proceedings of the eighth international conference on Architectural support for programming languages and operating systems
Space/time-efficient scheduling and execution of parallel irregular computations
ACM Transactions on Programming Languages and Systems (TOPLAS)
IEEE Transactions on Parallel and Distributed Systems
Optimized rapid prototyping for real-time embedded heterogeneous multiprocessors
CODES '99 Proceedings of the seventh international workshop on Hardware/software codesign
On Parallelizing the Multiprocessor Scheduling Problem
IEEE Transactions on Parallel and Distributed Systems
Temporal Partitioning and Scheduling Data Flow Graphs for Reconfigurable Computers
IEEE Transactions on Computers
An object-oriented environment for simulation and evaluation of architectures
ANSS '92 Proceedings of the 25th annual symposium on Simulation
Proceedings of the 36th annual ACM/IEEE Design Automation Conference
ACM Transactions on Programming Languages and Systems (TOPLAS)
Optimal Clustering of Tree-Sweep Computations for High-Latency Parallel Environments
IEEE Transactions on Parallel and Distributed Systems
Clustering Algorithm for Parallelizing Software Systems in Multiprocessors Environment
IEEE Transactions on Software Engineering - Special issue on architecture-independent languages and software tools parallel processing
Task scheduling using a block dependency DAG for block-oriented sparse Cholesky factorization
SAC '00 Proceedings of the 2000 ACM symposium on Applied computing - Volume 2
Static scheduling algorithms for allocating directed task graphs to multiprocessors
ACM Computing Surveys (CSUR)
Efficient Local Search for DAG Scheduling
IEEE Transactions on Parallel and Distributed Systems
A duplication based compile time scheduling method for task parallelism
Compiler optimizations for scalable parallel systems
Automatic partitioning and virtual scheduling for efficient parallel execution
ACM-SE 30 Proceedings of the 30th annual Southeast regional conference
A framework for performance-based program partitioning
Progress in computer research
Hardware-software cosynthesis for digital systems
Readings in hardware/software co-design
A framework for performance-based program partitioning
Progress in computer research
Low-Cost Task Scheduling for Distributed-Memory Machines
IEEE Transactions on Parallel and Distributed Systems
Fault-tolerant recursive least-squares computations on a mesh-connected parallel processor
Journal of Parallel and Distributed Computing
Evolving toward an optimal scheduling solution through adaptivity
Journal of Parallel and Distributed Computing
Link contention-constrained scheduling and mapping of tasks
Cluster Computing
Communication and memory requirements as the basis for mapping task and data parallel programs
Proceedings of the 1994 ACM/IEEE conference on Supercomputing
Parallelising large irregular programs: an experience with Naira
Information Sciences—Informatics and Computer Science: An International Journal - Special issue: Software engineering: Systems and tools
A Robust Compile Time Method for SchedulingTask Parallelism on Distributed Memory Machines
The Journal of Supercomputing
Task Parallelism in a High Performance Fortran Framework
IEEE Parallel & Distributed Technology: Systems & Technology
CASCH: A Tool for Computer-Aided Scheduling
IEEE Concurrency
Hardware-Software Cosynthesis for Digital Systems
IEEE Design & Test
Hypertool: A Programming Aid for Message-Passing Systems
IEEE Transactions on Parallel and Distributed Systems
IEEE Transactions on Parallel and Distributed Systems
Declustering: A New Multiprocessor Scheduling Technique
IEEE Transactions on Parallel and Distributed Systems
On the Granularity and Clustering of Directed Acyclic Task Graphs
IEEE Transactions on Parallel and Distributed Systems
Program Structuring for Effective Parallel Portability
IEEE Transactions on Parallel and Distributed Systems
IEEE Transactions on Parallel and Distributed Systems
DSC: Scheduling Parallel Tasks on an Unbounded Number of Processors
IEEE Transactions on Parallel and Distributed Systems
Heuristic Algorithms for Scheduling Iterative Task Computations on Distributed Memory Machines
IEEE Transactions on Parallel and Distributed Systems
Symbolic Performance Modeling of Parallel Systems
IEEE Transactions on Parallel and Distributed Systems
A Scheduling Model for Grid Computing Systems
GRID '01 Proceedings of the Second International Workshop on Grid Computing
Hindsight Helps: Deterministic Task Scheduling with Backtracking
ICPP '97 Proceedings of the international Conference on Parallel Processing
Efficient Run-Time Support for Irregular Task Computations with Mixed Granularities
IPPS '96 Proceedings of the 10th International Parallel Processing Symposium
A New Clustering Algorithm for Large Communication Delays
IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
A Comparison of General Approaches to Multiprocessor Scheduling
IPPS '97 Proceedings of the 11th International Symposium on Parallel Processing
Genetic Scheduling on Minimal Processing Elements in the Grid
AI '02 Proceedings of the 15th Australian Joint Conference on Artificial Intelligence: Advances in Artificial Intelligence
Design and Implementation of an Efficient Thread Partitioning Algorithm
ISHPC '00 Proceedings of the Third International Symposium on High Performance Computing
Multiprocessor Clustering for Embedded Systems
Euro-Par '01 Proceedings of the 7th International Euro-Par Conference Manchester on Parallel Processing
An Automatic Scheduler for Parallel Machines (Research Note)
Euro-Par '02 Proceedings of the 8th International Euro-Par Conference on Parallel Processing
A Platform Independent Parallelising Tool Based on Graph Theoretic Models
VECPAR '00 Selected Papers and Invited Talks from the 4th International Conference on Vector and Parallel Processing
Scheduling Task Graphs on Arbitrary Processor Architectures Considering Contention
HPCN Europe 2001 Proceedings of the 9th International Conference on High-Performance Computing and Networking
Automatic Parallelization and Scheduling of Programs on Multiprocessors using CASCH
ICPP '97 Proceedings of the international Conference on Parallel Processing
TOPPER: A Tool for Optimizing the Performance of Parallel Applications
Proceedings of the 8th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
Processor Allocation and Task Scheduling of Matrix Chain Products on Parallel Systems
IEEE Transactions on Parallel and Distributed Systems
Effect of variation in compile time costs on scheduling tasks on distributed memory systems
FRONTIERS '96 Proceedings of the 6th Symposium on the Frontiers of Massively Parallel Computation
Mapping heterogeneous task graphs onto heterogeneous system graphs
HCW '97 Proceedings of the 6th Heterogeneous Computing Workshop (HCW '97)
On finding optimal clusterings of task graphs
PAS '95 Proceedings of the First Aizu International Symposium on Parallel Algorithms/Architecture Synthesis
PAS '95 Proceedings of the First Aizu International Symposium on Parallel Algorithms/Architecture Synthesis
Runtime Parallel Incremental Scheduling of DAGs
ICPP '00 Proceedings of the Proceedings of the 2000 International Conference on Parallel Processing
A Scalable Task Duplication Based Scheduling Algorithm for Heterogeneous Systems
ICPP '00 Proceedings of the Proceedings of the 2000 International Conference on Parallel Processing
Benchmarking the Task Graph Scheduling Algorithms
IPPS '98 Proceedings of the 12th. International Parallel Processing Symposium on International Parallel Processing Symposium
VPPB -- A Visualization and Performance Prediction Tool for Multithreaded Solaris Programs
IPPS '98 Proceedings of the 12th. International Parallel Processing Symposium on International Parallel Processing Symposium
A task duplication scheme for resolving deadlocks in clustered DAGs
Parallel Computing
On Task Scheduling Accuracy: Evaluation Methodology and Results
The Journal of Supercomputing
Improving Scheduling of Tasks in a Heterogeneous Environment
IEEE Transactions on Parallel and Distributed Systems
Proceedings of the 1st conference on Computing frontiers
Mapping Applications to an FPFA Tile
DATE '03 Proceedings of the conference on Design, Automation and Test in Europe - Volume 1
Decentralizing execution of composite web services
OOPSLA '04 Proceedings of the 19th annual ACM SIGPLAN conference on Object-oriented programming, systems, languages, and applications
Compact DAG representation and its symbolic scheduling
Journal of Parallel and Distributed Computing
Dynamic Critical Path Scheduling Parallel Programs onto Multiprocessors
IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Workshop 8 - Volume 09
Communication Contention in Task Scheduling
IEEE Transactions on Parallel and Distributed Systems
ACSW Frontiers '05 Proceedings of the 2005 Australasian workshop on Grid computing and e-research - Volume 44
Low-Cost Static Performance Prediction of Parallel Stochastic Task Compositions
IEEE Transactions on Parallel and Distributed Systems
Dynamic mapping of cooperating tasks to nodes in a distributed system
Future Generation Computer Systems
Contentions-conscious dynamic but deterministic scheduling of computational and communication tasks
Proceedings of the 2006 ACM symposium on Applied computing
Contentions-conscious dynamic but deterministic scheduling of computational and communication tasks
Proceedings of the 2006 ACM symposium on Applied computing
CODES+ISSS '06 Proceedings of the 4th international conference on Hardware/software codesign and system synthesis
Efficient scheduling algorithm for component-based networks
Future Generation Computer Systems
Automatic choice of scheduling heuristics for parallel/distributed computing
Scientific Programming
GridRod: a dynamic runtime scheduler for grid workflows
Proceedings of the 21st annual international conference on Supercomputing
Automated memory-aware application distribution for Multi-processor System-on-Chips
Journal of Systems Architecture: the EUROMICRO Journal
A New Task Graph Model for Mapping Message Passing Applications
IEEE Transactions on Parallel and Distributed Systems
A dominant predecessor duplication scheduling algorithm for heterogeneous systems
The Journal of Supercomputing
Scheduling real-time fork-join structures in cluster computing
International Journal of High Performance Computing and Networking
Data parallel scheduling of operations in linear algebra on heterogeneous clusters
DIWEB'06 Proceedings of the 5th WSEAS International Conference on Distance Learning and Web Engineering
Multithreaded simulation for synchronous dataflow graphs
Proceedings of the 45th annual Design Automation Conference
International Journal of High Performance Computing Applications
An Evolutionary Approach to Task Graph Scheduling
ICANNGA '07 Proceedings of the 8th international conference on Adaptive and Natural Computing Algorithms, Part I
Communication-Aware Scheduling Algorithm Based on Heterogeneous Computing Systems
ICCS '07 Proceedings of the 7th international conference on Computational Science, Part I: ICCS 2007
Resource allocation algorithm for light communication grid-based workflows within an SLA context
International Journal of Parallel, Emergent and Distributed Systems
Proceedings of the 14th ACM SIGPLAN symposium on Principles and practice of parallel programming
CellSs: Scheduling techniques to better exploit memory hierarchy
Scientific Programming - High Performance Computing with the Cell Broadband Engine
Optimal and heuristic solution methods for a multiprocessor machine scheduling problem
Computers and Operations Research
A New Genetic Algorithm for Scheduling for Large Communication Delays
Euro-Par '09 Proceedings of the 15th International Euro-Par Conference on Parallel Processing
Dynamic mapping of cooperating tasks to nodes in a distributed system
Future Generation Computer Systems
Extending IC-scheduling via the Sweep Algorithm
Journal of Parallel and Distributed Computing
PCI'01 Proceedings of the 8th Panhellenic conference on Informatics
Scheduling parallel tasks onto NUMA multiprocessors with inter-processor communication overhead
ISPA'03 Proceedings of the 2003 international conference on Parallel and distributed processing and applications
A throughput-driven task creation and mapping for network processors
HiPEAC'07 Proceedings of the 2nd international conference on High performance embedded architectures and compilers
Manycore performance analysis using timed configuration graphs
SAMOS'09 Proceedings of the 9th international conference on Systems, architectures, modeling and simulation
Towards Energy Aware Scheduling for Precedence Constrained Parallel Tasks in a Cluster with DVFS
CCGRID '10 Proceedings of the 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing
CODES/ISSS '10 Proceedings of the eighth IEEE/ACM/IFIP international conference on Hardware/software codesign and system synthesis
Area-maximizing schedules for series-parallel DAGs
Euro-Par'10 Proceedings of the 16th international Euro-Par conference on Parallel processing: Part II
DAG scheduling on heterogeneous distributed systems using learning automata
ACIIDS'10 Proceedings of the Second international conference on Intelligent information and database systems: Part II
EURO-PDP'00 Proceedings of the 8th Euromicro conference on Parallel and distributed processing
A unified design space simulation environment for network-on-chip: fuse-N
International Journal of High Performance Systems Architecture
Recursive least squares problem implementation on a generalized interconnection of DSP processors
ICASSP'93 Proceedings of the 1993 IEEE international conference on Acoustics, speech, and signal processing: digital speech processing - Volume III
Scheduling with uncertainties on new computing platforms
Computational Optimization and Applications
Multithreaded Simulation for Synchronous Dataflow Graphs
ACM Transactions on Design Automation of Electronic Systems (TODAES)
A hybrid heuristic-genetic algorithm for task scheduling in heterogeneous processor networks
Journal of Parallel and Distributed Computing
Scheduling tasks and communications on a hierarchical system with message contention
ICA3PP'11 Proceedings of the 11th international conference on Algorithms and architectures for parallel processing - Volume Part II
IWDC'05 Proceedings of the 7th international conference on Distributed Computing
Scheduling for heterogeneous Systems using constrained critical paths
Parallel Computing
DAG-Based software frameworks for PDEs
Euro-Par'11 Proceedings of the 2011 international conference on Parallel Processing
Dissemination of state information in distributed autonomous systems
Computer Communications
Efficient parallel implementation of sequence analysis algorithms using a global address space model
Mathematical and Computer Modelling: An International Journal
Resource management of distributed virtual machines
International Journal of Ad Hoc and Ubiquitous Computing
Automatic extraction of multi-objective aware pipeline parallelism using genetic algorithms
Proceedings of the eighth IEEE/ACM/IFIP international conference on Hardware/software codesign and system synthesis
Scheduling streaming applications on a complex multicore platform
Concurrency and Computation: Practice & Experience
Optimal linear programming solutions for multiprocessor scheduling with communication delays
ICA3PP'12 Proceedings of the 12th international conference on Algorithms and Architectures for Parallel Processing - Volume Part I
Demystifying page load performance with WProf
nsdi'13 Proceedings of the 10th USENIX conference on Networked Systems Design and Implementation
Multi-objective aware extraction of task-level parallelism using genetic algorithms
DATE '12 Proceedings of the Conference on Design, Automation and Test in Europe
Search strategies for optimal multi-way number partitioning
IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Æminium: A Permission-Based Concurrent-by-Default Programming Language Approach
ACM Transactions on Programming Languages and Systems (TOPLAS)
Journal of Systems Architecture: the EUROMICRO Journal
Hi-index | 0.01 |
From the Publisher:This book is one of the first to address the problem of forming useful parallelism from potential parallelism and to provide a general solution. The book presents two approaches to automatic partitioning and scheduling so that the same parallel program can be made to execute efficiently on widely different multiprocessors. The first approach is based on a macro dataflow model in which the program is partitioned into tasks at compile time and the tasks are scheduled on processors at run time. The second approach is based on a compile time scheduling model, where both the partitioning and scheduling are performed at compile time. Both approaches have been implemented in partition programs written in the single assignment language SISAL. The inputs to the partitioning and scheduling algorithms are a graphical representation of the parallel program and a list of parameters describing the target multiprocessor. Execution profile information is used to derive compile-time estimates of execution times and data sizes in the program. Both the macro dataflow and compile-time scheduling problems are expressed as optimization problems and are shown to be NP complete in the strong sense. Efficient approximation algorithms for these problems are presented. Finally, the effectiveness of the partitioning and scheduling algorithms is studied by multiprocessor simulations of various SISAL benchmark programs for different target multiprocessor parameters. Vivek Sarkar is a Member of Research Staff at the IBM T. J. Watson Research Center. Partitioning and Scheduling Parallel Programs for Multiprocessing is included in the series Research Monographs in Parallel and DistributedComputing. Copublished with Pitman Publishing.