Partitioning Problems in Parallel, Pipeline, and Distributed Computing
IEEE Transactions on Computers
PYRROS: static task scheduling and code generation for message passing multiprocessors
ICS '92 Proceedings of the 6th international conference on Supercomputing
Exploiting task and data parallelism on a multicomputer
PPOPP '93 Proceedings of the fourth ACM SIGPLAN symposium on Principles and practice of parallel programming
Generating communication for array statements: design, implementation, and evaluation
Journal of Parallel and Distributed Computing - Special issue on data parallel algorithms and programming
Supporting systolic and memory communication in iWarp
ISCA '90 Proceedings of the 17th annual international symposium on Computer Architecture
Partitioning and Scheduling Parallel Programs for Multiprocessors
Partitioning and Scheduling Parallel Programs for Multiprocessors
Communication and memory requirements as the basis for mapping task and data parallel programs
Proceedings of the 1994 ACM/IEEE conference on Supercomputing
IEEE Transactions on Pattern Analysis and Machine Intelligence
Optimal Processor Assignment for a Class of Pipelined Computations
IEEE Transactions on Parallel and Distributed Systems
Proceedings of the 5th International Workshop on Languages and Compilers for Parallel Computing
Do&Merge: Integrating Parallel Loops and Reductions
Proceedings of the 6th International Workshop on Languages and Compilers for Parallel Computing
Automatic Mapping of Task and Data Parallel Programs for Efficient Execution on Multicomputers
Automatic Mapping of Task and Data Parallel Programs for Efficient Execution on Multicomputers
Optimal mapping of sequences of data parallel tasks
PPOPP '95 Proceedings of the fifth ACM SIGPLAN symposium on Principles and practice of parallel programming
Optimizing memory system performance for communication in parallel computers
ISCA '95 Proceedings of the 22nd annual international symposium on Computer architecture
Decoupling synchronization and data transfer in message passing systems of parallel computers
ICS '95 Proceedings of the 9th international conference on Supercomputing
Optimal latency-throughput tradeoffs for data parallel pipelines
Proceedings of the eighth annual ACM symposium on Parallel algorithms and architectures
Integrating task and data parallelism using shared objects
ICS '96 Proceedings of the 10th international conference on Supercomputing
Fine grain parallel communication on general purpose LANs
ICS '96 Proceedings of the 10th international conference on Supercomputing
Network-Based Multicomputers: A Practical Supercomputer Architecture
IEEE Transactions on Parallel and Distributed Systems
A high-speed network interface for distributed-memory systems: architecture and applications
ACM Transactions on Computer Systems (TOCS)
The importance of synchronization structure in parallel program optimization
ICS '97 Proceedings of the 11th international conference on Supercomputing
A new model for integrated nested task and data parallel programming
PPOPP '97 Proceedings of the sixth ACM SIGPLAN symposium on Principles and practice of parallel programming
A Framework for Exploiting Task and Data Parallelism on Distributed Memory Multicomputers
IEEE Transactions on Parallel and Distributed Systems
The design, implementation, and evaluation of Jade
ACM Transactions on Programming Languages and Systems (TOPLAS)
A task- and data-parallel programming language based on shared objects
ACM Transactions on Programming Languages and Systems (TOPLAS)
Thread fork/join techniques for multi-level parallelism exploitation in NUMA multiprocessors
ICS '99 Proceedings of the 13th international conference on Supercomputing
Coordinating HPF programs to mix task and data parallelism
SAC '00 Proceedings of the 2000 ACM symposium on Applied computing - Volume 1
Double standards: bringing task parallelism to HPF via the message passing interface
Supercomputing '96 Proceedings of the 1996 ACM/IEEE conference on Supercomputing
A concurrency abstraction model for avoiding inheritance anomaly in object-oriented programs
Compiler optimizations for scalable parallel systems
A resource query interface for network-aware applications
Cluster Computing
Mixed data and task parallelism with HPF and PVM
Cluster Computing
Communication and memory requirements as the basis for mapping task and data parallel programs
Proceedings of the 1994 ACM/IEEE conference on Supercomputing
Network-Aware Parallel Computing with Remos
LCPC '98 Proceedings of the 11th International Workshop on Languages and Compilers for Parallel Computing
Exploiting Advanced Task Parallelism in High Performance Fortran via a Task Library
Euro-Par '99 Proceedings of the 5th International Euro-Par Conference on Parallel Processing
Compiling Data Parallel Tasks for Coordinated Execution
Euro-Par '99 Proceedings of the 5th International Euro-Par Conference on Parallel Processing
A Coordination Layer for Exploiting Task Parallelism with HPF
LCR '98 Selected Papers from the 4th International Workshop on Languages, Compilers, and Run-Time Systems for Scalable Computers
Task and data parallelism in P3L
Patterns and skeletons for parallel and distributed computing
Adaptive Distributed Applications on Heterogeneous Networks
HCW '99 Proceedings of the Eighth Heterogeneous Computing Workshop
Detection of Implicit Parallelisms in the Task Parallel Language
HPC-ASIA '97 Proceedings of the High-Performance Computing on the Information Superhighway, HPC-Asia '97
Airshed Pollution Modeling: A Case Study in Application Development in an HPF Environment
IPPS '98 Proceedings of the 12th. International Parallel Processing Symposium on International Parallel Processing Symposium
Run-Time Support for the Automatic Parallelization of Java Programs
The Journal of Supercomputing
OpenGR: A directive-based grid programming environment
Parallel Computing - OpenMp
Irregular computations in Fortran - expression and implementation strategies
Scientific Programming
PFunc: modern task parallelism for modern high performance computing
Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis
A runtime implementation of OpenMP tasks
IWOMP'11 Proceedings of the 7th international conference on OpenMP in the Petascale era
Hi-index | 0.00 |
Exploiting both data and task parallelism in a single framework is the key to achieving good performance for a variety of applications.