Experimental Study of Compiler Techniques for NUMA Machines

Authors:
Affiliations:
Venue:
IPPS '98 Proceedings of the 12th. International Parallel Processing Symposium on International Parallel Processing Symposium
Year:
1998

Citing 7
Cited 4

AP1000+: architectural support of PUT/GET interface for parallelizing compiler

ASPLOS VI Proceedings of the sixth international conference on Architectural support for programming languages and operating systems
Automatic data layout for high performance Fortran

Supercomputing '95 Proceedings of the 1995 ACM/IEEE conference on Supercomputing
Compiling for distributed memory multiprocessors based on access region analysis

Compiling for distributed memory multiprocessors based on access region analysis
Parallel Programming with Polaris

Computer
Compiler Techniques for Effective Communication on Distributed-Memory Multiprocessors

ICPP '97 Proceedings of the international Conference on Parallel Processing
Detecting and Using Affinity in an Automatic Data Distribution Tool

LCPC '94 Proceedings of the 7th International Workshop on Languages and Compilers for Parallel Computing
Automatic Selection of Dynamic Data Partitioning Schemes for Distributed-Memory Multicomputers

LCPC '95 Proceedings of the 8th International Workshop on Languages and Compilers for Parallel Computing

Simplification of array access patterns for compiler optimizations

PLDI '98 Proceedings of the ACM SIGPLAN 1998 conference on Programming language design and implementation
Performance Metrics for Embedded Parallel Pipelines

IEEE Transactions on Parallel and Distributed Systems
Efficient and precise array access analysis

ACM Transactions on Programming Languages and Systems (TOPLAS)
An Advanced Compiler Framework for Non-Cache-Coherent Multiprocessors

IEEE Transactions on Parallel and Distributed Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

This study explores the applicability of fully automatic parallelizing techniques for parallel computers. In this study, we capitalize on a variety of traditional compiling techniques as well as new techniques developed specifically for distributed memory architectures. Combining these traditional and new techniques, we conducted experiments with several benchmark programs on the Cray T3D.