Fortran at ten gigaflops: the connection machine convolution compiler

Authors:
Mark Bromley;Steven Heller;Tim McNerney;Guy L. Steele, Jr.
Affiliations:
Thinking Machines Corporation, 245 First Street, Cambridge, Massachusetts;Thinking Machines Corporation, 245 First Street, Cambridge, Massachusetts;Thinking Machines Corporation, 245 First Street, Cambridge, Massachusetts;Thinking Machines Corporation, 245 First Street, Cambridge, Massachusetts
Venue:
PLDI '91 Proceedings of the ACM SIGPLAN 1991 conference on Programming language design and implementation
Year:
1991

Citing 9
Cited 29

The connection machine

The connection machine
On the problem of optimizing data transfers for complex memory systems

ICS '88 Proceedings of the 2nd international conference on Supercomputing
Hypercube performance for 2-D seismic finite-difference modeling

C3P Proceedings of the third conference on Hypercube concurrent computers and applications - Volume 2
Strategies for cache and local memory management by global program transformation

Proceedings of the 1st International Conference on Supercomputing
More iteration space tiling

Proceedings of the 1989 ACM/IEEE conference on Supercomputing
Supporting shared data structures on distributed memory architectures

PPOPP '90 Proceedings of the second ACM SIGPLAN symposium on Principles & practice of parallel programming
Compiler techniques for data partitioning of sequentially iterated parallel loops

ICS '90 Proceedings of the 4th international conference on Supercomputing
Optimizing Supercompilers for Supercomputers

Optimizing Supercompilers for Supercomputers
The 1990 Gordon Bell Prize Winners

IEEE Software

Seismic modeling at 14 gigaflops on the connection machine

Proceedings of the 1991 ACM/IEEE conference on Supercomputing
Prototyping Fortran-90 compilers for massively parallel machines

PLDI '92 Proceedings of the ACM SIGPLAN 1992 conference on Programming language design and implementation
Evaluation of compiler optimizations for Fortran D on MIMD distributed memory machines

ICS '92 Proceedings of the 6th international conference on Supercomputing
The performance realities of massively parallel processors: a case study

Proceedings of the 1992 ACM/IEEE conference on Supercomputing
Communication optimization and code generation for distributed memory machines

PLDI '93 Proceedings of the ACM SIGPLAN 1993 conference on Programming language design and implementation
Unified compilation of Fortran 77D and 90D

ACM Letters on Programming Languages and Systems (LOPLAS)
Compiler transformations for high-performance computing

ACM Computing Surveys (CSUR)
Distributed data access in AC

PPOPP '95 Proceedings of the fifth ACM SIGPLAN symposium on Principles and practice of parallel programming
Automatic optimization of communication in compiling out-of-core stencil codes

ICS '96 Proceedings of the 10th international conference on Supercomputing
Evaluating uniform expressions within two steps of minimum parallel time

Journal of the ACM (JACM)
Compiled communication for all-optical TDM networks

Supercomputing '96 Proceedings of the 1996 ACM/IEEE conference on Supercomputing
Eliminating redundancies in sum-of-product array computations

ICS '01 Proceedings of the 15th international conference on Supercomputing
Data Relation Vectors: A New Abstraction for Data Optimizations

IEEE Transactions on Computers - Special issue on the parallel architecture and compilation techniques conference
Compiling stencils in high performance Fortran

SC '97 Proceedings of the 1997 ACM/IEEE conference on Supercomputing
Experiences tuning SMG98: a semicoarsening multigrid benchmark based on the hypre library

ICS '02 Proceedings of the 16th international conference on Supercomputing
An experimental APL compiler for a distributed memory parallel machine

Proceedings of the 1994 ACM/IEEE conference on Supercomputing
Algorithms for Supporting Compiled Communication

IEEE Transactions on Parallel and Distributed Systems
Cache-Efficient Multigrid Algorithms

ICCS '01 Proceedings of the International Conference on Computational Sciences-Part I
CC--MPI: a compiled communication capable MPI prototype for ethernet switched clusters

Proceedings of the ninth ACM SIGPLAN symposium on Principles and practice of parallel programming
Optimizing aggregate array computations in loops

ACM Transactions on Programming Languages and Systems (TOPLAS)
Cache-Efficient Multigrid Algorithms

International Journal of High Performance Computing Applications
Optimizing inter-processor data locality on embedded chip multiprocessors

Proceedings of the 5th ACM international conference on Embedded software
Automatic benchmark generation for cache optimization of matrix operations

ACM-SE 33 Proceedings of the 33rd annual on Southeast regional conference
An Approach for Enhancing Inter-processor Data Locality on Chip Multiprocessors

Transactions on High-Performance Embedded Architectures and Compilers I
Performance modeling and automatic ghost zone optimization for iterative stencil loops on GPUs

Proceedings of the 23rd international conference on Supercomputing
A Multilevel Parallelization Framework for High-Order Stencil Computations

Euro-Par '09 Proceedings of the 15th International Euro-Par Conference on Parallel Processing
The pochoir stencil compiler

Proceedings of the twenty-third annual ACM symposium on Parallelism in algorithms and architectures
A primitive-based strategy for producing efficient code for very high level programs

Computer Languages
Hierarchical parallelization and optimization of high-order stencil computations on multicore clusters

The Journal of Supercomputing

Quantified Score

Hi-index	0.01

Fortran at ten gigaflops: the connection machine convolution compiler

Quantified Score

Visualization

Abstract