Compiling collection-oriented languages onto massively parallel computers
Journal of Parallel and Distributed Computing - Massively parallel computation
Partitioning sparse matrices with eigenvectors of graphs
SIAM Journal on Matrix Analysis and Applications
Normalized convergence rates for the PSMG method
SIAM Journal on Scientific and Statistical Computing
The parallel multipole method on the connection machine
SIAM Journal on Scientific and Statistical Computing
An implementation of the fast multipole method without multipoles
SIAM Journal on Scientific and Statistical Computing
The order of Appel's algorithm
Information Processing Letters
Parallel hierarchical N-body methods
Parallel hierarchical N-body methods
A parallel hashed Oct-Tree N-body algorithm
Proceedings of the 1993 ACM/IEEE conference on Supercomputing
A parallel adaptive fast multipole method
Proceedings of the 1993 ACM/IEEE conference on Supercomputing
The parallel implementation of N-body algorithms
The parallel implementation of N-body algorithms
A multilevel algorithm for partitioning graphs
Supercomputing '95 Proceedings of the 1995 ACM/IEEE conference on Supercomputing
Fast and parallel mapping algorithms for irregular problems
The Journal of Supercomputing
Implementing O(N) N-body algorithms efficiently in data-parallel languages
Scientific Programming
Provably Good Partitioning and Load Balancing Algorithms for Parallel Adaptive N-Body Simulation
SIAM Journal on Scientific Computing
Spectral partitioning works: planar graphs and finite element meshes
FOCS '96 Proceedings of the 37th Annual Symposium on Foundations of Computer Science
An {\it bf O(N)} Algorithm for Three-Dimensional N-body Simulations
An {\'it bf O(N)} Algorithm for Three-Dimensional N-body Simulations
An evaluation of computing paradigms for N-body simulations on distributed memory architectures
Proceedings of the seventh ACM SIGPLAN symposium on Principles and practice of parallel programming
A semantics for imprecise exceptions
Proceedings of the ACM SIGPLAN 1999 conference on Programming language design and implementation
Nonlinear array layouts for hierarchical memory systems
ICS '99 Proceedings of the 13th international conference on Supercomputing
Recursive array layouts and fast parallel matrix multiplication
Proceedings of the eleventh annual ACM symposium on Parallel algorithms and architectures
HPFBench: a high performance Fortran benchmark suite
ACM Transactions on Mathematical Software (TOMS)
Highly portable and efficient implementations of parallel adaptive N-body methods
SC '97 Proceedings of the 1997 ACM/IEEE conference on Supercomputing
Recursive Array Layouts and Fast Matrix Multiplication
IEEE Transactions on Parallel and Distributed Systems
A Data Parallel Formulation of the Barnes-Hut Method for N -Body Simulations
PARA '00 Proceedings of the 5th International Workshop on Applied Parallel Computing, New Paradigms for HPC in Industry and Academia
Reducing Communication Cost for Parallelizing Irregular Scientific Codes
PARA '02 Proceedings of the 6th International Conference on Applied Parallel Computing Advanced Scientific Computing
Ahnentafel Indexing into Morton-Ordered Arrays, or Matrix Locality for Free
Euro-Par '00 Proceedings from the 6th International Euro-Par Conference on Parallel Processing
An Evaluation of High Performance Fortran Compilers Using the HPFBench Benchmark Suite
Euro-Par '99 Proceedings of the 5th International Euro-Par Conference on Parallel Processing
Expressing Irregular Computations in Modern Fortran Dialects
LCR '98 Selected Papers from the 4th International Workshop on Languages, Compilers, and Run-Time Systems for Scalable Computers
A Comparison of Locality Transformations for Irregular Codes
LCR '00 Selected Papers from the 5th International Workshop on Languages, Compilers, and Run-Time Systems for Scalable Computers
Flexible Data Distribution in PGHPF
Euro-Par '99 Proceedings of the 5th International Euro-Par Conference on Parallel Processing
Radio-wave propagation prediction using ray-tracing techniques on a network of workstations (NOW)
Journal of Parallel and Distributed Computing
Massively parallel implementation of a fast multipole method for distributed memory machines
Journal of Parallel and Distributed Computing
Irregular computations in Fortran - expression and implementation strategies
Scientific Programming
Hi-index | 0.00 |
We present a general data parallel formulation for highly irregular problems in High Performance Fortran (HPF). Our formulation consists of(1) a method for linearizing irregular data structures (2) a data parallel implementation (in HPF) of graph partitioning algorithms applied to the linearized data structure, (3) techniques for expressing irregular communication and nonuniform computations associated with the elements of linearized data structures.We demonstrate and evaluate our formulation on a parallel, hierarchical N--body method for the evaluation of potentials and forces of nonuniform particle distributions. Our experimental results demonstrate that efficient data parallel (HPF) implementations of highly nonuniform problems are feasible with the proper language/compiler/runtime support. Our data parallel N--body code provides a much needed "benchmark" code for evaluating and improving HPF compilers.