A fast algorithm for particle simulations
Journal of Computational Physics
The design and analysis of spatial data structures
The design and analysis of spatial data structures
The parallel multipole method on the connection machine
SIAM Journal on Scientific and Statistical Computing
The order of Appel's algorithm
Information Processing Letters
Parallel hierarchical N-body methods
Parallel hierarchical N-body methods
Astrophysical N-body simulations using hierarchical tree data structures
Proceedings of the 1992 ACM/IEEE conference on Supercomputing
Skeletons from the treecode closet
Journal of Computational Physics
Journal of Parallel and Distributed Computing
Implications of hierarchical N-body methods for multiprocessor architectures
ACM Transactions on Computer Systems (TOCS)
Balancing processor loads and exploiting data locality in N-body simulations
Supercomputing '95 Proceedings of the 1995 ACM/IEEE conference on Supercomputing
Parallel matrix-vector product using approximate hierarchical methods
Supercomputing '95 Proceedings of the 1995 ACM/IEEE conference on Supercomputing
An empirical evaluation of the Convex SPP-1000 hierarchical shared memory system
PACT '95 Proceedings of the IFIP WG10.3 working conference on Parallel architectures and compilation techniques
Dynamic Partitioning of Non-Uniform Structured Workloads with Spacefilling Curves
IEEE Transactions on Parallel and Distributed Systems
High performance Fortran for highly irregular problems
PPOPP '97 Proceedings of the sixth ACM SIGPLAN symposium on Principles and practice of parallel programming
PPOPP '97 Proceedings of the sixth ACM SIGPLAN symposium on Principles and practice of parallel programming
PPOPP '97 Proceedings of the sixth ACM SIGPLAN symposium on Principles and practice of parallel programming
An evaluation of computing paradigms for N-body simulations on distributed memory architectures
Proceedings of the seventh ACM SIGPLAN symposium on Principles and practice of parallel programming
A semantics for imprecise exceptions
Proceedings of the ACM SIGPLAN 1999 conference on Programming language design and implementation
Improving memory hierarchy performance for irregular applications
ICS '99 Proceedings of the 13th international conference on Supercomputing
Nonlinear array layouts for hierarchical memory systems
ICS '99 Proceedings of the 13th international conference on Supercomputing
Recursive array layouts and fast parallel matrix multiplication
Proceedings of the eleventh annual ACM symposium on Parallel algorithms and architectures
A unifying data structure for hierarchical methods
SC '99 Proceedings of the 1999 ACM/IEEE conference on Supercomputing
Design of dynamic load-balancing tools for parallel applications
Proceedings of the 14th international conference on Supercomputing
Experiences with Parallel N-Body Simulation
IEEE Transactions on Parallel and Distributed Systems
A data-parallel implementation of O(N) hierarchical N-body methods
Supercomputing '96 Proceedings of the 1996 ACM/IEEE conference on Supercomputing
Parallel hierarchical solvers and preconditioners for boundary element methods
Supercomputing '96 Proceedings of the 1996 ACM/IEEE conference on Supercomputing
Proceedings of the 2000 ACM/IEEE conference on Supercomputing
Language support for Morton-order matrices
PPoPP '01 Proceedings of the eighth ACM SIGPLAN symposium on Principles and practices of parallel programming
SPMD execution in the presence of dynamic data structures
Compiler optimizations for scalable parallel systems
Compression of particle data from hierarchical approximate methods
ACM Transactions on Mathematical Software (TOMS)
A hierarchical load-balancing framework for dynamic multithreaded computations
SC '98 Proceedings of the 1998 ACM/IEEE conference on Supercomputing
Analyzing the error bounds of multipole-based treecodes
SC '98 Proceedings of the 1998 ACM/IEEE conference on Supercomputing
Avalon: an Alpha/Linux cluster achieves 10 Gflops for $15k
SC '98 Proceedings of the 1998 ACM/IEEE conference on Supercomputing
Highly portable and efficient implementations of parallel adaptive N-body methods
SC '97 Proceedings of the 1997 ACM/IEEE conference on Supercomputing
A performance comparison of tree data structures for N-body simulation
Journal of Computational Physics
Truly distribution-independent algorithms for the N-body problem
Proceedings of the 1994 ACM/IEEE conference on Supercomputing
Scalable parallel formulations of the Barnes-Hut method for N-body simulations
Proceedings of the 1994 ACM/IEEE conference on Supercomputing
Distribution-Independent Hierarchical Algorithms for the N-body Problem
The Journal of Supercomputing
International Journal of Parallel Programming
Recursive Array Layouts and Fast Matrix Multiplication
IEEE Transactions on Parallel and Distributed Systems
An Application-Centric Characterization of Domain-Based SFC Partitioners for Parallel SAMR
IEEE Transactions on Parallel and Distributed Systems
HiPC '01 Proceedings of the 8th International Conference on High Performance Computing
A Versatile Simulation Model for Hierarchical Treecodes
ICCS '02 Proceedings of the International Conference on Computational Science-Part I
A Framework for Parallel Tree-Based Scientific Simulations
ICPP '97 Proceedings of the international Conference on Parallel Processing
Parallelization of Irregular Problems Based on Hierarchical Domain Representation
HPCN Europe 2000 Proceedings of the 8th International Conference on High-Performance Computing and Networking
IPPS '96 Proceedings of the 10th International Parallel Processing Symposium
Load Balancing Highly Irregular Computations with the Adaptive Factoring
IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
Next Generation System Software for Future High-End Computing Systems
IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
Performance of Scheduling Scientific Applications with Adaptive Weighted Factoring
IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
Ahnentafel Indexing into Morton-Ordered Arrays, or Matrix Locality for Free
Euro-Par '00 Proceedings from the 6th International Euro-Par Conference on Parallel Processing
A Cost Optimal Parallel Algorithm for Computing Force Field in N-Body Simulations
COCOON '98 Proceedings of the 4th Annual International Conference on Computing and Combinatorics
Proceedings of the 8th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
High-density computing: a 240-processor Beowulf in one cubic meter
Proceedings of the 2002 ACM/IEEE conference on Supercomputing
Distributed dynamic hash tables using IBM LAPI
Proceedings of the 2002 ACM/IEEE conference on Supercomputing
Proceedings of the 2002 ACM/IEEE conference on Supercomputing
An Initial evaluation of the Convex SPP-1000 for Earth and Space Science Applications
HPCA '95 Proceedings of the 1st IEEE Symposium on High-Performance Computer Architecture
ICCD '00 Proceedings of the 2000 IEEE International Conference on Computer Design: VLSI in Computers & Processors
IWIA '99 Proceedings of the 1999 International Workshop on Innovative Architecture
IPPS '98 Proceedings of the 12th International Parallel Processing Symposium
Sourcebook of parallel computing
Multipole-based preconditioners for large sparse linear systems
Parallel Computing - Parallel matrix algorithms and applications (PMAA '02)
Solving irregularly structured problems based on distributed object model
Parallel Computing - Special issue: Parallel and distributed scientific and engineering computing
Proceedings of the 18th annual international conference on Supercomputing
A New Parallel Kernel-Independent Fast Multipole Method
Proceedings of the 2003 ACM/IEEE conference on Supercomputing
The Space Simulator: Modeling the Universe from Supernovae to Cosmology
Proceedings of the 2003 ACM/IEEE conference on Supercomputing
Overhead Analysis of a Dynamic Load Balancing Library for Cluster Computing
IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Workshop 1 - Volume 02
New challenges in dynamic load balancing
Applied Numerical Mathematics - Adaptive methods for partial differential equations and large-scale computation
Scalable Parallel Octree Meshing for TeraScale Applications
SC '05 Proceedings of the 2005 ACM/IEEE conference on Supercomputing
A Load Balancing Tool for Distributed Parallel Loops
Cluster Computing
A refinement-tree based partitioning method for dynamic load balancing with adaptively refined grids
Journal of Parallel and Distributed Computing
Irregular computations in Fortran - expression and implementation strategies
Scientific Programming
An HPC component for parallel, heterogeneous, and dynamic unstructured meshes
Proceedings of the 2007 symposium on Component and framework technology in high-performance and scientific computing
Journal of Computational Physics
Applied Numerical Mathematics
Performance evaluation of a dynamic load-balancing library for cluster computing
International Journal of Computational Science and Engineering
Large Scale Three-Dimensional Boundary Element Simulation of Subduction
ICCS '07 Proceedings of the 7th international conference on Computational Science, Part III: ICCS 2007
A repartitioning hypergraph model for dynamic load balancing
Journal of Parallel and Distributed Computing
Hiding Communication Latency with Non-SPMD, Graph-Based Execution
ICCS '09 Proceedings of the 9th International Conference on Computational Science: Part I
Parallelization Strategies for Mixed Regular-Irregular Applications on Multicore-Systems
APPT '09 Proceedings of the 8th International Symposium on Advanced Parallel Processing Technologies
A massively parallel adaptive fast-multipole method on heterogeneous architectures
Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis
Adaptive scheduling of parallel computations for SPMD tasks
ICCSA'07 Proceedings of the 2007 international conference on Computational science and Its applications - Volume Part II
Diagnosis, Tuning, and Redesign for Multicore Performance: A Case Study of the Fast Multipole Method
Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis
Exploring a Novel Gathering Method for Finite Element Codes on the Cell/B.E. Architecture
Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis
Optimizing the Barnes-Hut algorithm in UPC
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
Journal of Computational Physics
Hierarchical partitioning and dynamic load balancing for scientific computation
PARA'04 Proceedings of the 7th international conference on Applied Parallel Computing: state of the Art in Scientific Computing
A massively parallel adaptive fast multipole method on heterogeneous architectures
Communications of the ACM
High performance BLAS formulation of the adaptive Fast Multipole Method
Mathematical and Computer Modelling: An International Journal
Quantifying the effectiveness of load balance algorithms
Proceedings of the 26th ACM international conference on Supercomputing
Approximate covering detection among content-based subscriptions using space filling curves
Journal of Parallel and Distributed Computing
Heuristic static load-balancing algorithm applied to the fragment molecular orbital method
SC '12 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
Hierarchical task mapping of cell-based AMR cosmology simulations
SC '12 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
A massively space-time parallel N-body solver
SC '12 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
Vectorized algorithms for quadtree construction and descent
ICA3PP'12 Proceedings of the 12th international conference on Algorithms and Architectures for Parallel Processing - Volume Part I
A tuned and scalable fast multipole method as a preeminent algorithm for exascale systems
International Journal of High Performance Computing Applications
Extending the scope of the controlled logical clock
Cluster Computing
A parallel and incremental extraction of variational capacitance with stochastic geometric moments
IEEE Transactions on Very Large Scale Integration (VLSI) Systems
2HOT: an improved parallel hashed oct-tree N-body algorithm for cosmological simulation
SC '13 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
Pragmatic optimizations for better scientific utilization of large supercomputers
International Journal of High Performance Computing Applications
AusPDC '12 Proceedings of the Tenth Australasian Symposium on Parallel and Distributed Computing - Volume 127
A CPU-GPU Hybrid Implementation and Model-Driven Scheduling of the Fast Multipole Method
Proceedings of Workshop on General Purpose Processing Using GPUs
A multi-threaded algorithm for computing the largest non-colliding moving geometry
Computer-Aided Design
Scientific Programming - A New Overview of the Trilinos Project --Part 1