A fast algorithm for particle simulations
Journal of Computational Physics
Journal of Parallel and Distributed Computing
Provably Good Partitioning and Load Balancing Algorithms for Parallel Adaptive N-Body Simulation
SIAM Journal on Scientific Computing
Parallel Hierarchical Solvers and Preconditioners for Boundary Element Methods
SIAM Journal on Scientific Computing
A solenoidal basis method for efficient inductance extraction
Proceedings of the 39th annual Design Automation Conference
Iterative Methods for Sparse Linear Systems
Iterative Methods for Sparse Linear Systems
A Provably Optimal, Distribution-Independent Parallel Fast Multipole Method
IPDPS '00 Proceedings of the 14th International Symposium on Parallel and Distributed Processing
Parallel iterative methods for dense linear systems in inductance extraction
Parallel Computing - Parallel matrix algorithms and applications (PMAA '02)
Parallel Software for Inductance Extraction
ICPP '04 Proceedings of the 2004 International Conference on Parallel Processing
Parallel algorithms for inductance extraction of VLSI circuits
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
Hi-index | 0.00 |
Parasitic extraction techniques are used to estimate signal delay in VLSI chips Inductance extraction is a critical component of the parasitic extraction process in which on-chip inductive effects are estimated with high accuracy In earlier work [1], we described a parallel software package for inductance extraction called ParIS, which uses a novel preconditioned iterative method to solve the dense, complex linear system of equations arising in these problems The most computationally challenging task in ParIS involves computing dense matrix-vector products efficiently via hierarchical multipole-based approximation techniques This paper presents a comparative study of two such techniques: a hierarchical algorithm called Hierarchical Multipole Method (HMM) and the well-known Fast Multipole Method (FMM) We investigate the performance of parallel MPI-based implementations of these algorithms on a Linux cluster We analyze the impact of various algorithmic parameters and identify regimes where HMM is expected to outperform FMM on uniprocessor as well as multiprocessor platforms.