Fast multipole method on GPU: tackling 3-D capacitance extraction on massively parallel SIMD platforms

Authors:
Xueqian Zhao;Zhuo Feng
Affiliations:
Michigan Technological University, Houghton, MI;Michigan Technological University, Houghton, MI
Venue:
Proceedings of the 48th Design Automation Conference
Year:
2011

Citing 7
Cited 1

A fast algorithm for particle simulations

Journal of Computational Physics
A fast hierarchical algorithm for 3-D capacitance extraction

DAC '98 Proceedings of the 35th annual Design Automation Conference
Fast multipole methods on graphics processors

Journal of Computational Physics
PiCAP: a parallel and incremental capacitance extraction considering stochastic process variation

Proceedings of the 46th Annual Design Automation Conference
42 TFlops hierarchical N-body simulations on GPUs with applications in both astrophysics and turbulence

Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis
A hierarchical floating random walk algorithm for fabric-aware 3D capacitance extraction

Proceedings of the 2009 International Conference on Computer-Aided Design
A precorrected-FFT method for electrostatic analysis of complicated 3-D structures

IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems

GPU-friendly floating random walk algorithm for capacitance extraction of VLSI interconnects

Proceedings of the Conference on Design, Automation and Test in Europe

Quantified Score

Hi-index	0.00

Visualization

Abstract

To facilitate full chip capacitance extraction, field solvers are typically deployed for characterizing capacitance libraries for various interconnect structures and configurations. In the past decades, various algorithms for accelerating boundary element methods (BEM) have been developed to improve the efficiency of field solvers for capacitance extraction. This paper presents the first massively parallel capacitance extraction algorithm FMMGpu that accelerates the well-known fast multipole methods (FMM) on modern Graphics Processing Units (GPUs). We propose GPU-friendly data structures and SIMD parallel algorithm flows to facilitate the FMM-based 3-D capacitance extraction on GPU. Effective GPU performance modeling methods are also proposed to properly balance the workload of each critical kernel in our FMMGpu implementation, by taking advantage of the latest Fermi GPU's concurrent kernel executions on streaming multiprocessors (SMs). Our experimental results show that FMMGpu brings 22X to 30X speedups in capacitance extractions for various test cases. We also show that even for small test cases that may not well utilize GPU's hardware resources, the proposed cube clustering and workload balancing techniques can bring 20% to 60% extra performance improvements.