Optimizing the Evaluation of Finite Element Matrices
SIAM Journal on Scientific Computing
A compiler for variational forms
ACM Transactions on Mathematical Software (TOMS)
Optimization principles and application performance evaluation of a multithreaded GPU using CUDA
Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming
Stencil computation optimization and auto-tuning on state-of-the-art multicore architectures
Proceedings of the 2008 ACM/IEEE conference on Supercomputing
Journal of Parallel and Distributed Computing
Nodal discontinuous Galerkin methods on graphics processors
Journal of Computational Physics
Automated Code Generation for Discontinuous Galerkin Methods
SIAM Journal on Scientific Computing
FEAST—realization of hardware-oriented numerics for HPC simulations with finite elements
Concurrency and Computation: Practice & Experience - International Supercomputing Conference
Novel Architectures: Solving Computational Problems with GPU Computing
Computing in Science and Engineering
Automated Solution of Differential Equations by the Finite Element Method: The FEniCS Book
Automated Solution of Differential Equations by the Finite Element Method: The FEniCS Book
Vectorized OpenCL implementation of numerical integration for higher order finite elements
Computers & Mathematics with Applications
Numerical integration on GPUs for higher order finite elements
Computers & Mathematics with Applications
Hi-index | 0.00 |
We present a novel finite element integration method for low-order elements on GPUs. We achieve more than 100GF for element integration on first order discretizations of both the Laplacian and Elasticity operators on an NVIDIA GTX285, which has a nominal single precision peak flop rate of 1 TF/s and bandwidth of 159 GB/s, corresponding to a bandwidth limited peak of 40 GF/s.