Real-time robot motion planning using rasterizing computer graphics hardware
SIGGRAPH '90 Proceedings of the 17th annual conference on Computer graphics and interactive techniques
Fast computation of generalized Voronoi diagrams using graphics hardware
Proceedings of the 26th annual conference on Computer graphics and interactive techniques
OpenGL Programming Guide: The Official Guide to Learning OpenGL, Version 1.2
OpenGL Programming Guide: The Official Guide to Learning OpenGL, Version 1.2
Physically-based visual simulation on graphics hardware
Proceedings of the ACM SIGGRAPH/EUROGRAPHICS conference on Graphics hardware
General Calculations using Graphics Hardware with Applications to Interactive Caustics
Proceedings of the Eurographics Workshop on Rendering Techniques 2000
Using modern graphics architectures for general-purpose computing: a framework and analysis
Proceedings of the 35th annual ACM/IEEE international symposium on Microarchitecture
Reducing the bandwidth of sparse symmetric matrices
ACM '69 Proceedings of the 1969 24th national conference
Linear algebra operators for GPU implementation of numerical algorithms
ACM SIGGRAPH 2003 Papers
Sparse matrix solvers on the GPU: conjugate gradients and multigrid
ACM SIGGRAPH 2003 Papers
Brook for GPUs: stream computing on graphics hardware
ACM SIGGRAPH 2004 Papers
GPU Cluster for High Performance Computing
Proceedings of the 2004 ACM/IEEE conference on Supercomputing
Concepts and Applications of Finite Element Analysis
Concepts and Applications of Finite Element Analysis
Exploring weak scalability for FEM calculations on a GPU-enhanced cluster
Parallel Computing
General purpose molecular dynamics simulations fully implemented on graphics processing units
Journal of Computational Physics
CSE '08 Proceedings of the 2008 11th IEEE International Conference on Computational Science and Engineering
International Journal of Parallel, Emergent and Distributed Systems
ISBMS '08 Proceedings of the 4th international symposium on Biomedical Simulation
Journal of Parallel and Distributed Computing
Nodal discontinuous Galerkin methods on graphics processors
Journal of Computational Physics
Scientific and Engineering Computing Using ATI Stream Technology
Computing in Science and Engineering
Hi-index | 31.45 |
An approach is developed to perform explicit time domain finite element simulations of elastodynamic problems on the graphical processing unit, using Nvidia@?s CUDA. Of critical importance for this problem is the arrangement of nodes in memory, allowing data to be loaded efficiently and minimising communication between the independently executed blocks of threads. The initial stage of memory arrangement is partitioning the mesh; both a well established 'greedy' partitioner and a new, more efficient 'aligned' partitioner are investigated. A method is then developed to efficiently arrange the memory within each partition. The software is applied to three models from the fields of non-destructive testing, vibrations and geophysics, demonstrating a memory bandwidth of very close to the card@?s maximum, reflecting the bandwidth-limited nature of the algorithm. Comparison with Abaqus, a widely used commercial CPU equivalent, validated the accuracy of the results and demonstrated a speed improvement of around two orders of magnitude. A software package, Pogo, incorporating these developments, is released open source, downloadable from http://www.pogo-fea.com/ to benefit the community.