Using MPI (2nd ed.): portable parallel programming with the message-passing interface
Using MPI (2nd ed.): portable parallel programming with the message-passing interface
OpenMP: An Industry-Standard API for Shared-Memory Programming
IEEE Computational Science & Engineering
Exploring weak scalability for FEM calculations on a GPU-enhanced cluster
Parallel Computing
A performance study of general-purpose applications on graphics processors using CUDA
Journal of Parallel and Distributed Computing
Program optimization carving for GPU computing
Journal of Parallel and Distributed Computing
Advances in Engineering Software
Programming Massively Parallel Processors: A Hands-on Approach
Programming Massively Parallel Processors: A Hands-on Approach
Hi-index | 0.00 |
Sheet forming simulation is very important for vehicle body design. Due to the increase of complexity and scale of the CAE model, a tradeoff between the accuracy and efficiency become the bottleneck for application. Therefore, a parallel explicit finite element (FE) based on graphics processing unit (GPU) architecture for sheet forming is developed. Implementation details with computer unified device architecture (CUDA) are considered in this work. A pre-index strategy is suggested for parallelization of nodal force assembling. Parallel reduction method is introduced to calculation of the global time step. To ensure the reliability and accuracy of the GPU-based program, double precision floating and intrinsic functions are implemented for the explicit FE computing. The simulation results based on a commercial NVIDIA GTX285 device can obtain about 27X speedup than on a Intel Q8200 CPU, which demonstrates the efficiency of the parallel sheet forming simulation system.