Some Concepts of the Software Package FEAST
VECPAR '98 Selected Papers and Invited Talks from the Third International Conference on Vector and Parallel Processing
Multi-representation interaction for physically based modeling
Proceedings of the 2005 ACM symposium on Solid and physical modeling
Robust quasistatic finite elements and flesh simulation
Proceedings of the 2005 ACM SIGGRAPH/Eurographics symposium on Computer animation
A compiler for variational forms
ACM Transactions on Mathematical Software (TOMS)
GPGPU: general-purpose computation on graphics hardware
Proceedings of the 2006 ACM/IEEE conference on Supercomputing
ParFUM: a parallel framework for unstructured meshes for scalable dynamic physics applications
Engineering with Computers
General purpose molecular dynamics simulations fully implemented on graphics processing units
Journal of Computational Physics
A new physics engine with automatic process distribution between CPU-GPU
Sandbox '08 Proceedings of the 2008 ACM SIGGRAPH symposium on Video games
Gpu gems 3
A High Performance Massively Parallel Approach for Real Time Deformable Body Physics Simulation
SBAC-PAD '08 Proceedings of the 2008 20th International Symposium on Computer Architecture and High Performance Computing
From CPU to GP-GPU: challenges and insights in GPU-based environmental simulations
Proceedings of the 10th International Workshop on Middleware for Grids, Clouds and e-Science
Hi-index | 0.00 |
As general purpose computing on Graphics Processing Units (GPGPU) matures, more complicated scientific applications are being targeted to utilize the data-level parallelism available on a GPU. Implementing physically-based simulation on data-parallel hardware requires preprocessing overhead which affects application performance. We discuss our implementation of physics-based data structures that provide significant performance improvements when used on data-parallel hardware. These data structures allow us to maintain a physics-based abstraction of the underlying data, reduce programmer effort and obtain 6×-8× speedup over previously implemented GPU kernels.