FCCM '02 Proceedings of the 10th Annual IEEE Symposium on Field-Programmable Custom Computing Machines
Dynamic Warp Formation and Scheduling for Efficient GPU Control Flow
Proceedings of the 40th Annual IEEE/ACM International Symposium on Microarchitecture
Optimization principles and application performance evaluation of a multithreaded GPU using CUDA
Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming
Parallel discrete element simulation of poly-dispersed granular material
Advances in Engineering Software
Scaling Benchmark of ESyS-Particle for Elastic Wave Propagation Simulations
ESCIENCE '10 Proceedings of the 2010 IEEE Sixth International Conference on e-Science
Hi-index | 0.00 |
Real-time solution of the Discrete Element Method is a computational challenge that is hardly achievable on standard PCs, especially when a large number of triangular shaped particles are involved. This paper presents a scalable architecture, including a domain decomposition technique, of a GPU accelerator for the two-dimensional Discrete Element Method for triangular shaped particles. This approach achieved a speed up of about 140 times as a single core and about 80 after domain decomposition on a consumer level GPU compared to a similar algorithm run on a fast desktop PC.