Preserving a user’s location privacy poses many challenges, and they have become more pressing than ever with the proliferation of handheld devices and the pervasive use of Location-based Services. It is not possible to access Location-based ...