Locality-improved FFT implementation on a graphics processor
ISCGAV'07 Proceedings of the 7th WSEAS International Conference on Signal Processing, Computational Geometry & Artificial Vision
Spatial sound for video games and virtual environments utilizing real-time GPU-based convolution
Future Play '08 Proceedings of the 2008 Conference on Future Play: Research, Play, Share
Memory Locality Exploitation Strategies for FFT on the CUDA Architecture
High Performance Computing for Computational Science - VECPAR 2008
GPU acceleration of a production molecular docking code
Proceedings of 2nd Workshop on General Purpose Processing on Graphics Processing Units
Experiences with Mapping Non-linear Memory Access Patterns into GPUs
ICCS '09 Proceedings of the 9th International Conference on Computational Science: Part I
Highly parallel rate-distortion optimized intra-mode decision on multicore graphics processors
IEEE Transactions on Circuits and Systems for Video Technology
Intra frame encoding using programmable graphics hardware
PCM'07 Proceedings of the multimedia 8th Pacific Rim conference on Advances in multimedia information processing
High-quality HDR rendering technologies for emerging applications
IBM Journal of Research and Development
Data-intensive spatial filtering in large numerical simulation datasets
SC '12 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
GPU optimization of convolution for large 3-d real images
ACIVS'12 Proceedings of the 14th international conference on Advanced Concepts for Intelligent Vision Systems
Energy-Aware real-time face recognition system on mobile CPU-GPU platform
ECCV'10 Proceedings of the 11th European conference on Trends and Topics in Computer Vision - Volume Part II
Optimizing 3d convolutions for wavelet transforms on CPUs with SSE units and GPUs
Euro-Par'13 Proceedings of the 19th international conference on Parallel Processing
Fast computation of convolution operations via low-rank approximation
Applied Numerical Mathematics
Optimizing convolution operations on GPUs using adaptive tiling
Future Generation Computer Systems
Hi-index | 0.00 |
Many contemporary visualization tools comprise some image filtering approach. Since image filtering approaches are very computationally demanding, the acceleration using graphics-hardware (GPU) is very desirable to preserve interactivity of the main visualization tool itself. In this article we take a close look on GPU implementation of two basic approaches to image filtering - Fast Fourier Transform (frequency domain) and convolution (spatial domain). We evaluate these methods in terms of the performance in real time applications and suitability for GPU implementation. Convolution yields better performance than Fast Fourier Transform (FFT) in many cases; however, this observation cannot be generalized. In this article we identify conditions under which the FFT gives better performance than the corresponding convolution and we assess the different kernel sizes and issues of application of multiple filters on one image.