Summed-area tables for texture mapping
SIGGRAPH '84 Proceedings of the 11th annual conference on Computer graphics and interactive techniques
Robust Real-Time Face Detection
International Journal of Computer Vision
Hi-index | 0.00 |
Summed-Area table algorithm is also known as image integral algorithm. It is often used for quickly and efficiently generating the sum of values in a rectangular subset of a grid. Our work is based on the OpenCL framework. We have studied various kinds of optimization methods mainly on AMD GPUs. In this paper, we first implemented an efficient prefix sum algorithm. Then we described how to use vectors in detail. We also adopted many other skills. For instance, a workgroup calculates the entire column by using a loop and each workgroup calculates multi-columns. The results show that the optimized algorithm got a good performance on both NVIDIA platform and AMD platform. On the NVIDIA Tesla C2050 GPU, we got a 33% performance boost compared to CUDA NPP. On the AMD HD 5850 platform, the average performance has reached 4.21 times compared to the appropriate CPU version function in OpenCV 2.3.