Media Processing Applications on the Imagine Stream Processor
ICCD '02 Proceedings of the 2002 IEEE International Conference on Computer Design: VLSI in Computers and Processors (ICCD'02)
Linear algebra operators for GPU implementation of numerical algorithms
ACM SIGGRAPH 2003 Papers
UberFlow: a GPU-based particle engine
Proceedings of the ACM SIGGRAPH/EUROGRAPHICS conference on Graphics hardware
Understanding the efficiency of GPU algorithms for matrix-matrix multiplication
Proceedings of the ACM SIGGRAPH/EUROGRAPHICS conference on Graphics hardware
High Performance Array Processor for Video Decoding
ISVLSI '05 Proceedings of the IEEE Computer Society Annual Symposium on VLSI: New Frontiers in VLSI Design
A Particle System for Interactive Visualization of 3D Flows
IEEE Transactions on Visualization and Computer Graphics
GPU-assisted decoding of video samples represented in the YCoCg-R color space
Proceedings of the 13th annual ACM international conference on Multimedia
A survey of point-based techniques in computer graphics
Computers and Graphics
Accelerate video decoding with generic GPU
IEEE Transactions on Circuits and Systems for Video Technology
Proceedings of the Conference on High Performance Graphics 2009
Proceedings of the Conference on High Performance Graphics 2009
Hi-index | 0.00 |
To accelerate computation intensive video decoding tasks, we present a novel framework to offload most decoding operations to current GPUs. Our method is based on rendering graphics points and suitable for block-based video standards. By representing video blocks as graphics points, we achieve great flexibility and high parallelism to utilize the GPU's pipelined stream processing architecture. The computational resources within texture units and blending units are also exploited to facilitate computations. We propose a high performance implementation of IDCT on GPUs, which efficiently excludes most zero-value coefficients to save the bandwidth and the computations. Compared with the existing quad-based representation, our point based implementation of MC greatly reduces data transfer and redundancy. We have demonstrated the efficiency of our proposed framework by a MPEG-2 decoder. Our results indicate a significant improvement over prior CPU and GPU solutions.