Microprocessors in the era of terascale integration
Proceedings of the conference on Design, automation and test in Europe
Proceedings of the 44th annual Design Automation Conference
An efficient decoding of CAVLC in H.264/AVC video coding standard
IEEE Transactions on Consumer Electronics
Memory-efficient H.264/AVC CAVLC for fast decoding
IEEE Transactions on Consumer Electronics
H.264/AVC baseline profile decoder complexity analysis
IEEE Transactions on Circuits and Systems for Video Technology
Design and implementation of parallel video encoding strategies using divisible load analysis
IEEE Transactions on Circuits and Systems for Video Technology
A QHD-capable parallel H.264 decoder
Proceedings of the international conference on Supercomputing
Hi-index | 0.00 |
Diverse approaches to parallel implementation of H.264 have been proposed; however, they all share a common problem. The entropy decoder in H.264 remains mapped on a single processing element (PE). Due to the inherently sequential and context-adaptive nature of the entropy decoder, it cannot be parallelized. This renders a bottleneck to the performance of the entire decoding process. Depending on the type of the processing core and the video bit-rate, the performance of the entire decoding process is subject to the process of entropy decoding. It is, therefore, needful to research and implement new algorithmic solutions to compensate for this bottleneck, and thereby make optimal use of parallel implementation of H.264 decoder on mainstream multi-core systems. This paper presents a new CAVLC decoding method which is de-rived by constructing custom CAVLC decoding tables using 'table grouping'. Compared to the conventional [5] 'sequential table look-up' method, which requires multiple memory accesses. Our proposed method accesses the custom tables only once for the decoding of any symbol Moreover, in our proposed method, the symbol decoding time does not depend on the symbol length and it is constant for each symbol, resulting in a nearly linear increase in computational complexity with increase in video fidelity as compared to an non linear increase in earlier proposed methods. Experimental results show that our proposed algorithm features up to 7x higher performance and 83% less memory accesses compared to conventional methods. We compare to three commonly used, state-of-the-art CAVLC algorithms, such as table look-up by sequential search [5], table look-up by binary search [9], and "Moon's method" [16].