Efficient constant-time entropy decoding for H.264

Authors:
Nabeel Iqbal;Jörg Henkel
Affiliations:
University of Karlsruhe, Karlsruhe, Germany;University of Karlsruhe, Karlsruhe, Germany
Venue:
Proceedings of the Conference on Design, Automation and Test in Europe
Year:
2009

Citing 6
Cited 1

Microprocessors in the era of terascale integration

Proceedings of the conference on Design, automation and test in Europe
The kill rule for multicore

Proceedings of the 44th annual Design Automation Conference
An efficient decoding of CAVLC in H.264/AVC video coding standard

IEEE Transactions on Consumer Electronics
Memory-efficient H.264/AVC CAVLC for fast decoding

IEEE Transactions on Consumer Electronics
H.264/AVC baseline profile decoder complexity analysis

IEEE Transactions on Circuits and Systems for Video Technology
Design and implementation of parallel video encoding strategies using divisible load analysis

IEEE Transactions on Circuits and Systems for Video Technology

A QHD-capable parallel H.264 decoder

Proceedings of the international conference on Supercomputing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Diverse approaches to parallel implementation of H.264 have been proposed; however, they all share a common problem. The entropy decoder in H.264 remains mapped on a single processing element (PE). Due to the inherently sequential and context-adaptive nature of the entropy decoder, it cannot be parallelized. This renders a bottleneck to the performance of the entire decoding process. Depending on the type of the processing core and the video bit-rate, the performance of the entire decoding process is subject to the process of entropy decoding. It is, therefore, needful to research and implement new algorithmic solutions to compensate for this bottleneck, and thereby make optimal use of parallel implementation of H.264 decoder on mainstream multi-core systems. This paper presents a new CAVLC decoding method which is de-rived by constructing custom CAVLC decoding tables using 'table grouping'. Compared to the conventional [5] 'sequential table look-up' method, which requires multiple memory accesses. Our proposed method accesses the custom tables only once for the decoding of any symbol Moreover, in our proposed method, the symbol decoding time does not depend on the symbol length and it is constant for each symbol, resulting in a nearly linear increase in computational complexity with increase in video fidelity as compared to an non linear increase in earlier proposed methods. Experimental results show that our proposed algorithm features up to 7x higher performance and 83% less memory accesses compared to conventional methods. We compare to three commonly used, state-of-the-art CAVLC algorithms, such as table look-up by sequential search [5], table look-up by binary search [9], and "Moon's method" [16].