Parallel H.264 Decoding on an Embedded Multicore Processor

  • Authors:
  • Arnaldo Azevedo;Cor Meenderinck;Ben Juurlink;Andrei Terechko;Jan Hoogerbrugge;Mauricio Alvarez;Alex Ramirez

  • Affiliations:
  • Delft University of Technology, Delft, The Netherlands;Delft University of Technology, Delft, The Netherlands;Delft University of Technology, Delft, The Netherlands;NXP, Eindhoven, The Netherlands;NXP, Eindhoven, The Netherlands;Technical University of Catalonia (UPC), Barcelona, Spain;Technical University of Catalonia (UPC), Barcelona, Spain and Barcelona Supercomputing Center (BSC), Barcelona, Spain

  • Venue:
  • HiPEAC '09 Proceedings of the 4th International Conference on High Performance Embedded Architectures and Compilers
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

In previous work the 3D-Wave parallelization strategy was proposed to increase the parallel scalability of H.264 video decoding. This strategy is based on the observation that inter-frame dependencies have a limited spatial range. The previous results, however, investigate application scalability on an idealized multiprocessor. This work presents an implementation of the 3D-Wave strategy on a multicore architecture composed of NXP TriMedia TM3270 embedded processors. The results show that the parallel H.264 implementation scales very well, achieving a speedup of more than 54 on a 64-core processor. Potential drawbacks of the 3D-Wave strategy are that the memory requirements increase since there can be many frames in flight, and that the latencies of some frames might increase. To address these drawbacks, policies to reduce the number of frames in flight and the frame latency are also presented. The results show that our policies combat memory and latency issues with a negligible effect on the performance scalability.