Power Breakdown Analysis for a Heterogeneous NoC Platform Running a Video Application

  • Authors:
  • Andy Lambrechts;Praveen Raghavan;Anthony Leroy;Guillermo Talavera;Tom Vander Aa;Murali Jayapala;Francky Catthoor;Diederik Verkest;Geert Deconinck;Henk Corporaal;Frédéric Robert;Jordi Carrabina

  • Affiliations:
  • IMEC vzw, Belgium 2Dept of Electrical Engineering, K.U.L., Belgium;IMEC vzw, Belgium, Dept of Electrical Engineering, K.U.L., Belgium;MiEL, U.L.B., Belgium;Dept of Computer Science, U.A.B., Spain 6Dept of Electrical Engineering, TU/e, The Netherlands;Dept of Electrical Engineering, K.U.L., Belgium;Dept of Electrical Engineering, K.U.L., Belgium;Dept of Electrical Engineering, K.U.L., Belgium;Dept of Electrical Engineering, V.U.B., Belgium 4MiEL, U.L.B., Belgium;Dept of Electrical Engineering, K.U.L., Belgium;Dept of Electrical Engineering, TU/e, The Netherlands;MiEL, U.L.B., Belgium;Dept of Computer Science, U.A.B., Spain

  • Venue:
  • ASAP '05 Proceedings of the 2005 IEEE International Conference on Application-Specific Systems, Architecture Processors
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

Users expect future handheld devices to provide extended multimedia functionality and have long battery life. This type of application imposes heavy constraints on performance and power consumption and forces designers to optimize all parts of their platform. Evaluating the overall platform power breakdown is therefore critical to determine where to spend the efforts on power optimization. Surprisingly, few studies exist on that topic and decisions generally rely on common belief. We have realized a complete power breakdown for a realistic platform to identify the major power bottlenecks. This paper presents this power assessment of a realistic heterogeneous network on chip platform including processors, network and data/instruction memory hierarchy, running a video processing chain from camera to display. Our power breakdown identifies the main bottlenecks in the memory hierarchy and the foreground memory, and shows that global interconnect is not that critical for a well-optimized application mapping.