Parallelism and scalability in an image processing application

  • Authors:
  • Morten S. Rasmussen;Matthias B. Stuart;Sven Karlsson

  • Affiliations:
  • DTU Informatics, Technical University of Denmark;DTU Informatics, Technical University of Denmark;DTU Informatics, Technical University of Denmark

  • Venue:
  • IWOMP'08 Proceedings of the 4th international conference on OpenMP in a new era of parallelism
  • Year:
  • 2008

Abstract

The recent trends in processor architecture show that parallel processing is moving into new areas of computing in the form of many-core desktop processors and multi-processor system-on-chip. This means that parallel processing is required in application areas that traditionally have not used parallel programs. This paper investigates parallelism and scalability of an embedded image processing application. The major challenges faced when parallelizing the application were to extract enough parallelism from the application and to reduce load imbalance. The application has limited immediately available parallelism. It is difficult to further extract parallelism since the application has small data sets and parallelization overhead is relatively high. There is also a fair amount of load imbalance which is made worse by a non-uniform memory latency. Even so, we show that with some tuning relative speedups in excess of 9 on a 16 CPU system can be reached.
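
To put the reported figure in context, the following is a short worked calculation. It assumes the usual textbook definitions of relative speedup and parallel efficiency, which the abstract does not spell out; the symbols T(p), S(p) and E(p) are introduced here for illustration only and are not taken from the paper.

```latex
% Assumed definitions (not stated in the abstract):
%   T(p) = run time of the parallel program on p CPUs
%   S(p) = relative speedup, E(p) = parallel efficiency
S(p) = \frac{T(1)}{T(p)}, \qquad E(p) = \frac{S(p)}{p}

% With the reported S(16) > 9:
E(16) = \frac{S(16)}{16} > \frac{9}{16} \approx 0.56
```

Under these assumptions, a speedup above 9 on 16 CPUs corresponds to a parallel efficiency of more than 56 percent, with the remainder attributable to factors such as the parallelization overhead, load imbalance, and non-uniform memory latency named in the abstract.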