Box-counting algorithm on GPU and multi-core CPU: an OpenCL cross-platform study

  • Authors:
  • Jesús Jiménez;Juan Ruiz De Miras

  • Affiliations:
  • Department of Computer Science, University of Jaén, Jaén, Spain 23071;Department of Computer Science, University of Jaén, Jaén, Spain 23071

  • Venue:
  • The Journal of Supercomputing
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we present the analysis and development of a cross-platform OpenCL implementation of the box-counting algorithm, which is one of the most widely-used methods for estimating the Fractal Dimension. The Fractal Dimension is a relevant image analysis method used in several disciplines, but computing it is in general a time consuming process, especially when working with 3D images. Unlike parallel programming models that strictly depend on the hardware type and manufacturer, like CUDA, OpenCL allows us to provide an implementation suitable for execution on both GPUs and multi-core CPUs, whatever the hardware manufacturer. Sorting is a key part of the fast box-counting algorithm and the final speedup is highly conditioned by the efficiency of the sorting algorithm used. Our study reveals that current OpenCL implementations of sorting algorithms are clearly slower when compared with both CUDA for GPU and specific multi-core CPU implementations. Our OpenCL algorithm has been specifically optimized according the type of the target device and the results show an average speedup of up to 7.46脳 and 4脳, when executed on the GPU and the multi-core CPU respectively, both compared with the single-threaded (sequential) CPU implementation.