Novel dynamic partial reconfiguration implementation of k-means clustering on FPGAs: comparative results with GPPs and GPUs

  • Authors:
  • Hanaa M. Hussain;Khaled Benkrid;Ali Ebrahim;Ahmet T. Erdogan;Huseyin Seker

  • Affiliations:
  • School of Engineering, University of Edinburgh, King's Buildings, Edinburgh, UK;School of Engineering, University of Edinburgh, King's Buildings, Edinburgh, UK;School of Engineering, University of Edinburgh, King's Buildings, Edinburgh, UK;School of Engineering, University of Edinburgh, King's Buildings, Edinburgh, UK;Centre for Computational Intelligence, De Montfort University, Leicester, UK

  • Venue:
  • International Journal of Reconfigurable Computing - Special issue on Selected Papers from the 2011 International Conference on Reconfigurable Computing and FPGAs (ReConFig 2011)
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

K-means clustering has been widely used in processing large datasets in many fields of studies. Advancement in many data collection techniques has been generating enormous amounts of data, leaving scientists with the challenging task of processing them. Using General Purpose Processors (GPPs) to process large datasets may take a long time; therefore many acceleration methods have been proposed in the literature to speed up the processing of such large datasets. In this work, a parameterized implementation of the K-means clustering algorithm in Field Programmable Gate Array (FPGA) is presented and compared with previous FPGA implementation as well as recent implementations on Graphics Processing Units (GPUs) and GPPs. The proposed FPGA has higher performance in terms of speedup over previous GPP and GPU implementations (two orders and one order of magnitude, resp.). In addition, the FPGA implementation is more energy efficient than GPP and GPU (615x and 31x, resp.). Furthermore, three novel implementations of the K-means clustering based on dynamic partial reconfiguration (DPR) are presented offering high degree of flexibility to dynamically reconfigure the FPGA. The DPR implementations achieved speedups in reconfiguration time between 4x to 15x.