Fast on-line statistical learning on a GPGPU

  • Authors:
  • FangZhou Xiao;Eric McCreath;Christfried Webers

  • Affiliations:
  • The Australian National University, Canberra, Australia;The Australian National University, Canberra, Australia;The Australian National University, Canberra, Australia and NICTA, Canberra, Australia

  • Venue:
  • AusPDC '11 Proceedings of the Ninth Australasian Symposium on Parallel and Distributed Computing - Volume 118
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

On-line Machine Learning using Stochastic Gradient Descent is an inherently sequential computation. This makes it difficult to improve performance by simply employing parallel architectures. Langford et al. made a modification to the standard stochastic gradient descent approach which opens up the possibility of parallel computation. They also proved that there is no significant loss in accuracy in their approach. They did empirically demonstrate the performance gain in speed for the case of a pipelined architecture with a few processing units. In this paper we report on applying the Langford et al. approach on a General Purpose Graphics Processing Unit (GPGPU) with a large number of processing units. We accelerate the learning speed by approximately 4.5 times compared to a standard single threaded approach with comparable accuracy. We also evaluate the GPU performance for the sequential variant of the algorithm, which has not previously been reported. Finally, we investigate how changes in the number of threads, number of blocks, and amount of delay, effects the overall performance and accuracy.