Performance and Scalability of GPU-Based Convolutional Neural Networks

Authors:
Daniel Strigl;Klaus Kofler;Stefan Podlipnig
Affiliations:
-;-;-
Venue:
PDP '10 Proceedings of the 2010 18th Euromicro Conference on Parallel, Distributed and Network-based Processing
Year:
2010

Citing 0
Cited 2

Character recognition of license plate number using convolutional neural network

IVIC'11 Proceedings of the Second international conference on Visual informatics: sustaining research and innovations - Volume Part I
Flexible, high performance convolutional neural networks for image classification

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Two

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper we present the implementation of a framework for accelerating training and classification of arbitrary Convolutional Neural Networks (CNNs) on the GPU. CNNs are a derivative of standard Multilayer Perceptron (MLP) neural networks optimized for two-dimensional pattern recognition problems such as Optical Character Recognition (OCR) or face detection. We describe the basic parts of a CNN and demonstrate the performance and scalability improvement that can be achieved by shifting the computation-intensive tasks of a CNN to the GPU. Depending on the network topology training and classification on the GPU performs 2 to 24 times faster than on the CPU. Furthermore, the GPU version scales much better than the CPU implementation with respect to the network size.