Performance and Scalability of GPU-Based Convolutional Neural Networks 论文

2010引用 241

Advanced Neural Network ApplicationsNeural Networks and ApplicationsAdvanced Image and Video Retrieval Techniques

人工智能 Advanced Neural Network Applications Advanced Image and Video Retrieval Techniques Neural Networks and Applications

关系图谱

作者

摘要

In this paper we present the implementation of a framework for accelerating training and classification of arbitrary Convolutional Neural Networks (CNNs) on the GPU. CNNs are a derivative of standard Multilayer Perceptron (MLP) neural networks optimized for two-dimensional pattern recognition problems such as Optical Character Recognition (OCR) or face detection. We describe the basic parts of a CNN and demonstrate the performance and scalability improvement that can be achieved by shifting the computation-intensive tasks of a CNN to the GPU. Depending on the network topology training and classification on the GPU performs 2 to 24 times faster than on the CPU. Furthermore, the GPU version scales much better than the CPU implementation with respect to the network size.

作者查看全部 (3)

Stefan Podlipnig

Klaus Kofler

Daniel Strigl

Performance and Scalability of GPU-Based Convolutional Neural Networks 论文

摘要

作者查看全部 (3)

相关技术查看全部 (3)

相关事件

相关文章