clCaffe: OpenCL Accelerated Caffe for Convolutional Neural Networks
Autor: | Jingyi Jin, SungYe Kim, Jeff Andrews, Deepak N. Murthy, Jeremy Bottleson, Preeti Bindu |
---|---|
Rok vydání: | 2016 |
Předmět: |
Speedup
Caffè Computer science business.industry Deep learning 02 engineering and technology Parallel computing Convolutional neural network 020202 computer hardware & architecture Computer architecture 0202 electrical engineering electronic engineering information engineering Leverage (statistics) Theano 020201 artificial intelligence & image processing Artificial intelligence business |
Zdroj: | IPDPS Workshops |
DOI: | 10.1109/ipdpsw.2016.182 |
Popis: | Recent advances in deep convolutional neural networks enable researchers and developers to apply machine learning to a much broader number of applications. With the proliferation of deep learning applications, widely used deep learning frameworks, such as Caffe, Theano and Torch, have been significantly improved with the support of powerful GPUs and GPU-accelerated libraries. However, lack of frameworks and libraries built on OpenCL could hinder exploration of more diverse compute devices (CPUs, GPUs, DSPs and FPGAs) in future deep learning domains. In this work, we present OpenCL acceleration of a well-known deep learning framework, Caffe, while focusing on the convolution layer which has been optimized with three different approaches, GEMM, spatial domain, and frequency domain. Our work, clCaffe, greatly enhances the ability to leverage deep learning use cases on all types of OpenCL devices, particularly on small form factor devices in which discrete GPUs are rare and integrated GPUs are much more common. Our benchmark shows 2.5x speedup on the Intel integrated-GPU, compared to CPU-only AlexNet on ImageNet dataset. As such, our work provides the deep learning community with the opportunity to embrace a broad range of devices through OpenCL. |
Databáze: | OpenAIRE |
Externí odkaz: |