Accelerating Generative Neural Networks on Unmodified Deep Learning Processors - A Software Approach

Dawen Xu; Cheng Liu; Ying Wang; Kaijie Tu; Bingsheng He; Lei Zhang

doi:10.1109/tc.2020.3001033

Abstract

1 min read

Generative neural network is a new category of neural networks and it has been widely utilized in many applications such as content generation, unsupervised learning, segmentation, and pose estimation. It typically involves massive computing-intensive deconvolution operations that cannot be fitted to conventional neural network processors directly. However, prior works mainly investigated specialized hardware architectures through intensive hardware modifications to the existing deep learning processors to accelerate deconvolution together with the convolution. In contrast, this article proposes a novel deconvolution implementation with a software approach and enables fast and efficient deconvolution execution on the existing deep learning processors. Our proposed method reorganizes the computation of deconvolution and allows the deep learning processors to treat it as the standard convolution by splitting the original deconvolution filters into multiple small filters. Compared to prior acceleration schemes, the implemented acceleration scheme achieves 2.4× -4.3× performance speedup and reduces the energy consumption by 27.7 -54.5 percent on a set of realistic benchmarks. In addition, we have also applied the deconvolution computing approach to the off-the-shelf commodity deep learning processors. The performance of deconvolution also exhibits significant performance speedup over prior deconvolution implementations.

Accelerating Generative Neural Networks on Unmodified Deep Learning Processors - A Software Approach

Abstract

Discussion(0)

Related publications

Accelerating Generative Neural Networks on Unmodified Deep Learning Processors -- A Software Approach

Learning a Wavelet-Like Auto-Encoder to Accelerate Deep Neural Networks

A Cost-Sensitive Deep Learning-Based Approach for Network Traffic Classification

Software Vulnerability Detection Using Deep Neural Networks: A Survey

Gemmini: An Agile Systolic Array Generator Enabling Systematic Evaluations of Deep-Learning Architectures

Related publications

Preprint2019
Accelerating Generative Neural Networks on Unmodified Deep Learning Processors -- A Software Approach
Preprint2019

Article2018
Learning a Wavelet-Like Auto-Encoder to Accelerate Deep Neural Networks
Article2018

Article2021
A Cost-Sensitive Deep Learning-Based Approach for Network Traffic Classification
Article2021

Article2020
Software Vulnerability Detection Using Deep Neural Networks: A Survey
Article2020

Preprint2019
Gemmini: An Agile Systolic Array Generator Enabling Systematic Evaluations of Deep-Learning Architectures
Preprint2019