Accelerating Generative Neural Networks on Unmodified Deep Learning Processors -- A Software Approach

Dawen Xu; Ying Wang; Kaijie Tu; Cheng Liu; Bingsheng He; Lei Zhang

doi:10.48550/arxiv.1907.01773

Shared by

Lei Zhang

Preprint 2019 en

Abstract

Generative neural networks are a relatively new category of neural networks that have been widely used in applications such as content generation, unsupervised learning, segmentation, and pose estimation. They typically involve massive compute-intensive deconvolution operations that do not map directly onto conventional neural network processors. Prior work has mainly investigated specialized hardware architectures that accelerate deconvolution alongside convolution through intensive hardware modifications to existing deep learning processors. In contrast, this work proposes a novel software-based deconvolution implementation that enables fast and efficient deconvolution execution on legacy deep learning processors. The proposed method reorganizes the deconvolution computation and splits the original deconvolution filters into multiple small filters, so that the deep learning processor can treat the operation as standard convolution. Compared to prior acceleration schemes, the implemented scheme achieves a 2.41x to 4.34x performance speedup and reduces energy consumption by 27.7% to 54.5% on a set of realistic benchmarks. In addition, we applied the deconvolution computing approach to off-the-shelf commodity deep learning processors, where it also shows a significant speedup over prior deconvolution implementations.
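The filter-splitting idea in the abstract can be illustrated with a small NumPy sketch. It is not taken from the paper: it assumes a single-channel 2D input, no output padding, and a stride `s`, and the function names (`conv2d_valid`, `deconv_reference`, `deconv_split`) are hypothetical. The sketch relies on a standard identity: a stride-`s` deconvolution (transposed convolution) equals `s * s` ordinary stride-1 convolutions, each using the sub-filter `w[a::s, b::s]`, whose outputs are interleaved into the result. The authors' actual scheme may differ in how it pads, schedules, and maps these sub-filters onto the processor.

```python
import numpy as np

def conv2d_valid(x, k):
    """Plain stride-1 'valid' cross-correlation, standing in for the
    standard convolution primitive of a deep learning processor."""
    kh, kw = k.shape
    oh, ow = x.shape[0] - kh + 1, x.shape[1] - kw + 1
    out = np.zeros((oh, ow))
    for m in range(kh):
        for n in range(kw):
            out += k[m, n] * x[m:m + oh, n:n + ow]
    return out

def deconv_reference(x, w, s):
    """Direct transposed convolution: each input pixel scatters a scaled
    copy of the filter into the output at stride s."""
    H, W = x.shape
    K, L = w.shape
    out = np.zeros(((H - 1) * s + K, (W - 1) * s + L))
    for i in range(H):
        for j in range(W):
            out[i * s:i * s + K, j * s:j * s + L] += x[i, j] * w
    return out

def deconv_split(x, w, s):
    """Same result, computed as s*s standard convolutions with small
    sub-filters (an illustrative sketch, not the paper's exact method)."""
    H, W = x.shape
    K, L = w.shape
    out = np.zeros(((H - 1) * s + K, (W - 1) * s + L))
    for a in range(s):
        for b in range(s):
            sub = w[a::s, b::s]          # sub-filter for output phase (a, b)
            if sub.size == 0:            # happens when the filter is smaller than s
                continue
            kh, kw = sub.shape
            # Zero-pad so that a 'valid' correlation with the flipped
            # sub-filter yields a 'full' convolution.
            xp = np.pad(x, ((kh - 1, kh - 1), (kw - 1, kw - 1)))
            out[a::s, b::s] = conv2d_valid(xp, sub[::-1, ::-1])
    return out

rng = np.random.default_rng(0)
x = rng.standard_normal((5, 6))
w = rng.standard_normal((4, 4))
print(np.allclose(deconv_split(x, w, 2), deconv_reference(x, w, 2)))
```

In this formulation no zeros are inserted into the input feature map, so the convolution primitive only performs multiplications that contribute to the output, which is one plausible reason a split-filter approach can run efficiently on unmodified hardware.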

Publication Info

DOI: 10.48550/arxiv.1907.01773
Year: 2019
Published: —
Language: en

Preprint Details

Link Of The Paper: http://arxiv.org/abs/1907.01773

Timeline

Created: June 19, 2026

Related publications

Article 2020

Software Vulnerability Detection Using Deep Neural Networks: A Survey

Guanjun Lin, Sheng Wen, Qinglong Han, Jun Zhang, Yang Xiang

Proceedings of the IEEE

Preprint 2019

Gemmini: An Agile Systolic Array Generator Enabling Systematic Evaluations of Deep-Learning Architectures

Hasan Genc, Ameer Haj-Ali, Vighnesh Iyer, Alon Amid, Howard Mao, John Wright, Colin Schmidt, Jerry Zhao, Albert Ou, Max Banister, Yakun Sophia Shao, Borivoje Nikolić, Ion Stoica, Krste Asanović

Learning a Wavelet-Like Auto-Encoder to Accelerate Deep Neural Networks

A Cost-Sensitive Deep Learning-Based Approach for Network Traffic Classification