Hardware-Efficient Deconvolution-Based GAN for Edge Computing

2022 56th Annual Conference on Information Sciences and Systems (CISS) Pub Date : 2022-01-18 DOI:10.1109/CISS53076.2022.9751185

A. Alhussain, Mingjie Lin

引用次数: 3

Abstract

Generative Adversarial Networks (GAN) are cutting-edge algorithms for generating new data samples based on the learned data distribution. However, its performance comes at a significant cost in terms of computation and memory requirements. In this paper, we proposed an HW/SW co-design approach for training quantized deconvolution GAN (QDCGAN) implemented on FPGA using a scalable streaming dataflow architecture capable of achieving higher throughput versus resource utilization trade-off. The developed accelerator is based on an efficient deconvolution engine that offers high parallelism with respect to scaling factors for GAN-based edge computing. Furthermore, various precisions, datasets, and network scalability were analyzed for low-power inference on resource-constrained platforms. Lastly, an end-to-end open-source framework is provided for training, implementation, state-space exploration, and scaling the inference using Vivado high-level synthesis for Xilinx SoC-FPGAs, and a comparison testbed with Jetson Nano.

查看原文本刊更多论文

基于硬件高效反卷积的边缘计算GAN

生成对抗网络(GAN)是一种基于学习到的数据分布生成新数据样本的前沿算法。然而，它的性能在计算和内存需求方面付出了巨大的代价。在本文中，我们提出了一种硬件/软件协同设计方法，用于训练在FPGA上实现的量化反卷积GAN (QDCGAN)，该方法使用可扩展的流数据流架构，能够实现更高的吞吐量与资源利用率之间的权衡。开发的加速器基于高效的反褶积引擎，为基于gan的边缘计算的缩放因子提供了高并行性。此外，还分析了资源受限平台上低功耗推理的各种精度、数据集和网络可扩展性。最后，提供了一个端到端的开源框架，用于训练、实现、状态空间探索和扩展推理，该框架使用Vivado高级合成Xilinx soc - fpga，并与Jetson Nano进行了比较测试。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 56th Annual Conference on Information Sciences and Systems (CISS)

自引率

0.00%

发文量