Efficient Implementation of Activation Function on FPGA for Accelerating Neural Networks

2023 IEEE International Symposium on Circuits and Systems (ISCAS) Pub Date : 2023-05-21 DOI:10.1109/ISCAS46773.2023.10181406

Kai Qian, Yinqiu Liu, Zexu Zhang, Kun Wang

引用次数: 0

Abstract

In this paper, we present the Integer Lightweight Softmax (ILS) algorithm for approximating the Softmax activation function. The accurate implementation of Softmax on FPGA can be huge resource-intensive and memory-hungry. Then, we present the implementation of ILS on a Xilinx XCKU040 FPGA to evaluate the effectiveness of ILS. Evaluations on CIFAR 10, CIFAR 100 and ImageNet show that ILS achieves up to $2.47\times, 40\times$ and $323\times$ speedup over CPU implementation, and $4\times, 63\times$ and $51\times$ speedup over GPU implementation, respectively. In comparison to previous FPGA-based Softmax implementations, ILS strikes a better balance between resource consumption and precision accuracy.

查看原文本刊更多论文

加速神经网络激活函数在FPGA上的高效实现

本文提出了一种近似Softmax激活函数的整数轻量级Softmax (ILS)算法。Softmax在FPGA上的精确实现可能是巨大的资源密集型和内存消耗。然后，我们在Xilinx XCKU040 FPGA上实现了ILS，以评估ILS的有效性。在CIFAR 10、CIFAR 100和ImageNet上的评估表明，ILS比CPU实现的加速分别达到2.47倍、40倍和323倍，比GPU实现的加速分别达到4倍、63倍和51倍。与以前基于fpga的Softmax实现相比，ILS在资源消耗和精度精度之间取得了更好的平衡。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2023 IEEE International Symposium on Circuits and Systems (ISCAS)

自引率

0.00%

发文量