{"title":"The Role of Activation Function in CNN","authors":"Wang Hao, Yizhou Wang, Lou Yaqin, Song Zhili","doi":"10.1109/ITCA52113.2020.00096","DOIUrl":null,"url":null,"abstract":"We all know that the purpose of introducing activation function is to give neural network nonlinear expression ability, so that it can better fit the results, so as to improve the accuracy. However, different activation functions have different performance in different neural networks. In this paper, several activation functions commonly used by researchers are compared one by one, and qualitative comparison results are given by combining with specific neural network models. For example, when using the MNIST dataset in LeNet, PReLU achieved the highest accuracy of 98.724%, followed by Swish at 98.708%. When cifar-10 data set was used, the highest accuracy rate of ELU was 64.580%, followed by Mish at 64.455%. When Using VGG16, ReLU reached the highest accuracy of 90.226%, followed by PReLU at 90.197%. When using ResNet50, ELU achieved the highest accuracy of 89.943%, followed by Mish at 89.780%.","PeriodicalId":103309,"journal":{"name":"2020 2nd International Conference on Information Technology and Computer Application (ITCA)","volume":"60 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 2nd International Conference on Information Technology and Computer Application (ITCA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITCA52113.2020.00096","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 11
Abstract
Activation functions are introduced to give neural networks nonlinear expressive power, allowing them to fit the data better and thereby improve accuracy. However, different activation functions perform differently in different neural networks. In this paper, several activation functions commonly used by researchers are compared one by one, and comparative results are reported for specific neural network models. For example, on the MNIST dataset with LeNet, PReLU achieved the highest accuracy at 98.724%, followed by Swish at 98.708%. On the CIFAR-10 dataset, ELU achieved the highest accuracy at 64.580%, followed by Mish at 64.455%. With VGG16, ReLU reached the highest accuracy at 90.226%, followed by PReLU at 90.197%. With ResNet50, ELU achieved the highest accuracy at 89.943%, followed by Mish at 89.780%.
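The abstract does not include code, but the experiment it describes amounts to training the same CNN with the activation function swapped out. The following is a minimal sketch, assuming a PyTorch LeNet-5-style architecture; the layer sizes, the `ACTIVATIONS` mapping, and the `LeNetVariant` class are illustrative assumptions, not the authors' implementation.

```python
# A minimal sketch (not the authors' code) of how one might compare
# activation functions in a LeNet-style CNN on MNIST-shaped inputs.
# The exact architecture and hyperparameters are assumptions.
import torch
import torch.nn as nn

ACTIVATIONS = {
    "ReLU": nn.ReLU,
    "PReLU": nn.PReLU,
    "ELU": nn.ELU,
    "Swish": nn.SiLU,   # Swish with beta = 1 is SiLU in PyTorch
    "Mish": nn.Mish,
}

class LeNetVariant(nn.Module):
    """LeNet-5-style CNN whose activation function is swappable."""
    def __init__(self, act_name: str = "ReLU", num_classes: int = 10):
        super().__init__()
        act = ACTIVATIONS[act_name]
        self.features = nn.Sequential(
            nn.Conv2d(1, 6, kernel_size=5, padding=2), act(),  # 28x28 -> 28x28
            nn.MaxPool2d(2),                                   # -> 14x14
            nn.Conv2d(6, 16, kernel_size=5), act(),            # -> 10x10
            nn.MaxPool2d(2),                                   # -> 5x5
        )
        self.classifier = nn.Sequential(
            nn.Flatten(),
            nn.Linear(16 * 5 * 5, 120), act(),
            nn.Linear(120, 84), act(),
            nn.Linear(84, num_classes),
        )

    def forward(self, x):
        return self.classifier(self.features(x))

if __name__ == "__main__":
    # Smoke test: one forward pass per activation on a fake MNIST batch.
    x = torch.randn(8, 1, 28, 28)
    for name in ACTIVATIONS:
        logits = LeNetVariant(name)(x)
        print(name, logits.shape)  # expected: torch.Size([8, 10])
```

In a full comparison, each variant would be trained with an identical schedule and evaluated on the held-out test set, so that only the activation function differs between runs.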