Comparison of Activation Functions in Convolutional Neural Network for Poisson Noisy Image Classification

Q1 Multidisciplinary
K. Goh, Sugiyarto Surono, M. Y. F. Afiatin, K. R. Mahmudah, N. Irsalinda, Mesith Chaimanee, Choo Wou Onn
{"title":"Comparison of Activation Functions in Convolutional Neural Network for Poisson Noisy Image Classification","authors":"K. Goh, Sugiyarto Surono, M. Y. F. Afiatin, K. R. Mahmudah, N. Irsalinda, Mesith Chaimanee, Choo Wou Onn","doi":"10.28991/esj-2024-08-02-014","DOIUrl":null,"url":null,"abstract":"Deep learning, specifically the Convolutional Neural Network (CNN), has been a significant technology tool for image processing and human health. CNNs, which mimic the working principles of the human brain, can learn robust representations of images. However, CNNs are susceptible to noise interference, which can impact classification performance. Choosing the right activation function can improve CNNs performance and accuracy. This research aims to test the accuracy of CNN with ResNet50, VGG16, and GoogleNet architectures combined with several activation functions such as ReLU, Leaky ReLU, Sigmoid, and Tanh in the classification of images that experience Poisson noise. Poisson noise is applied to each test data to evaluate CNN accuracy. The data used in this study consists of three scenarios of different numbers of classes, namely 3 classes, 5 classes, and 10 classes. The results showed that combining ResNet50 with the ReLU activation function produced the best performance in class recognition in each scenario of the number of classes experiencing Poisson noise interference. The model achieved 97% accuracy for 3-class data, 95% for 5-class data, and 90% for 10-class data. These results show that using ResNet50 with the ReLU activation function can provide excellent resistance to Poisson noise in image processing. It was found that as the number of classes increases, the accuracy of image recognition tends to decrease. This shows that the more complex the image classification task is with a larger number of classes, the more difficult it is for CNNs to distinguish between different classes. Doi: 10.28991/ESJ-2024-08-02-014 Full Text: PDF","PeriodicalId":11586,"journal":{"name":"Emerging Science Journal","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Emerging Science Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.28991/esj-2024-08-02-014","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Multidisciplinary","Score":null,"Total":0}
引用次数: 0

Abstract

Deep learning, specifically the Convolutional Neural Network (CNN), has been a significant technology tool for image processing and human health. CNNs, which mimic the working principles of the human brain, can learn robust representations of images. However, CNNs are susceptible to noise interference, which can impact classification performance. Choosing the right activation function can improve CNNs performance and accuracy. This research aims to test the accuracy of CNN with ResNet50, VGG16, and GoogleNet architectures combined with several activation functions such as ReLU, Leaky ReLU, Sigmoid, and Tanh in the classification of images that experience Poisson noise. Poisson noise is applied to each test data to evaluate CNN accuracy. The data used in this study consists of three scenarios of different numbers of classes, namely 3 classes, 5 classes, and 10 classes. The results showed that combining ResNet50 with the ReLU activation function produced the best performance in class recognition in each scenario of the number of classes experiencing Poisson noise interference. The model achieved 97% accuracy for 3-class data, 95% for 5-class data, and 90% for 10-class data. These results show that using ResNet50 with the ReLU activation function can provide excellent resistance to Poisson noise in image processing. It was found that as the number of classes increases, the accuracy of image recognition tends to decrease. This shows that the more complex the image classification task is with a larger number of classes, the more difficult it is for CNNs to distinguish between different classes. Doi: 10.28991/ESJ-2024-08-02-014 Full Text: PDF
用于泊松噪声图像分类的卷积神经网络激活函数比较
深度学习,特别是卷积神经网络(CNN),已成为图像处理和人类健康的重要技术工具。卷积神经网络模仿人脑的工作原理,可以学习图像的稳健表示。然而,CNN 易受噪声干扰,从而影响分类性能。选择正确的激活函数可以提高 CNN 的性能和准确性。本研究旨在测试采用 ResNet50、VGG16 和 GoogleNet 体系结构的 CNN,结合 ReLU、Leaky ReLU、Sigmoid 和 Tanh 等激活函数,对存在泊松噪声的图像进行分类的准确性。泊松噪声应用于每个测试数据,以评估 CNN 的准确性。本研究使用的数据包括三种不同类数的场景,即 3 类、5 类和 10 类。结果表明,将 ResNet50 与 ReLU 激活函数相结合,在受到泊松噪声干扰的每种类数场景中都能产生最佳的类识别性能。该模型对 3 类数据的准确率达到 97%,对 5 类数据的准确率达到 95%,对 10 类数据的准确率达到 90%。这些结果表明,在图像处理中使用带有 ReLU 激活函数的 ResNet50 可以很好地抵抗泊松噪声。研究发现,随着类数的增加,图像识别的准确率呈下降趋势。这表明,图像分类任务越复杂,类别数量越多,CNN 就越难区分不同的类别。Doi: 10.28991/ESJ-2024-08-02-014 全文:PDF
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Emerging Science Journal
Emerging Science Journal Multidisciplinary-Multidisciplinary
CiteScore
5.40
自引率
0.00%
发文量
155
审稿时长
10 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信