Soft Clipping Mish - A Novel Activation Function for Deep Learning

Marina Adriana Mercioni, S. Holban
{"title":"Soft Clipping Mish - A Novel Activation Function for Deep Learning","authors":"Marina Adriana Mercioni, S. Holban","doi":"10.1109/ICICT52872.2021.00010","DOIUrl":null,"url":null,"abstract":"This study aims to introduce a novel activation function, called Soft Clipping Mish. In other words, it brings improvements in order to increase the performance within the architecture. Its capability was tested on different scenarios using different datasets and using LeNet-5 architecture. So, these different testing conditions strengthen our proposal, the fact also emphasized in the experimental phase. We used such as datasets: MNIST, Fashion-MNIST, CIFAR-10, CIFAR-100 for a classification task, and Beijing PM2.5 dataset for a prediction task to determine the rank of air pollution. We introduced two variants of this function, the first variant being Soft Clipping Mish with a predefined parameter and the second variant being Soft Clipping Mish learnable, the learnable parameter giving us more flexibility into weights updates. This learnable parameter was initialized during the training phase with a value equal to 0.25. Our proposal was inspired by a recent activation function called Mish.","PeriodicalId":359456,"journal":{"name":"2021 4th International Conference on Information and Computer Technologies (ICICT)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 4th International Conference on Information and Computer Technologies (ICICT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICICT52872.2021.00010","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

This study aims to introduce a novel activation function, called Soft Clipping Mish. In other words, it brings improvements in order to increase the performance within the architecture. Its capability was tested on different scenarios using different datasets and using LeNet-5 architecture. So, these different testing conditions strengthen our proposal, the fact also emphasized in the experimental phase. We used such as datasets: MNIST, Fashion-MNIST, CIFAR-10, CIFAR-100 for a classification task, and Beijing PM2.5 dataset for a prediction task to determine the rank of air pollution. We introduced two variants of this function, the first variant being Soft Clipping Mish with a predefined parameter and the second variant being Soft Clipping Mish learnable, the learnable parameter giving us more flexibility into weights updates. This learnable parameter was initialized during the training phase with a value equal to 0.25. Our proposal was inspired by a recent activation function called Mish.
一种新的用于深度学习的激活函数
本研究的目的是引入一种新的激活功能,称为“软剪切”。换句话说,它带来了改进,以提高体系结构内的性能。在使用不同数据集和LeNet-5架构的不同场景中测试了其功能。因此,这些不同的测试条件加强了我们的建议,这一事实在实验阶段也得到了强调。我们使用如下数据集:MNIST、Fashion-MNIST、CIFAR-10、CIFAR-100进行分类任务,使用北京PM2.5数据集进行预测任务,以确定空气污染的等级。我们引入了这个函数的两个变体,第一个变体是带有预定义参数的Soft Clipping Mish,第二个变体是可学习的Soft Clipping Mish,可学习的参数使我们更灵活地进行权重更新。这个可学习参数在训练阶段初始化,值为0.25。我们的设计灵感来自于最近的一个名为Mish的激活功能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信