Revision of the LeNet algorithm——Construction of LeNet deformation algorithm based on multi-conditional hyperparameter adjustment

{"title":"Revision of the LeNet algorithm——Construction of LeNet deformation algorithm based on multi-conditional hyperparameter adjustment","authors":"","doi":"10.25236/ajcis.2023.060803","DOIUrl":null,"url":null,"abstract":"This paper explores two main issues. First, this paper explores the optimal hyperparameters of the LeNet algorithm under the Fashion-MNIST dataset based on the grid method: where when the learning rate is 0.032, the regularization coefficient is 0.03, the momentum is 0.9, the weight decay parameter is 0.001, and the number of iterative rounds is 50, the model has the best results under the Fashion-MNIST dataset of 10% uniformly sampled samples has the relatively best results, i.e., the test accuracy converges to 85.8%. In addition, this paper improves the LeNet algorithm by constructing a LeNet deformation algorithm based on multi-conditional hyperparameter adjustment, specifically, the learning rate, momentum, and regularization coefficients change with the increase of the number of iteration rounds; in addition, in the construction of the model, the model introduces two blocks containing a convolutional layer, a batch normalization layer (BatchNorm), and a maximum pooling layer, and three linear neuron layers . After tuning, the tested accuracy of the algorithm is 91.5% under the full sample based on the Fashion-MNIST dataset.","PeriodicalId":387664,"journal":{"name":"Academic Journal of Computing & Information Science","volume":"42 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Academic Journal of Computing & Information Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.25236/ajcis.2023.060803","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

This paper explores two main issues. First, this paper explores the optimal hyperparameters of the LeNet algorithm under the Fashion-MNIST dataset based on the grid method: where when the learning rate is 0.032, the regularization coefficient is 0.03, the momentum is 0.9, the weight decay parameter is 0.001, and the number of iterative rounds is 50, the model has the best results under the Fashion-MNIST dataset of 10% uniformly sampled samples has the relatively best results, i.e., the test accuracy converges to 85.8%. In addition, this paper improves the LeNet algorithm by constructing a LeNet deformation algorithm based on multi-conditional hyperparameter adjustment, specifically, the learning rate, momentum, and regularization coefficients change with the increase of the number of iteration rounds; in addition, in the construction of the model, the model introduces two blocks containing a convolutional layer, a batch normalization layer (BatchNorm), and a maximum pooling layer, and three linear neuron layers . After tuning, the tested accuracy of the algorithm is 91.5% under the full sample based on the Fashion-MNIST dataset.
LeNet算法的修正——基于多条件超参数调整的LeNet变形算法的构建
本文探讨了两个主要问题。首先,本文探索了基于网格方法的Fashion-MNIST数据集下LeNet算法的最优超参数:其中,当学习率为0.032,正则化系数为0.03,动量为0.9,权值衰减参数为0.001,迭代轮数为50次时,模型在10%均匀采样样本的Fashion-MNIST数据集下效果最好,测试精度收敛到85.8%。此外,本文对LeNet算法进行了改进,构造了基于多条件超参数调整的LeNet变形算法,学习率、动量、正则化系数随迭代轮数的增加而变化;此外,在模型的构建中,该模型引入了包含卷积层、批归一化层(BatchNorm)和最大池化层的两个块以及三个线性神经元层。经过调优,基于Fashion-MNIST数据集的全样本下,算法的测试准确率为91.5%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信