Convolution neural network hyperparameter optimization using modified particle swarm optimization

Q2 Mathematics

Bulletin of Electrical Engineering and Informatics Pub Date : 2024-04-01 DOI:10.11591/eei.v13i2.6112

Muhammad Munsarif, Muhammad Sam'an, Andrian Fahrezi

{"title":"Convolution neural network hyperparameter optimization using modified particle swarm optimization","authors":"Muhammad Munsarif, Muhammad Sam'an, Andrian Fahrezi","doi":"10.11591/eei.v13i2.6112","DOIUrl":null,"url":null,"abstract":"Based on the literature review, a convolutional neural network (CNN) is one of the deep learning techniques most often used for classification problems, especially image classification. Various approaches have been proposed to improve accuracy performance. In CNN architecture, parameter determination is very influential on accuracy performance. Particle swarm optimization (PSO) is a type of metaheuristic algorithm widely used for hyperparameter optimization. PSO convergence is faster than genetic algorithm (GA) and attracts many researchers for further studies such as genetic algorithms and ant colony. In PSO, determining the value of the weight parameter is very influential on accuracy. Therefore, this paper proposes CNN hyperparameter optimization using modified PSO with linearly decreasing randomized weight. The experiments use the modified National Institute of Standards and Technology (MNIST) dataset. The accuracy of the proposed method is superior, and the execution time is slower to random search. In epoch 1, epoch 3, and epoch 5, the proposed method is superior to baseline CNN, linearly decreasing weight PSO (LDWPSO), and RL-based optimization algorithm (ROA). Meanwhile, the accuracy performance of the proposed method is superior to previous studies, namely LeNet-1, LeNet-2, LeNet-3, PCANet-2, RANDNet-2, CAE1, CAE-2, and bee colony. Otherwise, lost to PSO-CNN, distributed PSO (DPSO), recurrent CNN, and CNN-PSO. However, the four methods have a complex architecture and wasteful execution time.","PeriodicalId":37619,"journal":{"name":"Bulletin of Electrical Engineering and Informatics","volume":"6 7","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bulletin of Electrical Engineering and Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.11591/eei.v13i2.6112","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Mathematics","Score":null,"Total":0}

引用次数: 0

Abstract

Based on the literature review, a convolutional neural network (CNN) is one of the deep learning techniques most often used for classification problems, especially image classification. Various approaches have been proposed to improve accuracy performance. In CNN architecture, parameter determination is very influential on accuracy performance. Particle swarm optimization (PSO) is a type of metaheuristic algorithm widely used for hyperparameter optimization. PSO convergence is faster than genetic algorithm (GA) and attracts many researchers for further studies such as genetic algorithms and ant colony. In PSO, determining the value of the weight parameter is very influential on accuracy. Therefore, this paper proposes CNN hyperparameter optimization using modified PSO with linearly decreasing randomized weight. The experiments use the modified National Institute of Standards and Technology (MNIST) dataset. The accuracy of the proposed method is superior, and the execution time is slower to random search. In epoch 1, epoch 3, and epoch 5, the proposed method is superior to baseline CNN, linearly decreasing weight PSO (LDWPSO), and RL-based optimization algorithm (ROA). Meanwhile, the accuracy performance of the proposed method is superior to previous studies, namely LeNet-1, LeNet-2, LeNet-3, PCANet-2, RANDNet-2, CAE1, CAE-2, and bee colony. Otherwise, lost to PSO-CNN, distributed PSO (DPSO), recurrent CNN, and CNN-PSO. However, the four methods have a complex architecture and wasteful execution time.

查看原文本刊更多论文

利用改进的粒子群优化技术优化卷积神经网络超参数

根据文献综述，卷积神经网络（CNN）是最常用于分类问题（尤其是图像分类）的深度学习技术之一。为了提高准确率，人们提出了各种方法。在 CNN 架构中，参数的确定对准确率性能影响很大。粒子群优化（PSO）是一种元启发式算法，被广泛用于超参数优化。PSO 的收敛速度比遗传算法（GA）更快，吸引了许多研究人员对遗传算法和蚁群等算法进行深入研究。在 PSO 中，权重参数值的确定对精度影响很大。因此，本文提出使用线性递减随机权重的改进型 PSO 进行 CNN 超参数优化。实验使用了修改后的美国国家标准与技术研究院（MNIST）数据集。与随机搜索相比，所提方法的精度更高，执行时间更慢。在epoch 1、epoch 3和epoch 5中，提出的方法优于基线CNN、线性递减权重PSO（LDWPSO）和基于RL的优化算法（ROA）。同时，所提方法的准确度表现优于之前的研究，即 LeNet-1、LeNet-2、LeNet-3、PCANet-2、RANDNet-2、CAE1、CAE-2 和蜂群。其他方法则输给了 PSO-CNN、分布式 PSO（DPSO）、递归 CNN 和 CNN-PSO。然而，这四种方法结构复杂，执行时间浪费。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Bulletin of Electrical Engineering and Informatics Computer Science-Computer Science (miscellaneous)

CiteScore

3.60

自引率

0.00%

发文量

期刊介绍： Bulletin of Electrical Engineering and Informatics publishes original papers in the field of electrical, computer and informatics engineering which covers, but not limited to, the following scope: Computer Science, Computer Engineering and Informatics[...] Electronics[...] Electrical and Power Engineering[...] Telecommunication and Information Technology[...]Instrumentation and Control Engineering[...]