基于反向传播和协同进化的神经网络

IF 6.6 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Applied Soft Computing Pub Date : 2025-06-21 DOI:10.1016/j.asoc.2025.113453

Yuelin Gao , Yuming Zhang , Xiaofeng Xie

{"title":"基于反向传播和协同进化的神经网络","authors":"Yuelin Gao , Yuming Zhang , Xiaofeng Xie","doi":"10.1016/j.asoc.2025.113453","DOIUrl":null,"url":null,"abstract":"<div><div>Deep neural networks (DNNs) have a powerful feature extraction capability, which allows them to be employed in various fields. However, as the number of layers and neurons in the network increases, the search space for parameter learning becomes complex. Currently, the most commonly used parameter training method is backpropagation (BP) based on gradient descent, but this method is sensitive to the initialization of the parameters and tends to get stuck in local optima in a complex search space. Therefore, a new training method for DNNs has been proposed that combines cooperative co-evolution (CC) with BP-based gradient descent, called BPCC. In the BPCC method, BP performs multiple training periods intermittently, and the CC algorithm is executed when the difference between the current loss function value and the previous loss function value is less than a given threshold (called a condition met). We found that the algorithm easily enters into CC iterations, which reduces the computational effectiveness of the algorithm. A tolerance parameter is designed to curb this phenomenon, and the CC is executed when the cumulative number of times the condition is met reaches the given value of the tolerance parameter, and the improved gray wolf optimizer (GWO) algorithm is used as the solver for the CC. In addition, in the CC iteration stage, the Chebyshev chaotic map series based on the current optimal point is used to initialize the population of GWO to ensure the diversity of the initial population. Experimental comparisons are made with modern network training methods in 7 network models, and the experimental results show that the improved algorithm in this study is competitive.</div></div>","PeriodicalId":50737,"journal":{"name":"Applied Soft Computing","volume":"181 ","pages":"Article 113453"},"PeriodicalIF":6.6000,"publicationDate":"2025-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A neural network based on back-propagation and cooperative co-evolution\",\"authors\":\"Yuelin Gao , Yuming Zhang , Xiaofeng Xie\",\"doi\":\"10.1016/j.asoc.2025.113453\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Deep neural networks (DNNs) have a powerful feature extraction capability, which allows them to be employed in various fields. However, as the number of layers and neurons in the network increases, the search space for parameter learning becomes complex. Currently, the most commonly used parameter training method is backpropagation (BP) based on gradient descent, but this method is sensitive to the initialization of the parameters and tends to get stuck in local optima in a complex search space. Therefore, a new training method for DNNs has been proposed that combines cooperative co-evolution (CC) with BP-based gradient descent, called BPCC. In the BPCC method, BP performs multiple training periods intermittently, and the CC algorithm is executed when the difference between the current loss function value and the previous loss function value is less than a given threshold (called a condition met). We found that the algorithm easily enters into CC iterations, which reduces the computational effectiveness of the algorithm. A tolerance parameter is designed to curb this phenomenon, and the CC is executed when the cumulative number of times the condition is met reaches the given value of the tolerance parameter, and the improved gray wolf optimizer (GWO) algorithm is used as the solver for the CC. In addition, in the CC iteration stage, the Chebyshev chaotic map series based on the current optimal point is used to initialize the population of GWO to ensure the diversity of the initial population. Experimental comparisons are made with modern network training methods in 7 network models, and the experimental results show that the improved algorithm in this study is competitive.</div></div>\",\"PeriodicalId\":50737,\"journal\":{\"name\":\"Applied Soft Computing\",\"volume\":\"181 \",\"pages\":\"Article 113453\"},\"PeriodicalIF\":6.6000,\"publicationDate\":\"2025-06-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Soft Computing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1568494625007641\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1568494625007641","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

深度神经网络（Deep neural network, dnn）具有强大的特征提取能力，可以应用于各个领域。然而，随着网络层数和神经元数量的增加，参数学习的搜索空间变得复杂。目前最常用的参数训练方法是基于梯度下降的反向传播（BP）方法，但该方法对参数的初始化比较敏感，在复杂的搜索空间中容易陷入局部最优。为此，提出了一种将协同进化（CC）与基于bp的梯度下降相结合的深度神经网络训练方法，称为BPCC。在BPCC方法中，BP间歇性地执行多个训练周期，当当前损失函数值与前一个损失函数值的差值小于给定阈值（称为满足条件）时，执行CC算法。我们发现该算法容易进入CC迭代，这降低了算法的计算效率。公差参数是为了遏制这一现象,执行和CC累积次数的条件满足时达到给定值的公差参数,和改进的灰狼优化器(拥有)算法的解算器CC。此外,在CC迭代阶段,切比雪夫混沌映射系列基于当前最优点是用来初始化的人口拥有确保初始种群的多样性。在7种网络模型中与现代网络训练方法进行了实验比较，实验结果表明本文改进的算法具有一定的竞争力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A neural network based on back-propagation and cooperative co-evolution

Deep neural networks (DNNs) have a powerful feature extraction capability, which allows them to be employed in various fields. However, as the number of layers and neurons in the network increases, the search space for parameter learning becomes complex. Currently, the most commonly used parameter training method is backpropagation (BP) based on gradient descent, but this method is sensitive to the initialization of the parameters and tends to get stuck in local optima in a complex search space. Therefore, a new training method for DNNs has been proposed that combines cooperative co-evolution (CC) with BP-based gradient descent, called BPCC. In the BPCC method, BP performs multiple training periods intermittently, and the CC algorithm is executed when the difference between the current loss function value and the previous loss function value is less than a given threshold (called a condition met). We found that the algorithm easily enters into CC iterations, which reduces the computational effectiveness of the algorithm. A tolerance parameter is designed to curb this phenomenon, and the CC is executed when the cumulative number of times the condition is met reaches the given value of the tolerance parameter, and the improved gray wolf optimizer (GWO) algorithm is used as the solver for the CC. In addition, in the CC iteration stage, the Chebyshev chaotic map series based on the current optimal point is used to initialize the population of GWO to ensure the diversity of the initial population. Experimental comparisons are made with modern network training methods in 7 network models, and the experimental results show that the improved algorithm in this study is competitive.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Applied Soft Computing 工程技术-计算机：跨学科应用

CiteScore

15.80

自引率

6.90%

发文量

874

审稿时长

10.9 months

期刊介绍： Applied Soft Computing is an international journal promoting an integrated view of soft computing to solve real life problems.The focus is to publish the highest quality research in application and convergence of the areas of Fuzzy Logic, Neural Networks, Evolutionary Computing, Rough Sets and other similar techniques to address real world complexities. Applied Soft Computing is a rolling publication: articles are published as soon as the editor-in-chief has accepted them. Therefore, the web site will continuously be updated with new articles and the publication time will be short.