Rui Zhang , Yuanyuan Zhang , Chaoli Sun , Yanjun Zhang , Zehua Dong , Xiaobing Wang
{"title":"基于代理模型的深度卷积神经网络中高维超参数的高效配置","authors":"Rui Zhang , Yuanyuan Zhang , Chaoli Sun , Yanjun Zhang , Zehua Dong , Xiaobing Wang","doi":"10.1016/j.swevo.2025.101940","DOIUrl":null,"url":null,"abstract":"<div><div>The rationality of hyperparameter configuration in deep convolutional neural networks for classification directly determines its performance. It is challenging to reduce high computational costs effectively and guarantee performance in the configuration of high-dimensional hyperparameters in deep convolutional neural networks for classification. This paper proposes an efficient configuration method that concerns high-dimensional hyperparameters in deep convolutional neural networks for classification assisted by surrogate models. By designing a progressive accumulation dropout neural network surrogate model (PA-Dropout), the contribution of hyperparameters configurations to multi-performance objectives is dynamically measured and then the contribution is iteratively screened. As a result, the fitting efficiency of the PA-Dropout to the relationship between high-dimensional hyperparametric configurations and the multi-objective performance in deep convolutional neural networks for classification with scarce data is improved. A dual-drive interactive dynamic model management strategy (DDIDMMS) is designed, considering the comprehensive evaluation and adaptive weighting calculation of convergence diversity of high-dimensional hyperparametric configuration individuals. Reliable candidate solutions are provided for real evaluation, thereby improving the update efficiency of PA-Dropout. Finally, an efficient configuration of high-dimensional hyperparameters in deep convolutional neural networks for classification is realized. By using DTLZ and WFG benchmark problems with up to 100 decision variables and 20 targets, as well as practical classification tasks, the superiority and generalization of this method are verified when solving the expensive multi-objective optimization problem of CNN high-dimensional hyperparameter configuration for classification tasks.</div></div>","PeriodicalId":48682,"journal":{"name":"Swarm and Evolutionary Computation","volume":"95 ","pages":"Article 101940"},"PeriodicalIF":8.2000,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Efficient configuration of high-dimensional hyperparameters in deep convolutional neural networks for classification assisted by surrogate models\",\"authors\":\"Rui Zhang , Yuanyuan Zhang , Chaoli Sun , Yanjun Zhang , Zehua Dong , Xiaobing Wang\",\"doi\":\"10.1016/j.swevo.2025.101940\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The rationality of hyperparameter configuration in deep convolutional neural networks for classification directly determines its performance. It is challenging to reduce high computational costs effectively and guarantee performance in the configuration of high-dimensional hyperparameters in deep convolutional neural networks for classification. This paper proposes an efficient configuration method that concerns high-dimensional hyperparameters in deep convolutional neural networks for classification assisted by surrogate models. By designing a progressive accumulation dropout neural network surrogate model (PA-Dropout), the contribution of hyperparameters configurations to multi-performance objectives is dynamically measured and then the contribution is iteratively screened. As a result, the fitting efficiency of the PA-Dropout to the relationship between high-dimensional hyperparametric configurations and the multi-objective performance in deep convolutional neural networks for classification with scarce data is improved. A dual-drive interactive dynamic model management strategy (DDIDMMS) is designed, considering the comprehensive evaluation and adaptive weighting calculation of convergence diversity of high-dimensional hyperparametric configuration individuals. Reliable candidate solutions are provided for real evaluation, thereby improving the update efficiency of PA-Dropout. Finally, an efficient configuration of high-dimensional hyperparameters in deep convolutional neural networks for classification is realized. By using DTLZ and WFG benchmark problems with up to 100 decision variables and 20 targets, as well as practical classification tasks, the superiority and generalization of this method are verified when solving the expensive multi-objective optimization problem of CNN high-dimensional hyperparameter configuration for classification tasks.</div></div>\",\"PeriodicalId\":48682,\"journal\":{\"name\":\"Swarm and Evolutionary Computation\",\"volume\":\"95 \",\"pages\":\"Article 101940\"},\"PeriodicalIF\":8.2000,\"publicationDate\":\"2025-04-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Swarm and Evolutionary Computation\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2210650225000987\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Swarm and Evolutionary Computation","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2210650225000987","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Efficient configuration of high-dimensional hyperparameters in deep convolutional neural networks for classification assisted by surrogate models
The rationality of hyperparameter configuration in deep convolutional neural networks for classification directly determines its performance. It is challenging to reduce high computational costs effectively and guarantee performance in the configuration of high-dimensional hyperparameters in deep convolutional neural networks for classification. This paper proposes an efficient configuration method that concerns high-dimensional hyperparameters in deep convolutional neural networks for classification assisted by surrogate models. By designing a progressive accumulation dropout neural network surrogate model (PA-Dropout), the contribution of hyperparameters configurations to multi-performance objectives is dynamically measured and then the contribution is iteratively screened. As a result, the fitting efficiency of the PA-Dropout to the relationship between high-dimensional hyperparametric configurations and the multi-objective performance in deep convolutional neural networks for classification with scarce data is improved. A dual-drive interactive dynamic model management strategy (DDIDMMS) is designed, considering the comprehensive evaluation and adaptive weighting calculation of convergence diversity of high-dimensional hyperparametric configuration individuals. Reliable candidate solutions are provided for real evaluation, thereby improving the update efficiency of PA-Dropout. Finally, an efficient configuration of high-dimensional hyperparameters in deep convolutional neural networks for classification is realized. By using DTLZ and WFG benchmark problems with up to 100 decision variables and 20 targets, as well as practical classification tasks, the superiority and generalization of this method are verified when solving the expensive multi-objective optimization problem of CNN high-dimensional hyperparameter configuration for classification tasks.
期刊介绍:
Swarm and Evolutionary Computation is a pioneering peer-reviewed journal focused on the latest research and advancements in nature-inspired intelligent computation using swarm and evolutionary algorithms. It covers theoretical, experimental, and practical aspects of these paradigms and their hybrids, promoting interdisciplinary research. The journal prioritizes the publication of high-quality, original articles that push the boundaries of evolutionary computation and swarm intelligence. Additionally, it welcomes survey papers on current topics and novel applications. Topics of interest include but are not limited to: Genetic Algorithms, and Genetic Programming, Evolution Strategies, and Evolutionary Programming, Differential Evolution, Artificial Immune Systems, Particle Swarms, Ant Colony, Bacterial Foraging, Artificial Bees, Fireflies Algorithm, Harmony Search, Artificial Life, Digital Organisms, Estimation of Distribution Algorithms, Stochastic Diffusion Search, Quantum Computing, Nano Computing, Membrane Computing, Human-centric Computing, Hybridization of Algorithms, Memetic Computing, Autonomic Computing, Self-organizing systems, Combinatorial, Discrete, Binary, Constrained, Multi-objective, Multi-modal, Dynamic, and Large-scale Optimization.