{"title":"复杂工业过程数据特征选择与预测模型的多策略融合二元斑马优化算法","authors":"Yi-Peng Shang-Guan, Jie-Sheng Wang, Yong-Cheng Sun, Yu-Wei Song, Yu-Liang Qi","doi":"10.1002/cpe.70143","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>In the process of modern industrial production, soft sensing technology is often used to predict the target variables that are difficult to be directly measured by hard instruments. However, the input variables used for prediction are not all closely related to the output. Feature selection (FS) aims to select the features highly related to the target variables and discard the redundant features. How to select the optimal feature subset and reduce the operation cost while ensuring the prediction accuracy becomes a key problem. A multi-strategy fusion binary Zebra optimization algorithm (MFBZOA) was proposed to select the optimal feature subset in a prediction model. Firstly, the Cauchy inverse cumulative distribution function is used to mutate individual positions in the defense stage of the zebra optimization algorithm (ZOA), and then reproductive behavior is introduced into the algorithm to increase the diversity of the solution set and improve the overall quality of the solution. Finally, the Cauchy mutation strategy is introduced to disturb the worst individuals in the zebra population and increase the probability of jumping out of the local optimum. Firstly, the proposed improved ZOA is combined with the Zebra optimization algorithm, Golden sine algorithm, Whale optimization algorithm, Frilled lizard optimization, Human evolution optimization algorithm, Coatis optimization algorithm, and Goose optimization algorithm to perform CEC2022 function optimization simulation experiments to verify its effectiveness. Then, MFBZOA and the above comparison algorithms are used as search strategies respectively, combined with the wrapper FS method driven by a multi-layer perceptron to solve the FS problem of four industrial process data and build the corresponding prediction model. Then, the optimal feature subset selected by each algorithm is used in the prediction experiment. The simulation results show that MFBZOA can effectively select the optimal feature subset, improve the global search ability and local search ability, and maintain good prediction accuracy and generalization performance.</p>\n </div>","PeriodicalId":55214,"journal":{"name":"Concurrency and Computation-Practice & Experience","volume":"37 15-17","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2025-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multi-Strategy Fusion Binary Zebra Optimization Algorithm for Solving Complex Industrial Process Data Feature Selection and Prediction Models\",\"authors\":\"Yi-Peng Shang-Guan, Jie-Sheng Wang, Yong-Cheng Sun, Yu-Wei Song, Yu-Liang Qi\",\"doi\":\"10.1002/cpe.70143\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n <p>In the process of modern industrial production, soft sensing technology is often used to predict the target variables that are difficult to be directly measured by hard instruments. However, the input variables used for prediction are not all closely related to the output. Feature selection (FS) aims to select the features highly related to the target variables and discard the redundant features. How to select the optimal feature subset and reduce the operation cost while ensuring the prediction accuracy becomes a key problem. A multi-strategy fusion binary Zebra optimization algorithm (MFBZOA) was proposed to select the optimal feature subset in a prediction model. Firstly, the Cauchy inverse cumulative distribution function is used to mutate individual positions in the defense stage of the zebra optimization algorithm (ZOA), and then reproductive behavior is introduced into the algorithm to increase the diversity of the solution set and improve the overall quality of the solution. Finally, the Cauchy mutation strategy is introduced to disturb the worst individuals in the zebra population and increase the probability of jumping out of the local optimum. Firstly, the proposed improved ZOA is combined with the Zebra optimization algorithm, Golden sine algorithm, Whale optimization algorithm, Frilled lizard optimization, Human evolution optimization algorithm, Coatis optimization algorithm, and Goose optimization algorithm to perform CEC2022 function optimization simulation experiments to verify its effectiveness. Then, MFBZOA and the above comparison algorithms are used as search strategies respectively, combined with the wrapper FS method driven by a multi-layer perceptron to solve the FS problem of four industrial process data and build the corresponding prediction model. Then, the optimal feature subset selected by each algorithm is used in the prediction experiment. The simulation results show that MFBZOA can effectively select the optimal feature subset, improve the global search ability and local search ability, and maintain good prediction accuracy and generalization performance.</p>\\n </div>\",\"PeriodicalId\":55214,\"journal\":{\"name\":\"Concurrency and Computation-Practice & Experience\",\"volume\":\"37 15-17\",\"pages\":\"\"},\"PeriodicalIF\":1.5000,\"publicationDate\":\"2025-06-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Concurrency and Computation-Practice & Experience\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/cpe.70143\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, SOFTWARE ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Concurrency and Computation-Practice & Experience","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cpe.70143","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
Multi-Strategy Fusion Binary Zebra Optimization Algorithm for Solving Complex Industrial Process Data Feature Selection and Prediction Models
In the process of modern industrial production, soft sensing technology is often used to predict the target variables that are difficult to be directly measured by hard instruments. However, the input variables used for prediction are not all closely related to the output. Feature selection (FS) aims to select the features highly related to the target variables and discard the redundant features. How to select the optimal feature subset and reduce the operation cost while ensuring the prediction accuracy becomes a key problem. A multi-strategy fusion binary Zebra optimization algorithm (MFBZOA) was proposed to select the optimal feature subset in a prediction model. Firstly, the Cauchy inverse cumulative distribution function is used to mutate individual positions in the defense stage of the zebra optimization algorithm (ZOA), and then reproductive behavior is introduced into the algorithm to increase the diversity of the solution set and improve the overall quality of the solution. Finally, the Cauchy mutation strategy is introduced to disturb the worst individuals in the zebra population and increase the probability of jumping out of the local optimum. Firstly, the proposed improved ZOA is combined with the Zebra optimization algorithm, Golden sine algorithm, Whale optimization algorithm, Frilled lizard optimization, Human evolution optimization algorithm, Coatis optimization algorithm, and Goose optimization algorithm to perform CEC2022 function optimization simulation experiments to verify its effectiveness. Then, MFBZOA and the above comparison algorithms are used as search strategies respectively, combined with the wrapper FS method driven by a multi-layer perceptron to solve the FS problem of four industrial process data and build the corresponding prediction model. Then, the optimal feature subset selected by each algorithm is used in the prediction experiment. The simulation results show that MFBZOA can effectively select the optimal feature subset, improve the global search ability and local search ability, and maintain good prediction accuracy and generalization performance.
期刊介绍:
Concurrency and Computation: Practice and Experience (CCPE) publishes high-quality, original research papers, and authoritative research review papers, in the overlapping fields of:
Parallel and distributed computing;
High-performance computing;
Computational and data science;
Artificial intelligence and machine learning;
Big data applications, algorithms, and systems;
Network science;
Ontologies and semantics;
Security and privacy;
Cloud/edge/fog computing;
Green computing; and
Quantum computing.