{"title":"HMAMRL: Multicriterion Flexible Coordinated Control for Coal-Fired Power Generation Systems under Wide Load Operation.","authors":"Xiaomin Liu,Mengjun Yu,Chunyu Yang,Haoyu Wang,Linna Zhou,Huaichun Zhou","doi":"10.1109/tcyb.2025.3610512","DOIUrl":null,"url":null,"abstract":"Flexible and efficient wide-load tracking in coal-fired power generation systems (CPGSs) is crucial for integrating renewable energy. To address the challenges arising from the dynamic characteristics and task distribution differences during the wide-load operation of thermal power units, this article proposes a novel hierarchical model-agnostic meta reinforcement learning (HMAMRL) framework. This framework combines inner meta-learning for quick adaptation within task categories and outer meta-learning for sharing general task knowledge, ensuring robust generalization under different load conditions. Meanwhile, an adaptive multicriterion reward function design method is proposed to dynamically balance load tracking costs, coal consumption costs, and input fluctuation costs. Moreover, a truncated proximal policy optimization (TPPO) algorithm ensures precise load control within physical constraints. Experimental results on the 160 and 1000 MW CPGSs demonstrate the effectiveness and superiority of the proposed algorithm.","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"16 1","pages":""},"PeriodicalIF":10.5000,"publicationDate":"2025-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Cybernetics","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1109/tcyb.2025.3610512","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Flexible and efficient wide-load tracking in coal-fired power generation systems (CPGSs) is crucial for integrating renewable energy. To address the challenges arising from the dynamic characteristics and task distribution differences during the wide-load operation of thermal power units, this article proposes a novel hierarchical model-agnostic meta reinforcement learning (HMAMRL) framework. This framework combines inner meta-learning for quick adaptation within task categories and outer meta-learning for sharing general task knowledge, ensuring robust generalization under different load conditions. Meanwhile, an adaptive multicriterion reward function design method is proposed to dynamically balance load tracking costs, coal consumption costs, and input fluctuation costs. Moreover, a truncated proximal policy optimization (TPPO) algorithm ensures precise load control within physical constraints. Experimental results on the 160 and 1000 MW CPGSs demonstrate the effectiveness and superiority of the proposed algorithm.
期刊介绍:
The scope of the IEEE Transactions on Cybernetics includes computational approaches to the field of cybernetics. Specifically, the transactions welcomes papers on communication and control across machines or machine, human, and organizations. The scope includes such areas as computational intelligence, computer vision, neural networks, genetic algorithms, machine learning, fuzzy systems, cognitive systems, decision making, and robotics, to the extent that they contribute to the theme of cybernetics or demonstrate an application of cybernetics principles.