Cooperative energy management and fuel cell thermal management of fuel cell hybrid electric buses via multi-agent reinforcement learning

IF 9.4 1区 工程技术 Q1 ENERGY & FUELS
Sichen Gao , Fei Ju , Weichao Zhuang , Qun Wang , Yuhua Zong , Liangmo Wang
{"title":"Cooperative energy management and fuel cell thermal management of fuel cell hybrid electric buses via multi-agent reinforcement learning","authors":"Sichen Gao ,&nbsp;Fei Ju ,&nbsp;Weichao Zhuang ,&nbsp;Qun Wang ,&nbsp;Yuhua Zong ,&nbsp;Liangmo Wang","doi":"10.1016/j.energy.2025.138534","DOIUrl":null,"url":null,"abstract":"<div><div>This study proposes a collaborative control strategy based on multi-agent reinforcement learning (MARL) to tackle the dynamic coupling optimization challenge between the energy management strategy (EMS) and the fuel cell thermal management strategy (FCTMS) in hydrogen fuel cell hybrid electric buses. Specifically, the EMS adopts a multi-objective equivalent consumption minimization strategy (ECMS), incorporating penalty terms for powertrain state-of-health (SOH) degradation and temperature violations. To maintain battery state-of-charge (SOC), this multi-objective ECMS employs the twin delayed deep deterministic policy gradient (TD3) algorithm to dynamically adjust its equivalent factor (EF). Meanwhile, the FCTMS utilizes fuel cell power as one of the state information to guide another TD3 agent in synchronously optimizing coolant flow rate and radiator air flow rate, ensuring optimal operating temperature. Results show that, compared with a hierarchical control strategy integrating ECMS (neglecting powertrain durability and thermal characteristics) and PID-based FCTMS, the proposed strategy significantly reduces driving cost, fuel cell SOH degradation, battery SOH degradation, fuel cell temperature violations, fuel cell outlet–inlet temperature difference violations and battery temperature violations by 5.30%, 3.86%, 10.62%, 99.02%, 47.68% and 29.33%, respectively. Moreover, further investigation demonstrates that the proposed strategy exhibits superior adaptability to unknown driving scenarios.</div></div>","PeriodicalId":11647,"journal":{"name":"Energy","volume":"337 ","pages":"Article 138534"},"PeriodicalIF":9.4000,"publicationDate":"2025-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Energy","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0360544225041763","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENERGY & FUELS","Score":null,"Total":0}
引用次数: 0

Abstract

This study proposes a collaborative control strategy based on multi-agent reinforcement learning (MARL) to tackle the dynamic coupling optimization challenge between the energy management strategy (EMS) and the fuel cell thermal management strategy (FCTMS) in hydrogen fuel cell hybrid electric buses. Specifically, the EMS adopts a multi-objective equivalent consumption minimization strategy (ECMS), incorporating penalty terms for powertrain state-of-health (SOH) degradation and temperature violations. To maintain battery state-of-charge (SOC), this multi-objective ECMS employs the twin delayed deep deterministic policy gradient (TD3) algorithm to dynamically adjust its equivalent factor (EF). Meanwhile, the FCTMS utilizes fuel cell power as one of the state information to guide another TD3 agent in synchronously optimizing coolant flow rate and radiator air flow rate, ensuring optimal operating temperature. Results show that, compared with a hierarchical control strategy integrating ECMS (neglecting powertrain durability and thermal characteristics) and PID-based FCTMS, the proposed strategy significantly reduces driving cost, fuel cell SOH degradation, battery SOH degradation, fuel cell temperature violations, fuel cell outlet–inlet temperature difference violations and battery temperature violations by 5.30%, 3.86%, 10.62%, 99.02%, 47.68% and 29.33%, respectively. Moreover, further investigation demonstrates that the proposed strategy exhibits superior adaptability to unknown driving scenarios.
基于多智能体强化学习的燃料电池混合动力客车协同能量管理和燃料电池热管理
针对氢燃料电池混合动力客车能量管理策略(EMS)与燃料电池热管理策略(FCTMS)之间的动态耦合优化问题,提出了一种基于多智能体强化学习(MARL)的协同控制策略。具体而言,EMS采用多目标等效消耗最小化策略(ECMS),将动力系统健康状态(SOH)退化和温度违规的惩罚条款纳入其中。为了维持电池荷电状态(SOC),该多目标ECMS采用双延迟深度确定性策略梯度(TD3)算法动态调整等效因子(EF)。同时,FCTMS利用燃料电池功率作为状态信息之一,引导另一种TD3剂同步优化冷却剂流量和散热器空气流量,确保最佳工作温度。结果表明,与集成ECMS(忽略动力系统耐久性和热特性)和基于pid的FCTMS的分级控制策略相比,该策略显著降低了驾驶成本、燃料电池SOH降解、电池SOH降解、燃料电池温度违规、燃料电池进出口温差违规和电池温度违规,分别降低了5.30%、3.86%、10.62%、99.02%、47.68%和29.33%。此外,进一步的研究表明,该策略对未知驾驶场景具有较好的适应性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Energy
Energy 工程技术-能源与燃料
CiteScore
15.30
自引率
14.40%
发文量
0
审稿时长
14.2 weeks
期刊介绍: Energy is a multidisciplinary, international journal that publishes research and analysis in the field of energy engineering. Our aim is to become a leading peer-reviewed platform and a trusted source of information for energy-related topics. The journal covers a range of areas including mechanical engineering, thermal sciences, and energy analysis. We are particularly interested in research on energy modelling, prediction, integrated energy systems, planning, and management. Additionally, we welcome papers on energy conservation, efficiency, biomass and bioenergy, renewable energy, electricity supply and demand, energy storage, buildings, and economic and policy issues. These topics should align with our broader multidisciplinary focus.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信