Hui Liu , Congwen You , Lijin Han , Ningkang Yang , Baoshuai Liu
{"title":"Off-road hybrid electric vehicle energy management strategy using multi-agent soft actor-critic with collaborative-independent algorithm","authors":"Hui Liu , Congwen You , Lijin Han , Ningkang Yang , Baoshuai Liu","doi":"10.1016/j.energy.2025.136463","DOIUrl":null,"url":null,"abstract":"<div><div>Hybrid electric vehicles (HEVs) reduce carbon emissions and save energy, and hybrid energy storage system (HESS) consist of a battery and a supercapacitor which has high energy density and high power density. The HEV equipped with HESS performs better in off-road conditions than single energy storage source. However, its energy management requires multiple input and multiple output (MIMO) control. In this paper, a multi-agent soft actor-critic (MASAC) based energy management strategy (EMS) is proposed to solve the multi-objective optimizing problem considering fuel economy, maintaining state of charge (SOC) and reducing battery state of health (SOH) decay. MASAC based EMS has two advantages: 1) it decomposed the search space into two subspaces, improving the learning efficiency. 2) a novel collaborative-independent algorithm is proposed to allocate rewards among agents, thereby improving the learning stability. Thus, the optimal actions are efficiently and collaboratively learned by two agents, engine agent and HESS agent, showing better performance in multi-objective optimization. In the simulation, the proposed EMS is compared with dynamic programming (DP) and soft actor-critic (SAC) in both off-road driving cycle and standard driving cycles. Simulation results show that the proposed collaborative-independent algorithm enhances the learning efficiency and learning stability of MASAC, while improving the real-time performance of EMS. In off-road conditions, the equivalent fuel consumption of MASAC is slightly better than that of DP. The SOH decay of MASAC is only 20 % higher than DP, significantly outperforming SAC. Furthermore, MASAC demonstrates superior performance in three standard working cycles when compared with SAC.</div></div>","PeriodicalId":11647,"journal":{"name":"Energy","volume":"328 ","pages":"Article 136463"},"PeriodicalIF":9.0000,"publicationDate":"2025-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Energy","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S036054422502105X","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENERGY & FUELS","Score":null,"Total":0}
引用次数: 0
Abstract
Hybrid electric vehicles (HEVs) reduce carbon emissions and save energy, and hybrid energy storage system (HESS) consist of a battery and a supercapacitor which has high energy density and high power density. The HEV equipped with HESS performs better in off-road conditions than single energy storage source. However, its energy management requires multiple input and multiple output (MIMO) control. In this paper, a multi-agent soft actor-critic (MASAC) based energy management strategy (EMS) is proposed to solve the multi-objective optimizing problem considering fuel economy, maintaining state of charge (SOC) and reducing battery state of health (SOH) decay. MASAC based EMS has two advantages: 1) it decomposed the search space into two subspaces, improving the learning efficiency. 2) a novel collaborative-independent algorithm is proposed to allocate rewards among agents, thereby improving the learning stability. Thus, the optimal actions are efficiently and collaboratively learned by two agents, engine agent and HESS agent, showing better performance in multi-objective optimization. In the simulation, the proposed EMS is compared with dynamic programming (DP) and soft actor-critic (SAC) in both off-road driving cycle and standard driving cycles. Simulation results show that the proposed collaborative-independent algorithm enhances the learning efficiency and learning stability of MASAC, while improving the real-time performance of EMS. In off-road conditions, the equivalent fuel consumption of MASAC is slightly better than that of DP. The SOH decay of MASAC is only 20 % higher than DP, significantly outperforming SAC. Furthermore, MASAC demonstrates superior performance in three standard working cycles when compared with SAC.
期刊介绍:
Energy is a multidisciplinary, international journal that publishes research and analysis in the field of energy engineering. Our aim is to become a leading peer-reviewed platform and a trusted source of information for energy-related topics.
The journal covers a range of areas including mechanical engineering, thermal sciences, and energy analysis. We are particularly interested in research on energy modelling, prediction, integrated energy systems, planning, and management.
Additionally, we welcome papers on energy conservation, efficiency, biomass and bioenergy, renewable energy, electricity supply and demand, energy storage, buildings, and economic and policy issues. These topics should align with our broader multidisciplinary focus.