{"title":"基于深度强化学习的全连续控制混合动力客车电池热意识能量管理","authors":"Zhongbao Wei, Haokai Ruan, Hongwen He","doi":"10.1109/ITEC51675.2021.9490073","DOIUrl":null,"url":null,"abstract":"This paper proposes a knowledge-based, thermal-conscious strategy for the energy management of hybrid electric bus (HEB). The deep deterministic policy gradient (DDPG) algorithm with priority experience replay (PER) is exploited to distribute the power smartly among energy components. The fully-continuous separate speed- and torque-control mechanism is further devised to excavate the upper optimization potential of PER-DDPG strategy. Moreover, in the PER-DDPG framework, the penalties to over-temperature are embedded for thermal safety enforcement. Comparative results also disclose the superiority of the proposed strategy in terms of the over-temperature protection and overall optimization performance in the energy management of HEB.","PeriodicalId":339989,"journal":{"name":"2021 IEEE Transportation Electrification Conference & Expo (ITEC)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Battery Thermal-conscious Energy Management for Hybrid Electric Bus Based on Fully-continuous Control with Deep Reinforcement Learning\",\"authors\":\"Zhongbao Wei, Haokai Ruan, Hongwen He\",\"doi\":\"10.1109/ITEC51675.2021.9490073\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper proposes a knowledge-based, thermal-conscious strategy for the energy management of hybrid electric bus (HEB). The deep deterministic policy gradient (DDPG) algorithm with priority experience replay (PER) is exploited to distribute the power smartly among energy components. The fully-continuous separate speed- and torque-control mechanism is further devised to excavate the upper optimization potential of PER-DDPG strategy. Moreover, in the PER-DDPG framework, the penalties to over-temperature are embedded for thermal safety enforcement. Comparative results also disclose the superiority of the proposed strategy in terms of the over-temperature protection and overall optimization performance in the energy management of HEB.\",\"PeriodicalId\":339989,\"journal\":{\"name\":\"2021 IEEE Transportation Electrification Conference & Expo (ITEC)\",\"volume\":\"12 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-06-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE Transportation Electrification Conference & Expo (ITEC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ITEC51675.2021.9490073\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE Transportation Electrification Conference & Expo (ITEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITEC51675.2021.9490073","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Battery Thermal-conscious Energy Management for Hybrid Electric Bus Based on Fully-continuous Control with Deep Reinforcement Learning
This paper proposes a knowledge-based, thermal-conscious strategy for the energy management of hybrid electric bus (HEB). The deep deterministic policy gradient (DDPG) algorithm with priority experience replay (PER) is exploited to distribute the power smartly among energy components. The fully-continuous separate speed- and torque-control mechanism is further devised to excavate the upper optimization potential of PER-DDPG strategy. Moreover, in the PER-DDPG framework, the penalties to over-temperature are embedded for thermal safety enforcement. Comparative results also disclose the superiority of the proposed strategy in terms of the over-temperature protection and overall optimization performance in the energy management of HEB.