基于强化学习的电动卡车多目标速度优化方法，考虑到电池退化缓解问题

IF 8.3 1区工程技术 Q1 ECONOMICS

Transportation Research Part E-Logistics and Transportation Review Pub Date : 2025-02-01 DOI:10.1016/j.tre.2024.103885

Ruo Jia , Kun Gao , Shaohua Cui , Jing Chen , Jelena Andric

{"title":"基于强化学习的电动卡车多目标速度优化方法，考虑到电池退化缓解问题","authors":"Ruo Jia , Kun Gao , Shaohua Cui , Jing Chen , Jelena Andric","doi":"10.1016/j.tre.2024.103885","DOIUrl":null,"url":null,"abstract":"<div><div>Electrification of commercial vehicles for more sustainable logistic systems has been promoted in the past decades. This study proposes a deep reinforcement learning method for velocity optimization and battery degradation minimization during operation for battery-powered electric trucks (BETs), aiming to achieve a safe, efficient, and comfortable driving control policy for BETs. To obtain an optimal solution considering both calendar and cyclic battery degradation, Deep Deterministic Policy Gradient and Twin Delayed Deep Deterministic Policy Gradient (TD3) approaches are integrated within a simulation environment. To optimize overall BET velocity performance, a trade-off among safety, efficiency, comfort, and battery degradation is incorporated into the reward function of reinforcement learning using Mixture of Experts (MoE) model. The results indicate that the proposed TD3-MoE model achieves safe, efficient, and comfortable car-following control while optimizing total battery degradation. Specifically, the model achieves reductions in total battery capacity loss ranging from 2.4% to 8.3% at different states of charge (SoC) of battery compared to human-driven scenarios. Moreover, despite calendar battery degradation being inevitable, the cyclic battery degradation is effectively mitigated by 27.7% to 29.6% compared to the same SoCs in human-driving data. Furthermore, the TD3-MoE model achieves significant energy consumption reductions, ranging from 35.3% to 39.8% compared to real car-following trajectories.</div></div>","PeriodicalId":49418,"journal":{"name":"Transportation Research Part E-Logistics and Transportation Review","volume":"194 ","pages":"Article 103885"},"PeriodicalIF":8.3000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A multi-objective reinforcement learning-based velocity optimization approach for electric trucks considering battery degradation mitigation\",\"authors\":\"Ruo Jia , Kun Gao , Shaohua Cui , Jing Chen , Jelena Andric\",\"doi\":\"10.1016/j.tre.2024.103885\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Electrification of commercial vehicles for more sustainable logistic systems has been promoted in the past decades. This study proposes a deep reinforcement learning method for velocity optimization and battery degradation minimization during operation for battery-powered electric trucks (BETs), aiming to achieve a safe, efficient, and comfortable driving control policy for BETs. To obtain an optimal solution considering both calendar and cyclic battery degradation, Deep Deterministic Policy Gradient and Twin Delayed Deep Deterministic Policy Gradient (TD3) approaches are integrated within a simulation environment. To optimize overall BET velocity performance, a trade-off among safety, efficiency, comfort, and battery degradation is incorporated into the reward function of reinforcement learning using Mixture of Experts (MoE) model. The results indicate that the proposed TD3-MoE model achieves safe, efficient, and comfortable car-following control while optimizing total battery degradation. Specifically, the model achieves reductions in total battery capacity loss ranging from 2.4% to 8.3% at different states of charge (SoC) of battery compared to human-driven scenarios. Moreover, despite calendar battery degradation being inevitable, the cyclic battery degradation is effectively mitigated by 27.7% to 29.6% compared to the same SoCs in human-driving data. Furthermore, the TD3-MoE model achieves significant energy consumption reductions, ranging from 35.3% to 39.8% compared to real car-following trajectories.</div></div>\",\"PeriodicalId\":49418,\"journal\":{\"name\":\"Transportation Research Part E-Logistics and Transportation Review\",\"volume\":\"194 \",\"pages\":\"Article 103885\"},\"PeriodicalIF\":8.3000,\"publicationDate\":\"2025-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Transportation Research Part E-Logistics and Transportation Review\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1366554524004769\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ECONOMICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transportation Research Part E-Logistics and Transportation Review","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1366554524004769","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECONOMICS","Score":null,"Total":0}

引用次数: 0

摘要

在过去的几十年里，商用车的电气化一直在推动更可持续的物流系统。本研究提出了一种基于深度强化学习的电动卡车运行速度优化和电池退化最小化方法，旨在实现电动卡车安全、高效、舒适的驾驶控制策略。为了获得考虑日历和循环电池退化的最优解，在仿真环境中集成了深度确定性策略梯度和双延迟深度确定性策略梯度（TD3）方法。为了优化整体BET速度性能，使用混合专家（MoE）模型将安全性、效率、舒适性和电池退化之间的权衡纳入强化学习的奖励函数中。结果表明，TD3-MoE模型在优化电池总退化的同时，实现了安全、高效、舒适的跟车控制。具体而言，与人为驱动的场景相比，该模型在电池不同充电状态（SoC）下实现了电池总容量损失减少2.4%至8.3%。此外，尽管日历电池退化是不可避免的，但与人类驾驶数据中相同的soc相比，循环电池退化有效缓解了27.7%至29.6%。此外，与实际车辆跟随轨迹相比，TD3-MoE模型实现了显著的能耗降低，降幅为35.3%至39.8%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A multi-objective reinforcement learning-based velocity optimization approach for electric trucks considering battery degradation mitigation

Electrification of commercial vehicles for more sustainable logistic systems has been promoted in the past decades. This study proposes a deep reinforcement learning method for velocity optimization and battery degradation minimization during operation for battery-powered electric trucks (BETs), aiming to achieve a safe, efficient, and comfortable driving control policy for BETs. To obtain an optimal solution considering both calendar and cyclic battery degradation, Deep Deterministic Policy Gradient and Twin Delayed Deep Deterministic Policy Gradient (TD3) approaches are integrated within a simulation environment. To optimize overall BET velocity performance, a trade-off among safety, efficiency, comfort, and battery degradation is incorporated into the reward function of reinforcement learning using Mixture of Experts (MoE) model. The results indicate that the proposed TD3-MoE model achieves safe, efficient, and comfortable car-following control while optimizing total battery degradation. Specifically, the model achieves reductions in total battery capacity loss ranging from 2.4% to 8.3% at different states of charge (SoC) of battery compared to human-driven scenarios. Moreover, despite calendar battery degradation being inevitable, the cyclic battery degradation is effectively mitigated by 27.7% to 29.6% compared to the same SoCs in human-driving data. Furthermore, the TD3-MoE model achieves significant energy consumption reductions, ranging from 35.3% to 39.8% compared to real car-following trajectories.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Transportation Research Part E-Logistics and Transportation Review 工程技术-工程：土木

CiteScore

16.20

自引率

16.00%

发文量

285

审稿时长

62 days

期刊介绍： Transportation Research Part E: Logistics and Transportation Review is a reputable journal that publishes high-quality articles covering a wide range of topics in the field of logistics and transportation research. The journal welcomes submissions on various subjects, including transport economics, transport infrastructure and investment appraisal, evaluation of public policies related to transportation, empirical and analytical studies of logistics management practices and performance, logistics and operations models, and logistics and supply chain management. Part E aims to provide informative and well-researched articles that contribute to the understanding and advancement of the field. The content of the journal is complementary to other prestigious journals in transportation research, such as Transportation Research Part A: Policy and Practice, Part B: Methodological, Part C: Emerging Technologies, Part D: Transport and Environment, and Part F: Traffic Psychology and Behaviour. Together, these journals form a comprehensive and cohesive reference for current research in transportation science.