Environment-Aware Reinforcement Learning-Based Energy Consumption Prediction Model for Electric Vehicles

IF 8.4 1区工程技术 Q1 ENGINEERING, CIVIL

IEEE Transactions on Intelligent Transportation Systems Pub Date : 2026-03-01 Epub Date: 2026-01-22 DOI:10.1109/TITS.2026.3654673

Ziran Peng;Xiaoyang Yang;Zhenyu He

{"title":"Environment-Aware Reinforcement Learning-Based Energy Consumption Prediction Model for Electric Vehicles","authors":"Ziran Peng;Xiaoyang Yang;Zhenyu He","doi":"10.1109/TITS.2026.3654673","DOIUrl":null,"url":null,"abstract":"As electric vehicles (EVs) gain growing popularity worldwide, demands for real-time and precise energy-consuming prediction have increased correspondingly. Targeting at limitations of existing models in environmental perception and dynamic calibration, this research put forward a novel model for energy-consuming prediction. This model integrated environmental perception with reinforcement learning. Specifically, at first, a road-condition perceiving approach deeply coupled with reinforcement learning was designed, and a linear multi-scale attention encoder was constructed. The aim was to extract multi-granularity environmental features related to energy efficiency and thus enhance the model’s representational capabilities under complicated dynamic driving situations. Second, a real-time energy efficiency estimation model was developed under a Markov decision process, which was also mapped to the reinforcement learning framework. Based on temporal-difference learning, the data-driven Q function was iteratively updated, and constant calibration of energy estimation was realized. Finally, a prioritization mechanism for causal-structure-based Kullback–Leibler (KL) divergence scenarios was proposed to enhance the sampling efficiency in cases of critical incidents such as slope variations and abrupt speed-accelerating/decelerating, while strengthening the robustness and generalization of the model under complicated conditions. Results confirmed the superior stability and robustness of the proposed approach across multiple operating conditions and vehicle types. Specifically, the mean absolute error (MAE) was below 12%; the root mean-squared error (RMSE) exhibited a value under 1.8%; and the R<sup>2</sup> value exceeded 99.5%. All these demonstrated its significantly improved efficiency over Transformer, Informer, Mamba, and long short-term memory (LSTM) models. EVs’ actual energy consumption in the real world was also compared with that in speed profile (EVECS) dataset, presenting an MAE below 1.15% and a RMSE under 1.65%, which further verified its excellent generalization.","PeriodicalId":13416,"journal":{"name":"IEEE Transactions on Intelligent Transportation Systems","volume":"27 5","pages":"6147-6159"},"PeriodicalIF":8.4000,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Intelligent Transportation Systems","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/11361373/","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2026/1/22 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}

引用次数: 0

Abstract

As electric vehicles (EVs) gain growing popularity worldwide, demands for real-time and precise energy-consuming prediction have increased correspondingly. Targeting at limitations of existing models in environmental perception and dynamic calibration, this research put forward a novel model for energy-consuming prediction. This model integrated environmental perception with reinforcement learning. Specifically, at first, a road-condition perceiving approach deeply coupled with reinforcement learning was designed, and a linear multi-scale attention encoder was constructed. The aim was to extract multi-granularity environmental features related to energy efficiency and thus enhance the model’s representational capabilities under complicated dynamic driving situations. Second, a real-time energy efficiency estimation model was developed under a Markov decision process, which was also mapped to the reinforcement learning framework. Based on temporal-difference learning, the data-driven Q function was iteratively updated, and constant calibration of energy estimation was realized. Finally, a prioritization mechanism for causal-structure-based Kullback–Leibler (KL) divergence scenarios was proposed to enhance the sampling efficiency in cases of critical incidents such as slope variations and abrupt speed-accelerating/decelerating, while strengthening the robustness and generalization of the model under complicated conditions. Results confirmed the superior stability and robustness of the proposed approach across multiple operating conditions and vehicle types. Specifically, the mean absolute error (MAE) was below 12%; the root mean-squared error (RMSE) exhibited a value under 1.8%; and the R² value exceeded 99.5%. All these demonstrated its significantly improved efficiency over Transformer, Informer, Mamba, and long short-term memory (LSTM) models. EVs’ actual energy consumption in the real world was also compared with that in speed profile (EVECS) dataset, presenting an MAE below 1.15% and a RMSE under 1.65%, which further verified its excellent generalization.

查看原文本刊更多论文

基于环境感知强化学习的电动汽车能耗预测模型

随着电动汽车在全球范围内的日益普及，对实时、精确的能耗预测的需求也相应增加。针对现有模型在环境感知和动态校准方面的局限性，提出了一种新的能耗预测模型。该模型将环境感知与强化学习相结合。具体而言，首先设计了一种与强化学习深度耦合的路况感知方法，并构建了线性多尺度注意编码器；目的是提取与能源效率相关的多粒度环境特征，从而增强模型在复杂动态驾驶情况下的表征能力。其次，建立了马尔可夫决策过程下的实时能源效率估计模型，并将其映射到强化学习框架中。基于时变差分学习，迭代更新数据驱动的Q函数，实现能量估计的不断标定。最后，提出了基于因果结构的Kullback-Leibler （KL）发散情景的优先排序机制，以提高坡度变化和突然加速/减速等关键事件下的采样效率，同时增强模型在复杂条件下的鲁棒性和泛化性。结果证实了该方法在多种工况和车辆类型中具有优异的稳定性和鲁棒性。具体而言，平均绝对误差（MAE）低于12%；均方根误差（RMSE）小于1.8%；R2值超过99.5%。所有这些都表明它比Transformer、Informer、Mamba和长短期记忆（LSTM）模型显著提高了效率。将ev在真实世界中的实际能耗与EVECS数据集进行比较，MAE低于1.15%，RMSE低于1.65%，进一步验证了其良好的泛化性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Transactions on Intelligent Transportation Systems 工程技术-工程：电子与电气

CiteScore

14.80

自引率

12.90%

发文量

1872

审稿时长

7.5 months

期刊介绍： The theoretical, experimental and operational aspects of electrical and electronics engineering and information technologies as applied to Intelligent Transportation Systems (ITS). Intelligent Transportation Systems are defined as those systems utilizing synergistic technologies and systems engineering concepts to develop and improve transportation systems of all kinds. The scope of this interdisciplinary activity includes the promotion, consolidation and coordination of ITS technical activities among IEEE entities, and providing a focus for cooperative activities, both internally and externally.