{"title":"Environment-Aware Reinforcement Learning-Based Energy Consumption Prediction Model for Electric Vehicles","authors":"Ziran Peng;Xiaoyang Yang;Zhenyu He","doi":"10.1109/TITS.2026.3654673","DOIUrl":null,"url":null,"abstract":"As electric vehicles (EVs) gain growing popularity worldwide, demands for real-time and precise energy-consuming prediction have increased correspondingly. Targeting at limitations of existing models in environmental perception and dynamic calibration, this research put forward a novel model for energy-consuming prediction. This model integrated environmental perception with reinforcement learning. Specifically, at first, a road-condition perceiving approach deeply coupled with reinforcement learning was designed, and a linear multi-scale attention encoder was constructed. The aim was to extract multi-granularity environmental features related to energy efficiency and thus enhance the model’s representational capabilities under complicated dynamic driving situations. Second, a real-time energy efficiency estimation model was developed under a Markov decision process, which was also mapped to the reinforcement learning framework. Based on temporal-difference learning, the data-driven Q function was iteratively updated, and constant calibration of energy estimation was realized. Finally, a prioritization mechanism for causal-structure-based Kullback–Leibler (KL) divergence scenarios was proposed to enhance the sampling efficiency in cases of critical incidents such as slope variations and abrupt speed-accelerating/decelerating, while strengthening the robustness and generalization of the model under complicated conditions. Results confirmed the superior stability and robustness of the proposed approach across multiple operating conditions and vehicle types. Specifically, the mean absolute error (MAE) was below 12%; the root mean-squared error (RMSE) exhibited a value under 1.8%; and the R<sup>2</sup> value exceeded 99.5%. All these demonstrated its significantly improved efficiency over Transformer, Informer, Mamba, and long short-term memory (LSTM) models. EVs’ actual energy consumption in the real world was also compared with that in speed profile (EVECS) dataset, presenting an MAE below 1.15% and a RMSE under 1.65%, which further verified its excellent generalization.","PeriodicalId":13416,"journal":{"name":"IEEE Transactions on Intelligent Transportation Systems","volume":"27 5","pages":"6147-6159"},"PeriodicalIF":8.4000,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Intelligent Transportation Systems","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/11361373/","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2026/1/22 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
引用次数: 0
Abstract
As electric vehicles (EVs) gain growing popularity worldwide, demands for real-time and precise energy-consuming prediction have increased correspondingly. Targeting at limitations of existing models in environmental perception and dynamic calibration, this research put forward a novel model for energy-consuming prediction. This model integrated environmental perception with reinforcement learning. Specifically, at first, a road-condition perceiving approach deeply coupled with reinforcement learning was designed, and a linear multi-scale attention encoder was constructed. The aim was to extract multi-granularity environmental features related to energy efficiency and thus enhance the model’s representational capabilities under complicated dynamic driving situations. Second, a real-time energy efficiency estimation model was developed under a Markov decision process, which was also mapped to the reinforcement learning framework. Based on temporal-difference learning, the data-driven Q function was iteratively updated, and constant calibration of energy estimation was realized. Finally, a prioritization mechanism for causal-structure-based Kullback–Leibler (KL) divergence scenarios was proposed to enhance the sampling efficiency in cases of critical incidents such as slope variations and abrupt speed-accelerating/decelerating, while strengthening the robustness and generalization of the model under complicated conditions. Results confirmed the superior stability and robustness of the proposed approach across multiple operating conditions and vehicle types. Specifically, the mean absolute error (MAE) was below 12%; the root mean-squared error (RMSE) exhibited a value under 1.8%; and the R2 value exceeded 99.5%. All these demonstrated its significantly improved efficiency over Transformer, Informer, Mamba, and long short-term memory (LSTM) models. EVs’ actual energy consumption in the real world was also compared with that in speed profile (EVECS) dataset, presenting an MAE below 1.15% and a RMSE under 1.65%, which further verified its excellent generalization.
期刊介绍:
The theoretical, experimental and operational aspects of electrical and electronics engineering and information technologies as applied to Intelligent Transportation Systems (ITS). Intelligent Transportation Systems are defined as those systems utilizing synergistic technologies and systems engineering concepts to develop and improve transportation systems of all kinds. The scope of this interdisciplinary activity includes the promotion, consolidation and coordination of ITS technical activities among IEEE entities, and providing a focus for cooperative activities, both internally and externally.