基于离线强化学习的住宅区电动汽车充电调度策略

IF 8.9 2区工程技术 Q1 ENERGY & FUELS

Journal of energy storage Pub Date : 2024-11-01 DOI:10.1016/j.est.2024.114319

Runda Jia , Hengxin Pan , Shulei Zhang , Yao Hu

{"title":"基于离线强化学习的住宅区电动汽车充电调度策略","authors":"Runda Jia , Hengxin Pan , Shulei Zhang , Yao Hu","doi":"10.1016/j.est.2024.114319","DOIUrl":null,"url":null,"abstract":"<div><div>As the number of electric vehicles(EVs) increases, reinforcement learning(RL) faces more challenges in EV charging scheduling. Online RL requires lots of interaction with the environment and trial and error, which may lead to high costs and potential risks. In addition, the large-scale application of EVs causes curse of dimensionality in RL. In response to these problems, this work constructed a residential area microgrid model that comprehensively considered the nonlinear charging models of different types of EVs and the vehicle-to-grid (V2G) mode. The charging scheduling problem is represented as a Constrained Markov Decision Process (CMDP), employing a model-free RL framework to proficiently address uncertainties. In response to the curse of dimensionality problem, this paper designs a charging strategy, and divides EVs into different sets according to their statuses. The agent transmits control signals to the sets, thereby efficiently reducing the dimension of the action space. Subsequently, the Lagrangian-BCQ algorithm is trained using the offline data set, the charging strategy based on the Lagrangian-BCQ algorithm is employed to address the CMDP, with the incorporation of a safety filter to guarantee compliance with stringent constraints. Through numerical simulation experiments, the effectiveness of the strategy proposed in this work was verified.</div></div>","PeriodicalId":15942,"journal":{"name":"Journal of energy storage","volume":"103 ","pages":"Article 114319"},"PeriodicalIF":8.9000,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Charging scheduling strategy for electric vehicles in residential areas based on offline reinforcement learning\",\"authors\":\"Runda Jia , Hengxin Pan , Shulei Zhang , Yao Hu\",\"doi\":\"10.1016/j.est.2024.114319\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>As the number of electric vehicles(EVs) increases, reinforcement learning(RL) faces more challenges in EV charging scheduling. Online RL requires lots of interaction with the environment and trial and error, which may lead to high costs and potential risks. In addition, the large-scale application of EVs causes curse of dimensionality in RL. In response to these problems, this work constructed a residential area microgrid model that comprehensively considered the nonlinear charging models of different types of EVs and the vehicle-to-grid (V2G) mode. The charging scheduling problem is represented as a Constrained Markov Decision Process (CMDP), employing a model-free RL framework to proficiently address uncertainties. In response to the curse of dimensionality problem, this paper designs a charging strategy, and divides EVs into different sets according to their statuses. The agent transmits control signals to the sets, thereby efficiently reducing the dimension of the action space. Subsequently, the Lagrangian-BCQ algorithm is trained using the offline data set, the charging strategy based on the Lagrangian-BCQ algorithm is employed to address the CMDP, with the incorporation of a safety filter to guarantee compliance with stringent constraints. Through numerical simulation experiments, the effectiveness of the strategy proposed in this work was verified.</div></div>\",\"PeriodicalId\":15942,\"journal\":{\"name\":\"Journal of energy storage\",\"volume\":\"103 \",\"pages\":\"Article 114319\"},\"PeriodicalIF\":8.9000,\"publicationDate\":\"2024-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of energy storage\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2352152X24039057\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENERGY & FUELS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of energy storage","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352152X24039057","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENERGY & FUELS","Score":null,"Total":0}

引用次数: 0

摘要

随着电动汽车（EV）数量的增加，强化学习（RL）在电动汽车充电调度方面面临更多挑战。在线强化学习需要与环境进行大量交互并不断试错，这可能会导致高成本和潜在风险。此外，电动汽车的大规模应用会导致 RL 的维度诅咒。针对这些问题，本研究构建了一个住宅区微电网模型，全面考虑了不同类型电动汽车的非线性充电模型和车对网（V2G）模式。充电调度问题被表示为受约束马尔可夫决策过程（CMDP），并采用无模型 RL 框架来有效解决不确定性问题。针对 "维度诅咒 "问题，本文设计了一种充电策略，并根据电动汽车的状态将其分为不同的组。代理将控制信号传送到各组，从而有效地降低了行动空间的维度。随后，利用离线数据集训练拉格朗日-BCQ 算法，并采用基于拉格朗日-BCQ 算法的充电策略来解决 CMDP 问题，同时加入安全过滤器以保证符合严格的约束条件。通过数值模拟实验，验证了本文提出的策略的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Charging scheduling strategy for electric vehicles in residential areas based on offline reinforcement learning

As the number of electric vehicles(EVs) increases, reinforcement learning(RL) faces more challenges in EV charging scheduling. Online RL requires lots of interaction with the environment and trial and error, which may lead to high costs and potential risks. In addition, the large-scale application of EVs causes curse of dimensionality in RL. In response to these problems, this work constructed a residential area microgrid model that comprehensively considered the nonlinear charging models of different types of EVs and the vehicle-to-grid (V2G) mode. The charging scheduling problem is represented as a Constrained Markov Decision Process (CMDP), employing a model-free RL framework to proficiently address uncertainties. In response to the curse of dimensionality problem, this paper designs a charging strategy, and divides EVs into different sets according to their statuses. The agent transmits control signals to the sets, thereby efficiently reducing the dimension of the action space. Subsequently, the Lagrangian-BCQ algorithm is trained using the offline data set, the charging strategy based on the Lagrangian-BCQ algorithm is employed to address the CMDP, with the incorporation of a safety filter to guarantee compliance with stringent constraints. Through numerical simulation experiments, the effectiveness of the strategy proposed in this work was verified.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of energy storage Energy-Renewable Energy, Sustainability and the Environment

CiteScore

11.80

自引率

24.50%

发文量

2262

审稿时长

69 days

期刊介绍： Journal of energy storage focusses on all aspects of energy storage, in particular systems integration, electric grid integration, modelling and analysis, novel energy storage technologies, sizing and management strategies, business models for operation of storage systems and energy storage developments worldwide.