基于离线强化学习的住宅区电动汽车充电调度策略

IF 8.9 2区 工程技术 Q1 ENERGY & FUELS
Runda Jia , Hengxin Pan , Shulei Zhang , Yao Hu
{"title":"基于离线强化学习的住宅区电动汽车充电调度策略","authors":"Runda Jia ,&nbsp;Hengxin Pan ,&nbsp;Shulei Zhang ,&nbsp;Yao Hu","doi":"10.1016/j.est.2024.114319","DOIUrl":null,"url":null,"abstract":"<div><div>As the number of electric vehicles(EVs) increases, reinforcement learning(RL) faces more challenges in EV charging scheduling. Online RL requires lots of interaction with the environment and trial and error, which may lead to high costs and potential risks. In addition, the large-scale application of EVs causes curse of dimensionality in RL. In response to these problems, this work constructed a residential area microgrid model that comprehensively considered the nonlinear charging models of different types of EVs and the vehicle-to-grid (V2G) mode. The charging scheduling problem is represented as a Constrained Markov Decision Process (CMDP), employing a model-free RL framework to proficiently address uncertainties. In response to the curse of dimensionality problem, this paper designs a charging strategy, and divides EVs into different sets according to their statuses. The agent transmits control signals to the sets, thereby efficiently reducing the dimension of the action space. Subsequently, the Lagrangian-BCQ algorithm is trained using the offline data set, the charging strategy based on the Lagrangian-BCQ algorithm is employed to address the CMDP, with the incorporation of a safety filter to guarantee compliance with stringent constraints. Through numerical simulation experiments, the effectiveness of the strategy proposed in this work was verified.</div></div>","PeriodicalId":15942,"journal":{"name":"Journal of energy storage","volume":null,"pages":null},"PeriodicalIF":8.9000,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Charging scheduling strategy for electric vehicles in residential areas based on offline reinforcement learning\",\"authors\":\"Runda Jia ,&nbsp;Hengxin Pan ,&nbsp;Shulei Zhang ,&nbsp;Yao Hu\",\"doi\":\"10.1016/j.est.2024.114319\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>As the number of electric vehicles(EVs) increases, reinforcement learning(RL) faces more challenges in EV charging scheduling. Online RL requires lots of interaction with the environment and trial and error, which may lead to high costs and potential risks. In addition, the large-scale application of EVs causes curse of dimensionality in RL. In response to these problems, this work constructed a residential area microgrid model that comprehensively considered the nonlinear charging models of different types of EVs and the vehicle-to-grid (V2G) mode. The charging scheduling problem is represented as a Constrained Markov Decision Process (CMDP), employing a model-free RL framework to proficiently address uncertainties. In response to the curse of dimensionality problem, this paper designs a charging strategy, and divides EVs into different sets according to their statuses. The agent transmits control signals to the sets, thereby efficiently reducing the dimension of the action space. Subsequently, the Lagrangian-BCQ algorithm is trained using the offline data set, the charging strategy based on the Lagrangian-BCQ algorithm is employed to address the CMDP, with the incorporation of a safety filter to guarantee compliance with stringent constraints. Through numerical simulation experiments, the effectiveness of the strategy proposed in this work was verified.</div></div>\",\"PeriodicalId\":15942,\"journal\":{\"name\":\"Journal of energy storage\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":8.9000,\"publicationDate\":\"2024-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of energy storage\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2352152X24039057\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENERGY & FUELS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of energy storage","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352152X24039057","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENERGY & FUELS","Score":null,"Total":0}
引用次数: 0

摘要

随着电动汽车(EV)数量的增加,强化学习(RL)在电动汽车充电调度方面面临更多挑战。在线强化学习需要与环境进行大量交互并不断试错,这可能会导致高成本和潜在风险。此外,电动汽车的大规模应用会导致 RL 的维度诅咒。针对这些问题,本研究构建了一个住宅区微电网模型,全面考虑了不同类型电动汽车的非线性充电模型和车对网(V2G)模式。充电调度问题被表示为受约束马尔可夫决策过程(CMDP),并采用无模型 RL 框架来有效解决不确定性问题。针对 "维度诅咒 "问题,本文设计了一种充电策略,并根据电动汽车的状态将其分为不同的组。代理将控制信号传送到各组,从而有效地降低了行动空间的维度。随后,利用离线数据集训练拉格朗日-BCQ 算法,并采用基于拉格朗日-BCQ 算法的充电策略来解决 CMDP 问题,同时加入安全过滤器以保证符合严格的约束条件。通过数值模拟实验,验证了本文提出的策略的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Charging scheduling strategy for electric vehicles in residential areas based on offline reinforcement learning
As the number of electric vehicles(EVs) increases, reinforcement learning(RL) faces more challenges in EV charging scheduling. Online RL requires lots of interaction with the environment and trial and error, which may lead to high costs and potential risks. In addition, the large-scale application of EVs causes curse of dimensionality in RL. In response to these problems, this work constructed a residential area microgrid model that comprehensively considered the nonlinear charging models of different types of EVs and the vehicle-to-grid (V2G) mode. The charging scheduling problem is represented as a Constrained Markov Decision Process (CMDP), employing a model-free RL framework to proficiently address uncertainties. In response to the curse of dimensionality problem, this paper designs a charging strategy, and divides EVs into different sets according to their statuses. The agent transmits control signals to the sets, thereby efficiently reducing the dimension of the action space. Subsequently, the Lagrangian-BCQ algorithm is trained using the offline data set, the charging strategy based on the Lagrangian-BCQ algorithm is employed to address the CMDP, with the incorporation of a safety filter to guarantee compliance with stringent constraints. Through numerical simulation experiments, the effectiveness of the strategy proposed in this work was verified.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of energy storage
Journal of energy storage Energy-Renewable Energy, Sustainability and the Environment
CiteScore
11.80
自引率
24.50%
发文量
2262
审稿时长
69 days
期刊介绍: Journal of energy storage focusses on all aspects of energy storage, in particular systems integration, electric grid integration, modelling and analysis, novel energy storage technologies, sizing and management strategies, business models for operation of storage systems and energy storage developments worldwide.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信