基于多智能体异步更新深度强化学习的智能家用电动汽车充放电调度

IF 4.9 3区 计算机科学 Q1 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE
Qiang Zhao , Chengwei Xu , Chuan Sun , Yinghua Han
{"title":"基于多智能体异步更新深度强化学习的智能家用电动汽车充放电调度","authors":"Qiang Zhao ,&nbsp;Chengwei Xu ,&nbsp;Chuan Sun ,&nbsp;Yinghua Han","doi":"10.1016/j.compeleceng.2025.110473","DOIUrl":null,"url":null,"abstract":"<div><div>Despite the increasing penetration of electric vehicle (EV), significant efforts are still required to transition towards a low-carbon future while balancing economy and power system stability. Coordinating EV charging and discharging scheduling in residential areas faces challenges due to uncertainties in EV owners’ commuting behaviors, complex energy demands, and unpredictable power information. This paper formulates the <strong>R</strong>esidential <strong>EV C</strong>harging and <strong>D</strong>ischarging <strong>S</strong>cheduling (REV-CDS) as a Markov Decision Process with an unknown transition function. <strong>M</strong>ulti-<strong>A</strong>gent-<strong>A</strong>synchronous-<strong>S</strong>oft-<strong>A</strong>ctor-<strong>C</strong>ritic (MAASAC) algorithm is proposed to solve the Markov Decision Process. The proposed method employs the asynchronous updating process instead of synchronous updating, which allows agents to maintain consistent policy update direction, enabling better learning of a charging and discharging strategy to improve coordination among EVs and highly align with the overall scheduling goals in the REV-CDS environment. Finally, several numerical studies were conducted to compare the proposed approach with classical multi-agent reinforcement learning methods. The studies demonstrate the effectiveness of improvement in minimizing charging costs, reducing carbon emissions, alleviating charging anxiety, and preventing transformer overload.</div></div>","PeriodicalId":50630,"journal":{"name":"Computers & Electrical Engineering","volume":"126 ","pages":"Article 110473"},"PeriodicalIF":4.9000,"publicationDate":"2025-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Smart residential electric vehicle charging and discharging scheduling via multi-agent asynchronous-updating deep reinforcement learning\",\"authors\":\"Qiang Zhao ,&nbsp;Chengwei Xu ,&nbsp;Chuan Sun ,&nbsp;Yinghua Han\",\"doi\":\"10.1016/j.compeleceng.2025.110473\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Despite the increasing penetration of electric vehicle (EV), significant efforts are still required to transition towards a low-carbon future while balancing economy and power system stability. Coordinating EV charging and discharging scheduling in residential areas faces challenges due to uncertainties in EV owners’ commuting behaviors, complex energy demands, and unpredictable power information. This paper formulates the <strong>R</strong>esidential <strong>EV C</strong>harging and <strong>D</strong>ischarging <strong>S</strong>cheduling (REV-CDS) as a Markov Decision Process with an unknown transition function. <strong>M</strong>ulti-<strong>A</strong>gent-<strong>A</strong>synchronous-<strong>S</strong>oft-<strong>A</strong>ctor-<strong>C</strong>ritic (MAASAC) algorithm is proposed to solve the Markov Decision Process. The proposed method employs the asynchronous updating process instead of synchronous updating, which allows agents to maintain consistent policy update direction, enabling better learning of a charging and discharging strategy to improve coordination among EVs and highly align with the overall scheduling goals in the REV-CDS environment. Finally, several numerical studies were conducted to compare the proposed approach with classical multi-agent reinforcement learning methods. The studies demonstrate the effectiveness of improvement in minimizing charging costs, reducing carbon emissions, alleviating charging anxiety, and preventing transformer overload.</div></div>\",\"PeriodicalId\":50630,\"journal\":{\"name\":\"Computers & Electrical Engineering\",\"volume\":\"126 \",\"pages\":\"Article 110473\"},\"PeriodicalIF\":4.9000,\"publicationDate\":\"2025-06-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers & Electrical Engineering\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0045790625004161\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Electrical Engineering","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0045790625004161","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0

摘要

尽管电动汽车(EV)的普及率越来越高,但在平衡经济和电力系统稳定性的同时,向低碳未来转型仍需要付出巨大努力。由于电动汽车车主出行行为的不确定性、复杂的能源需求和不可预测的电力信息,住宅小区电动汽车充放电调度协调面临挑战。本文将住宅电动汽车充放电调度(REV-CDS)表述为一个具有未知过渡函数的马尔可夫决策过程。针对马尔可夫决策过程,提出了多智能体-异步-软行为者-批评家(MAASAC)算法。该方法采用异步更新过程代替同步更新,使agent保持一致的策略更新方向,更好地学习充电和放电策略,提高了REV-CDS环境下电动汽车之间的协调性,与总体调度目标高度一致。最后,进行了一些数值研究,将该方法与经典的多智能体强化学习方法进行了比较。研究证明了改进在降低充电成本、减少碳排放、缓解充电焦虑和防止变压器过载方面的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Smart residential electric vehicle charging and discharging scheduling via multi-agent asynchronous-updating deep reinforcement learning
Despite the increasing penetration of electric vehicle (EV), significant efforts are still required to transition towards a low-carbon future while balancing economy and power system stability. Coordinating EV charging and discharging scheduling in residential areas faces challenges due to uncertainties in EV owners’ commuting behaviors, complex energy demands, and unpredictable power information. This paper formulates the Residential EV Charging and Discharging Scheduling (REV-CDS) as a Markov Decision Process with an unknown transition function. Multi-Agent-Asynchronous-Soft-Actor-Critic (MAASAC) algorithm is proposed to solve the Markov Decision Process. The proposed method employs the asynchronous updating process instead of synchronous updating, which allows agents to maintain consistent policy update direction, enabling better learning of a charging and discharging strategy to improve coordination among EVs and highly align with the overall scheduling goals in the REV-CDS environment. Finally, several numerical studies were conducted to compare the proposed approach with classical multi-agent reinforcement learning methods. The studies demonstrate the effectiveness of improvement in minimizing charging costs, reducing carbon emissions, alleviating charging anxiety, and preventing transformer overload.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Computers & Electrical Engineering
Computers & Electrical Engineering 工程技术-工程:电子与电气
CiteScore
9.20
自引率
7.00%
发文量
661
审稿时长
47 days
期刊介绍: The impact of computers has nowhere been more revolutionary than in electrical engineering. The design, analysis, and operation of electrical and electronic systems are now dominated by computers, a transformation that has been motivated by the natural ease of interface between computers and electrical systems, and the promise of spectacular improvements in speed and efficiency. Published since 1973, Computers & Electrical Engineering provides rapid publication of topical research into the integration of computer technology and computational techniques with electrical and electronic systems. The journal publishes papers featuring novel implementations of computers and computational techniques in areas like signal and image processing, high-performance computing, parallel processing, and communications. Special attention will be paid to papers describing innovative architectures, algorithms, and software tools.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信