{"title":"Adaptive Duty Cycle Control for Optimal Battery Energy Storage System Charging by Reinforcement Learning","authors":"Richard Wiencek, Sagnika Ghosh","doi":"10.1109/CAI54212.2023.00013","DOIUrl":null,"url":null,"abstract":"This paper works on adaptive duty cycle control of a Solar power system using a Reinforcement Learning approach for optimizing the charging of a 12 V 30 Ah Battery Energy Storage System (BESS). The Twin-Delayed Deep Deterministic (TD3) algorithm is used to train an agent that adaptively controls the duty cycle of a Pulse-Width Modulation (PWM) signal to maintain the output voltage of the Photovoltaic (PV) system in the optimal range for charging the BESS. Results from MATLAB Simulink show that the TD3 algorithm optimizes the charging of the BESS when compared to other techniques, such as a PID controller, achieving an SOC of around 50.33% from an initial value of 50% with low noise for 10 seconds, whereas the PID controller achieved around 50.06% with high voltage spikes. This work opens avenues for expanding the system to include other renewable energy sources and their interactions for improving power distribution.","PeriodicalId":129324,"journal":{"name":"2023 IEEE Conference on Artificial Intelligence (CAI)","volume":"186 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE Conference on Artificial Intelligence (CAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CAI54212.2023.00013","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This paper works on adaptive duty cycle control of a Solar power system using a Reinforcement Learning approach for optimizing the charging of a 12 V 30 Ah Battery Energy Storage System (BESS). The Twin-Delayed Deep Deterministic (TD3) algorithm is used to train an agent that adaptively controls the duty cycle of a Pulse-Width Modulation (PWM) signal to maintain the output voltage of the Photovoltaic (PV) system in the optimal range for charging the BESS. Results from MATLAB Simulink show that the TD3 algorithm optimizes the charging of the BESS when compared to other techniques, such as a PID controller, achieving an SOC of around 50.33% from an initial value of 50% with low noise for 10 seconds, whereas the PID controller achieved around 50.06% with high voltage spikes. This work opens avenues for expanding the system to include other renewable energy sources and their interactions for improving power distribution.
本文采用强化学习方法研究太阳能发电系统的自适应占空比控制,以优化12 V 30 Ah电池储能系统(BESS)的充电。采用双延迟深度确定性(TD3)算法训练智能体自适应控制脉宽调制(PWM)信号的占空比,使光伏(PV)系统的输出电压保持在最佳充电范围内。MATLAB Simulink的结果表明,与其他技术(如PID控制器)相比,TD3算法优化了BESS的充电,在低噪声情况下,从初始值的50%在10秒内实现了约50.33%的SOC,而PID控制器在高电压尖峰情况下实现了约50.06%。这项工作为扩大该系统以包括其他可再生能源及其相互作用以改善电力分配开辟了道路。