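Influence of Discrete and Continuous Action Spaces on Deep Reinforcement Learning-Based Pricing Strategy Optimization for Electricity Retailers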

2021 IEEE Sustainable Power and Energy Conference (iSPEC). Pub Date: 2021-12-23. DOI: 10.1109/iSPEC53008.2021.9735962

Hongsheng Xu, Xiaowei Cai, Jiao Shu, Jixiang Lu


Citations: 1

Abstract

Optimizing the pricing strategy has become an important problem for electricity retailers in the electricity market. Deep reinforcement learning (DRL) has been applied to strategic decision-making problems in electricity markets. However, how the choice between discrete and continuous action spaces affects the results when DRL-based methods are used to find the optimal retail price is unknown. This paper applies two DRL-based retail pricing strategies for electricity retailers, one using a deep Q network (DQN) and one using deep deterministic policy gradient (DDPG). An in-depth comparative analysis of DQN and DDPG is conducted in terms of convergence and computational performance. The numerical results for the optimal retail prices and the responding loads show the influence of discrete and continuous action spaces on the optimization outcome.
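
To make the distinction between the two action spaces concrete, the following is a minimal sketch and not the authors' implementation. It assumes a retail price bounded by hypothetical limits `PRICE_MIN` and `PRICE_MAX`, a DQN-style agent that picks one price from a fixed grid by taking the largest Q-value, and a DDPG-style agent whose actor output in [-1, 1] is rescaled to a price. The function names, bounds, and grid size are illustrative assumptions; none of them come from the paper.

```python
import numpy as np

# Hypothetical price bounds for illustration only; not taken from the paper.
PRICE_MIN, PRICE_MAX = 0.05, 0.25  # assumed retail price range


def discrete_price(q_values: np.ndarray) -> float:
    """DQN-style choice: pick the price level with the highest Q-value.

    The action space is a fixed grid with one price per Q-value,
    so only len(q_values) distinct prices can ever be selected.
    """
    price_grid = np.linspace(PRICE_MIN, PRICE_MAX, num=len(q_values))
    return float(price_grid[int(np.argmax(q_values))])


def continuous_price(actor_output: float) -> float:
    """DDPG-style choice: rescale an actor output in [-1, 1] to a price.

    Any price inside the bounds is reachable, not just grid points.
    """
    a = float(np.clip(actor_output, -1.0, 1.0))
    return PRICE_MIN + 0.5 * (a + 1.0) * (PRICE_MAX - PRICE_MIN)
```

Under these assumptions, the discrete agent's resolution is limited by the grid spacing, while the continuous agent can output any price in the interval; this is the kind of difference the comparison of DQN and DDPG in the abstract is concerned with.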

