{"title":"离散和连续行为空间对基于深度强化学习的电力零售商定价策略优化的影响","authors":"Hongsheng Xu, Xiaowei Cai, Jiao Shu, Jixiang Lu","doi":"10.1109/iSPEC53008.2021.9735962","DOIUrl":null,"url":null,"abstract":"The pricing strategy optimization problem becomes important for electricity retailers in electricity market. Deep reinforcement learning (DRL) has been applied to solve the strategic decision-making problems in electricity market area. However, the influence of discrete and continuous action spaces on optimization results by using DRL-based methods to solve for optimal retail price is unknown. This paper applies two different DRL-based retail pricing strategies through deep Q network (DQN) and deep deterministic policy gradient (DDPG) for the electricity retailers. An in-depth comparative analysis between DQN and DDPG is conducted in terms of convergence and computational performance. The numerical results of optimal retail prices and responding loads show the influence of discrete and continuous actions space on optimization effect.","PeriodicalId":417862,"journal":{"name":"2021 IEEE Sustainable Power and Energy Conference (iSPEC)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Influence of Discrete and Continuous Action Spaces on Deep Reinforcement Learning-Based Pricing Strategy Optimization for Electricity Retailers\",\"authors\":\"Hongsheng Xu, Xiaowei Cai, Jiao Shu, Jixiang Lu\",\"doi\":\"10.1109/iSPEC53008.2021.9735962\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The pricing strategy optimization problem becomes important for electricity retailers in electricity market. Deep reinforcement learning (DRL) has been applied to solve the strategic decision-making problems in electricity market area. However, the influence of discrete and continuous action spaces on optimization results by using DRL-based methods to solve for optimal retail price is unknown. This paper applies two different DRL-based retail pricing strategies through deep Q network (DQN) and deep deterministic policy gradient (DDPG) for the electricity retailers. An in-depth comparative analysis between DQN and DDPG is conducted in terms of convergence and computational performance. The numerical results of optimal retail prices and responding loads show the influence of discrete and continuous actions space on optimization effect.\",\"PeriodicalId\":417862,\"journal\":{\"name\":\"2021 IEEE Sustainable Power and Energy Conference (iSPEC)\",\"volume\":\"2 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE Sustainable Power and Energy Conference (iSPEC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/iSPEC53008.2021.9735962\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE Sustainable Power and Energy Conference (iSPEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/iSPEC53008.2021.9735962","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Influence of Discrete and Continuous Action Spaces on Deep Reinforcement Learning-Based Pricing Strategy Optimization for Electricity Retailers
The pricing strategy optimization problem becomes important for electricity retailers in electricity market. Deep reinforcement learning (DRL) has been applied to solve the strategic decision-making problems in electricity market area. However, the influence of discrete and continuous action spaces on optimization results by using DRL-based methods to solve for optimal retail price is unknown. This paper applies two different DRL-based retail pricing strategies through deep Q network (DQN) and deep deterministic policy gradient (DDPG) for the electricity retailers. An in-depth comparative analysis between DQN and DDPG is conducted in terms of convergence and computational performance. The numerical results of optimal retail prices and responding loads show the influence of discrete and continuous actions space on optimization effect.