{"title":"电力消费者的智能需求响应:一种多手强盗博弈方法","authors":"Zibo Zhao, Andrew L. Liu","doi":"10.1109/ISAP.2017.8071376","DOIUrl":null,"url":null,"abstract":"Real-time electricity pricing (RTP) for consumers has long been argued to be key to realize the many envisioned benefits of a smart energy grid. How to actually implement an RTP scheme, however, is still under debate. Since most of the organized wholesale power markets in the US implement a two-settlement system, with day-ahead electricity price forecasts guiding financial and physical transactions in the next day and real-time ex post prices settling any real-time imbalances, it is a natural idea to let consumers respond to the day-ahead prices. Such an idea, however, may lead to consumers all respond in the same fashion, causing large swings of the energy demand and prices, which may jeopardize system stability and increase consumers' financial risks. To overcome this issue, we propose a game-theoretic framework in which each consumer solves a multi-armed bandit problem; that is, consumers learn from the history and attempts to minimize their regrets. The consequence is drastically reduced volatility on real-time prices and much flatter load curves for the entire grid. Such results are not only based on simulation, but are also supported by theories of mean-field equilibria in multi-armed bandit games.","PeriodicalId":257100,"journal":{"name":"2017 19th International Conference on Intelligent System Application to Power Systems (ISAP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Intelligent demand response for electricity consumers: A multi-armed bandit game approach\",\"authors\":\"Zibo Zhao, Andrew L. Liu\",\"doi\":\"10.1109/ISAP.2017.8071376\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Real-time electricity pricing (RTP) for consumers has long been argued to be key to realize the many envisioned benefits of a smart energy grid. How to actually implement an RTP scheme, however, is still under debate. Since most of the organized wholesale power markets in the US implement a two-settlement system, with day-ahead electricity price forecasts guiding financial and physical transactions in the next day and real-time ex post prices settling any real-time imbalances, it is a natural idea to let consumers respond to the day-ahead prices. Such an idea, however, may lead to consumers all respond in the same fashion, causing large swings of the energy demand and prices, which may jeopardize system stability and increase consumers' financial risks. To overcome this issue, we propose a game-theoretic framework in which each consumer solves a multi-armed bandit problem; that is, consumers learn from the history and attempts to minimize their regrets. The consequence is drastically reduced volatility on real-time prices and much flatter load curves for the entire grid. Such results are not only based on simulation, but are also supported by theories of mean-field equilibria in multi-armed bandit games.\",\"PeriodicalId\":257100,\"journal\":{\"name\":\"2017 19th International Conference on Intelligent System Application to Power Systems (ISAP)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 19th International Conference on Intelligent System Application to Power Systems (ISAP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISAP.2017.8071376\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 19th International Conference on Intelligent System Application to Power Systems (ISAP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISAP.2017.8071376","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Intelligent demand response for electricity consumers: A multi-armed bandit game approach
Real-time electricity pricing (RTP) for consumers has long been argued to be key to realize the many envisioned benefits of a smart energy grid. How to actually implement an RTP scheme, however, is still under debate. Since most of the organized wholesale power markets in the US implement a two-settlement system, with day-ahead electricity price forecasts guiding financial and physical transactions in the next day and real-time ex post prices settling any real-time imbalances, it is a natural idea to let consumers respond to the day-ahead prices. Such an idea, however, may lead to consumers all respond in the same fashion, causing large swings of the energy demand and prices, which may jeopardize system stability and increase consumers' financial risks. To overcome this issue, we propose a game-theoretic framework in which each consumer solves a multi-armed bandit problem; that is, consumers learn from the history and attempts to minimize their regrets. The consequence is drastically reduced volatility on real-time prices and much flatter load curves for the entire grid. Such results are not only based on simulation, but are also supported by theories of mean-field equilibria in multi-armed bandit games.