Amr Mohamed, Antoine Lesage-Landry, Joshua A. Taylor
{"title":"Dispatching thermostatically controlled loads for frequency regulation using adversarial multi-armed bandits","authors":"Amr Mohamed, Antoine Lesage-Landry, Joshua A. Taylor","doi":"10.1109/EPEC.2017.8286168","DOIUrl":null,"url":null,"abstract":"Utilizing residential Thermostatically Controlled Loads (TCLs) for demand response stands to offer a more economical and environmentally friendly alternative to procuring energy storage and generation facilities for grid ancillary services. We use the adversarial multi-armed bandit framework to learn the signal response of TCLs and determine which TCLs to activate for demand response in real-time. We demonstrate the performance of our proposed approach by invoking theoretical bounds on the performance of an Exp3.M-based algorithm, and comparing the performance with a greedy algorithm. A sub-linear regret shows that the algorithm is able to learn and identify high-performing TCLs, and activate them more frequently as more information is acquired about the TCLs' signal response.","PeriodicalId":141250,"journal":{"name":"2017 IEEE Electrical Power and Energy Conference (EPEC)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE Electrical Power and Energy Conference (EPEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EPEC.2017.8286168","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Utilizing residential Thermostatically Controlled Loads (TCLs) for demand response stands to offer a more economical and environmentally friendly alternative to procuring energy storage and generation facilities for grid ancillary services. We use the adversarial multi-armed bandit framework to learn the signal response of TCLs and determine which TCLs to activate for demand response in real-time. We demonstrate the performance of our proposed approach by invoking theoretical bounds on the performance of an Exp3.M-based algorithm, and comparing the performance with a greedy algorithm. A sub-linear regret shows that the algorithm is able to learn and identify high-performing TCLs, and activate them more frequently as more information is acquired about the TCLs' signal response.