Amr Mohamed, Antoine Lesage-Landry, Joshua A. Taylor
{"title":"利用对抗性多臂强盗调度恒温控制负载进行频率调节","authors":"Amr Mohamed, Antoine Lesage-Landry, Joshua A. Taylor","doi":"10.1109/EPEC.2017.8286168","DOIUrl":null,"url":null,"abstract":"Utilizing residential Thermostatically Controlled Loads (TCLs) for demand response stands to offer a more economical and environmentally friendly alternative to procuring energy storage and generation facilities for grid ancillary services. We use the adversarial multi-armed bandit framework to learn the signal response of TCLs and determine which TCLs to activate for demand response in real-time. We demonstrate the performance of our proposed approach by invoking theoretical bounds on the performance of an Exp3.M-based algorithm, and comparing the performance with a greedy algorithm. A sub-linear regret shows that the algorithm is able to learn and identify high-performing TCLs, and activate them more frequently as more information is acquired about the TCLs' signal response.","PeriodicalId":141250,"journal":{"name":"2017 IEEE Electrical Power and Energy Conference (EPEC)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Dispatching thermostatically controlled loads for frequency regulation using adversarial multi-armed bandits\",\"authors\":\"Amr Mohamed, Antoine Lesage-Landry, Joshua A. Taylor\",\"doi\":\"10.1109/EPEC.2017.8286168\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Utilizing residential Thermostatically Controlled Loads (TCLs) for demand response stands to offer a more economical and environmentally friendly alternative to procuring energy storage and generation facilities for grid ancillary services. We use the adversarial multi-armed bandit framework to learn the signal response of TCLs and determine which TCLs to activate for demand response in real-time. We demonstrate the performance of our proposed approach by invoking theoretical bounds on the performance of an Exp3.M-based algorithm, and comparing the performance with a greedy algorithm. A sub-linear regret shows that the algorithm is able to learn and identify high-performing TCLs, and activate them more frequently as more information is acquired about the TCLs' signal response.\",\"PeriodicalId\":141250,\"journal\":{\"name\":\"2017 IEEE Electrical Power and Energy Conference (EPEC)\",\"volume\":\"39 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 IEEE Electrical Power and Energy Conference (EPEC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/EPEC.2017.8286168\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE Electrical Power and Energy Conference (EPEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EPEC.2017.8286168","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Dispatching thermostatically controlled loads for frequency regulation using adversarial multi-armed bandits
Utilizing residential Thermostatically Controlled Loads (TCLs) for demand response stands to offer a more economical and environmentally friendly alternative to procuring energy storage and generation facilities for grid ancillary services. We use the adversarial multi-armed bandit framework to learn the signal response of TCLs and determine which TCLs to activate for demand response in real-time. We demonstrate the performance of our proposed approach by invoking theoretical bounds on the performance of an Exp3.M-based algorithm, and comparing the performance with a greedy algorithm. A sub-linear regret shows that the algorithm is able to learn and identify high-performing TCLs, and activate them more frequently as more information is acquired about the TCLs' signal response.