{"title":"Online quickest multiarmed bandit algorithm for distributive renewable energy resources","authors":"Yi Huang, L. Lai, Husheng Li, Wei Chen, Zhu Han","doi":"10.1109/SmartGridComm.2012.6486044","DOIUrl":null,"url":null,"abstract":"Distributive renewable energy resource (DRER) system is one key research in smart grid technology. One important challenge is to select the best among many DRERs. In this paper, we develop an online quickest multiarmed bandit algorithm to determine the best choice of DRERs as few samples as possible, under the constraint of accuracy. We derive the close form for the confident interval and obtain an upper bound for the expected regret for the proposed scheme. From the simulation results, we can show that a user can effectively switch and select the best DRER with the minimum delay while balancing the exploitation and exploration.","PeriodicalId":143915,"journal":{"name":"2012 IEEE Third International Conference on Smart Grid Communications (SmartGridComm)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE Third International Conference on Smart Grid Communications (SmartGridComm)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SmartGridComm.2012.6486044","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
Distributive renewable energy resource (DRER) system is one key research in smart grid technology. One important challenge is to select the best among many DRERs. In this paper, we develop an online quickest multiarmed bandit algorithm to determine the best choice of DRERs as few samples as possible, under the constraint of accuracy. We derive the close form for the confident interval and obtain an upper bound for the expected regret for the proposed scheme. From the simulation results, we can show that a user can effectively switch and select the best DRER with the minimum delay while balancing the exploitation and exploration.