{"title":"Parallel Version of the Mirror Descent Algorithm for the Two-Armed Bandit Problem","authors":"A. Kolnogorov, D. Shiyan","doi":"10.1109/MCSI.2016.052","DOIUrl":null,"url":null,"abstract":"We propose some modified versions of the mirror descent algorithm for the two-armed bandit problem which allow parallel processing of data. Using Monte-Carlo simulations, we estimate the minimax risk for this versions.","PeriodicalId":421998,"journal":{"name":"2016 Third International Conference on Mathematics and Computers in Sciences and in Industry (MCSI)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 Third International Conference on Mathematics and Computers in Sciences and in Industry (MCSI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MCSI.2016.052","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
We propose some modified versions of the mirror descent algorithm for the two-armed bandit problem which allow parallel processing of data. Using Monte-Carlo simulations, we estimate the minimax risk for this versions.