重叠多强盗最佳武器识别

2019 IEEE International Symposium on Information Theory (ISIT) Pub Date : 2019-07-01 DOI:10.1109/ISIT.2019.8849327

J. Scarlett, Ilija Bogunovic, V. Cevher

{"title":"重叠多强盗最佳武器识别","authors":"J. Scarlett, Ilija Bogunovic, V. Cevher","doi":"10.1109/ISIT.2019.8849327","DOIUrl":null,"url":null,"abstract":"In the multi-armed bandit literature, the multibandit best-arm identification problem consists of determining each best arm in a number of disjoint groups of arms, with as few total arm pulls as possible. In this paper, we introduce a variant of the multi-bandit problem with overlapping groups, and present two algorithms for this problem based on successive elimination and lower/upper confidence bounds (LUCB). We bound the number of total arm pulls required for high-probability best-arm identification in every group, and we complement these bounds with a near-matching algorithm-independent lower bound.","PeriodicalId":6708,"journal":{"name":"2019 IEEE International Symposium on Information Theory (ISIT)","volume":"121 2 1","pages":"2544-2548"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"Overlapping Multi-Bandit Best Arm Identification\",\"authors\":\"J. Scarlett, Ilija Bogunovic, V. Cevher\",\"doi\":\"10.1109/ISIT.2019.8849327\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the multi-armed bandit literature, the multibandit best-arm identification problem consists of determining each best arm in a number of disjoint groups of arms, with as few total arm pulls as possible. In this paper, we introduce a variant of the multi-bandit problem with overlapping groups, and present two algorithms for this problem based on successive elimination and lower/upper confidence bounds (LUCB). We bound the number of total arm pulls required for high-probability best-arm identification in every group, and we complement these bounds with a near-matching algorithm-independent lower bound.\",\"PeriodicalId\":6708,\"journal\":{\"name\":\"2019 IEEE International Symposium on Information Theory (ISIT)\",\"volume\":\"121 2 1\",\"pages\":\"2544-2548\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE International Symposium on Information Theory (ISIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISIT.2019.8849327\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE International Symposium on Information Theory (ISIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISIT.2019.8849327","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 8

摘要

在多臂强盗文献中，多臂强盗最佳臂识别问题包括在许多不相交的臂组中确定每个最佳臂，并且总臂拉力尽可能少。本文引入了一种具有重叠群的多盗匪问题的变体，并给出了两种基于逐次消去和上下置信区间(LUCB)的算法。我们限定了每组中高概率最佳手臂识别所需的总手臂拉拔次数，并用一个接近匹配的与算法无关的下界来补充这些边界。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Overlapping Multi-Bandit Best Arm Identification

In the multi-armed bandit literature, the multibandit best-arm identification problem consists of determining each best arm in a number of disjoint groups of arms, with as few total arm pulls as possible. In this paper, we introduce a variant of the multi-bandit problem with overlapping groups, and present two algorithms for this problem based on successive elimination and lower/upper confidence bounds (LUCB). We bound the number of total arm pulls required for high-probability best-arm identification in every group, and we complement these bounds with a near-matching algorithm-independent lower bound.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2019 IEEE International Symposium on Information Theory (ISIT)

自引率

0.00%

发文量