Authors: Pratik Kar, V. Sukumaran, S. Sumitra
Venue: 2022 IEEE International Conference on Signal Processing and Communications (SPCOM)
Publication date: 2022-07-11
DOI: https://doi.org/10.1109/SPCOM55316.2022.9840822
On safe sequential optimization using posterior sampling
We consider the problem of designing posterior-sampling-based sequential optimization policies for maximizing a blackbox function subject to safety constraints. Posterior sampling algorithms, which are easier to implement, have met with empirical success in blackbox maximization problems without safety constraints. We ask whether posterior sampling algorithms that satisfy safety constraints perform well at reaching the global maximum while minimizing the number of safety constraint violations. We propose a safe Gaussian process Thompson Sampling algorithm for safe maximization of a blackbox function. The algorithm uses a sample estimate of the safe set to meet the safety constraints and a mutual-information-based acquisition function to improve that estimate. We evaluate the proposed policy against prior work using simulations. We observe that the proposed policy behaves similarly to prior work in terms of safety violations while achieving the global maximum.
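The abstract gives no pseudocode, so the following is only a rough sketch of what one step of a safety-aware Gaussian process Thompson sampling loop might look like. It is not the authors' algorithm. Several things here are assumptions: that "safe" means the function value is at least some threshold, the RBF kernel and its hyperparameters, the one-dimensional grid, and all function names (`rbf_kernel`, `gp_posterior`, `safe_thompson_step`). The mutual-information-based acquisition function mentioned in the abstract is not implemented; the sketch only shows the idea of estimating the safe set from a posterior sample and maximizing that sample inside it.

```python
import numpy as np

def rbf_kernel(a, b, length_scale=0.2, variance=1.0):
    """Squared-exponential kernel between two 1-D arrays of inputs (assumed kernel)."""
    d = a[:, None] - b[None, :]
    return variance * np.exp(-0.5 * (d / length_scale) ** 2)

def gp_posterior(x_obs, y_obs, x_grid, noise=1e-2):
    """Posterior mean and covariance of a zero-mean GP on x_grid."""
    K = rbf_kernel(x_obs, x_obs) + noise * np.eye(len(x_obs))
    K_s = rbf_kernel(x_obs, x_grid)
    K_ss = rbf_kernel(x_grid, x_grid)
    L = np.linalg.cholesky(K)
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, y_obs))
    mean = K_s.T @ alpha
    v = np.linalg.solve(L, K_s)
    cov = K_ss - v.T @ v
    return mean, 0.5 * (cov + cov.T)  # symmetrize against round-off

def safe_thompson_step(x_obs, y_obs, x_grid, threshold, rng):
    """One hypothetical step: sample the posterior, estimate the safe set
    from that sample, and pick the sample's maximizer inside the safe set."""
    mean, cov = gp_posterior(x_obs, y_obs, x_grid)
    # Draw one posterior sample via an eigendecomposition, clipping tiny
    # negative eigenvalues that arise from floating-point error.
    w, V = np.linalg.eigh(cov)
    z = rng.standard_normal(len(x_grid))
    sample = mean + V @ (np.sqrt(np.clip(w, 0.0, None)) * z)
    # Assumption: a point is treated as safe if the sampled value >= threshold.
    safe = sample >= threshold
    if not safe.any():
        # Fallback (also an assumption): revisit the best observed point.
        return x_obs[np.argmax(y_obs)], safe
    idx = np.argmax(np.where(safe, sample, -np.inf))
    return x_grid[idx], safe
```

A caller would start from one or more known-safe observations, call `safe_thompson_step` with a seeded `np.random.default_rng`, evaluate the blackbox function at the returned point, append the result to the observations, and repeat.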