有限交换机下带背包的盲网收益管理与盗匪

IF 0.1 4区工程技术 Q4 ENGINEERING, MANUFACTURING

Manufacturing Engineering Pub Date : 2019-11-01 DOI:10.2139/ssrn.3479477

D. Simchi-Levi, Yunzong Xu, Jinglong Zhao

{"title":"有限交换机下带背包的盲网收益管理与盗匪","authors":"D. Simchi-Levi, Yunzong Xu, Jinglong Zhao","doi":"10.2139/ssrn.3479477","DOIUrl":null,"url":null,"abstract":"Our work is motivated by a common business constraint in online markets. While firms respect the advantages of dynamic pricing and price experimentation, they must limit the number of price changes (i.e., switches) to be within some budget due to various practical reasons. We study both the classical price-based network revenue management problem in the distributionally-unknown setup, and the bandits with knapsacks problem. In these problems, a decision-maker (without prior knowledge of the environment) has finite initial inventory of multiple resources to allocate over a finite time horizon. Beyond the classical resource constraints, we introduce an additional switching constraint to these problems, which restricts the total number of times that the decision-maker makes switches to be within a fixed switching budget. For such problems, we show matching upper and lower bounds on the optimal regret, and propose computationally-efficient limited-switch algorithms that achieve the optimal regret. Our work reveals a surprising result: the optimal regret rate is completely characterized by a piecewise-constant function of the switching budget, which further depends on the number of resource constraints --- notably, this is the first time the number of resources constraints is shown to play a fundamental role in determining the statistical complexity of online learning problems. We conduct computational experiments to examine the performance of our algorithms on a numerical setup that is widely used in the literature. Compared with benchmark algorithms from the literature, our proposed algorithms achieve promising performance with clear advantages on the number of incurred switches. Practically, firms can benefit from our study and improve their learning and decision-making performance when they simultaneously face resource and switching constraints","PeriodicalId":49886,"journal":{"name":"Manufacturing Engineering","volume":"28 1","pages":""},"PeriodicalIF":0.1000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Blind Network Revenue Management and Bandits with Knapsacks under Limited Switches\",\"authors\":\"D. Simchi-Levi, Yunzong Xu, Jinglong Zhao\",\"doi\":\"10.2139/ssrn.3479477\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Our work is motivated by a common business constraint in online markets. While firms respect the advantages of dynamic pricing and price experimentation, they must limit the number of price changes (i.e., switches) to be within some budget due to various practical reasons. We study both the classical price-based network revenue management problem in the distributionally-unknown setup, and the bandits with knapsacks problem. In these problems, a decision-maker (without prior knowledge of the environment) has finite initial inventory of multiple resources to allocate over a finite time horizon. Beyond the classical resource constraints, we introduce an additional switching constraint to these problems, which restricts the total number of times that the decision-maker makes switches to be within a fixed switching budget. For such problems, we show matching upper and lower bounds on the optimal regret, and propose computationally-efficient limited-switch algorithms that achieve the optimal regret. Our work reveals a surprising result: the optimal regret rate is completely characterized by a piecewise-constant function of the switching budget, which further depends on the number of resource constraints --- notably, this is the first time the number of resources constraints is shown to play a fundamental role in determining the statistical complexity of online learning problems. We conduct computational experiments to examine the performance of our algorithms on a numerical setup that is widely used in the literature. Compared with benchmark algorithms from the literature, our proposed algorithms achieve promising performance with clear advantages on the number of incurred switches. Practically, firms can benefit from our study and improve their learning and decision-making performance when they simultaneously face resource and switching constraints\",\"PeriodicalId\":49886,\"journal\":{\"name\":\"Manufacturing Engineering\",\"volume\":\"28 1\",\"pages\":\"\"},\"PeriodicalIF\":0.1000,\"publicationDate\":\"2019-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Manufacturing Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.2139/ssrn.3479477\",\"RegionNum\":4,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"ENGINEERING, MANUFACTURING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Manufacturing Engineering","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.2139/ssrn.3479477","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ENGINEERING, MANUFACTURING","Score":null,"Total":0}

引用次数: 1

摘要

我们的工作受到在线市场中常见的业务约束的激励。虽然企业尊重动态定价和价格实验的优势，但由于各种实际原因，它们必须将价格变化(即转换)的数量限制在一定的预算范围内。本文研究了分布未知情况下基于价格的经典网络收益管理问题和带背包的强盗问题。在这些问题中，决策者(不事先了解环境)在有限的时间范围内分配有限的初始多个资源库存。除了经典的资源约束之外，我们还为这些问题引入了一个额外的交换约束，它限制了决策者在固定的交换预算内进行交换的总次数。针对这类问题，我们给出了最优后悔的匹配上界和下界，并提出了计算效率高的实现最优后悔的有限切换算法。我们的工作揭示了一个令人惊讶的结果:最优后悔率完全由切换预算的分段常数函数表征，这进一步取决于资源约束的数量——值得注意的是，这是第一次显示资源约束的数量在决定在线学习问题的统计复杂性方面发挥了基本作用。我们进行计算实验来检查我们的算法在数值设置上的性能，这在文献中被广泛使用。与文献中的基准算法相比，我们提出的算法在产生的开关数量上具有明显的优势，具有很好的性能。实际上，当企业同时面临资源约束和转换约束时，企业可以从我们的研究中获益，并提高其学习和决策绩效

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Blind Network Revenue Management and Bandits with Knapsacks under Limited Switches

Our work is motivated by a common business constraint in online markets. While firms respect the advantages of dynamic pricing and price experimentation, they must limit the number of price changes (i.e., switches) to be within some budget due to various practical reasons. We study both the classical price-based network revenue management problem in the distributionally-unknown setup, and the bandits with knapsacks problem. In these problems, a decision-maker (without prior knowledge of the environment) has finite initial inventory of multiple resources to allocate over a finite time horizon. Beyond the classical resource constraints, we introduce an additional switching constraint to these problems, which restricts the total number of times that the decision-maker makes switches to be within a fixed switching budget. For such problems, we show matching upper and lower bounds on the optimal regret, and propose computationally-efficient limited-switch algorithms that achieve the optimal regret. Our work reveals a surprising result: the optimal regret rate is completely characterized by a piecewise-constant function of the switching budget, which further depends on the number of resource constraints --- notably, this is the first time the number of resources constraints is shown to play a fundamental role in determining the statistical complexity of online learning problems. We conduct computational experiments to examine the performance of our algorithms on a numerical setup that is widely used in the literature. Compared with benchmark algorithms from the literature, our proposed algorithms achieve promising performance with clear advantages on the number of incurred switches. Practically, firms can benefit from our study and improve their learning and decision-making performance when they simultaneously face resource and switching constraints

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Manufacturing Engineering 工程技术-工程：制造

自引率

0.00%

发文量

审稿时长

6-12 weeks

期刊介绍： Information not localized