Manage Inventories with Learning on Demands and Buy-up Substitution Probability

Manuf. Serv. Oper. Manag. Pub Date : 2022-12-08 DOI:10.1287/msom.2022.1169

Zhenwei Luo, Pengfei Guo, Yulan Wang

{"title":"Manage Inventories with Learning on Demands and Buy-up Substitution Probability","authors":"Zhenwei Luo, Pengfei Guo, Yulan Wang","doi":"10.1287/msom.2022.1169","DOIUrl":null,"url":null,"abstract":"Problem Definition: This paper considers a setting in which an airline company sells seats periodically, and each period consists of two selling phases, an early-bird discount phase and a regular-price phase. In each period, when the early-bird discount seat is stocked out, an early-bird customer who comes for the discounted seat either purchases the regular-price seat as a substitute (called buy-up substitution) or simply leaves. Methodology/Results: The optimal inventory level of the discounted seats reserved for the early-bird sale is a critical decision for the airline company to maximize its revenue. The airline company learns about the demands for both discounted and regular-price seats and the buy-up substitution probability from historical sales data, which, in turn, are affected by past inventory allocation decisions. In this paper, we investigate two information scenarios based on whether lost sales are observable, and we provide the corresponding Bayesian updating mechanism for learning about demand parameters and substitution probability. We then construct a dynamic programming model to derive the Bayesian optimal inventory level decisions in a multiperiod setting. The literature finds that the unobservability of lost sales drives the inventory manager to stock more (i.e., the Bayesian optimal inventory level should be kept higher than the myopic inventory level) to observe and learn more about demand distributions. Here, we show that when the buy-up substitution probability is known, one may stock less, because one can infer some information about the primary demand for the discounted seat from the customer substitution behavior. We also find that to learn about the unknown buy-up substitution probability drives the inventory manager to stock less so as to induce more substitution trials. Finally, we develop a SoftMax algorithm to solve our dynamic programming problem. We show that the obtained stock more (less) result can be utilized to speed up the convergence of the algorithm to the optimal solution. Managerial Implications: Our results shed light on the airline seat protection level decision with learning about demand parameters and buy-up substitution probability. Compared with myopic optimization, Bayesian inventory decisions that consider the exploration-exploitation tradeoff can avoid getting stuck in local optima and improve the revenue. We also identify new driving forces behind the stock more (less) result that complement the Bayesian inventory management literature. Funding: Z. Luo acknowledges the financial support by the Internal Start-up Fund of The Hong Kong Polytechnic University [Grant P0039035]. P. Guo acknowledges the financial support from the Research Grants Council of Hong Kong [Grant 15508518]. Y. Wang’s work was supported by the Research Grants Council of Hong Kong [Grant 15505318] and the National Natural Science Foundation of China [Grant 71971184]. Supplemental Material: The e-companion is available at https://doi.org/10.1287/msom.2022.1169 .","PeriodicalId":18108,"journal":{"name":"Manuf. Serv. Oper. Manag.","volume":"24 1","pages":"563-580"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Manuf. Serv. Oper. Manag.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1287/msom.2022.1169","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Problem Definition: This paper considers a setting in which an airline company sells seats periodically, and each period consists of two selling phases, an early-bird discount phase and a regular-price phase. In each period, when the early-bird discount seat is stocked out, an early-bird customer who comes for the discounted seat either purchases the regular-price seat as a substitute (called buy-up substitution) or simply leaves. Methodology/Results: The optimal inventory level of the discounted seats reserved for the early-bird sale is a critical decision for the airline company to maximize its revenue. The airline company learns about the demands for both discounted and regular-price seats and the buy-up substitution probability from historical sales data, which, in turn, are affected by past inventory allocation decisions. In this paper, we investigate two information scenarios based on whether lost sales are observable, and we provide the corresponding Bayesian updating mechanism for learning about demand parameters and substitution probability. We then construct a dynamic programming model to derive the Bayesian optimal inventory level decisions in a multiperiod setting. The literature finds that the unobservability of lost sales drives the inventory manager to stock more (i.e., the Bayesian optimal inventory level should be kept higher than the myopic inventory level) to observe and learn more about demand distributions. Here, we show that when the buy-up substitution probability is known, one may stock less, because one can infer some information about the primary demand for the discounted seat from the customer substitution behavior. We also find that to learn about the unknown buy-up substitution probability drives the inventory manager to stock less so as to induce more substitution trials. Finally, we develop a SoftMax algorithm to solve our dynamic programming problem. We show that the obtained stock more (less) result can be utilized to speed up the convergence of the algorithm to the optimal solution. Managerial Implications: Our results shed light on the airline seat protection level decision with learning about demand parameters and buy-up substitution probability. Compared with myopic optimization, Bayesian inventory decisions that consider the exploration-exploitation tradeoff can avoid getting stuck in local optima and improve the revenue. We also identify new driving forces behind the stock more (less) result that complement the Bayesian inventory management literature. Funding: Z. Luo acknowledges the financial support by the Internal Start-up Fund of The Hong Kong Polytechnic University [Grant P0039035]. P. Guo acknowledges the financial support from the Research Grants Council of Hong Kong [Grant 15508518]. Y. Wang’s work was supported by the Research Grants Council of Hong Kong [Grant 15505318] and the National Natural Science Foundation of China [Grant 71971184]. Supplemental Material: The e-companion is available at https://doi.org/10.1287/msom.2022.1169 .

查看原文本刊更多论文

利用需求学习和购买替代概率管理库存

问题定义:本文考虑一个航空公司定期销售座位的设定，每个时期包括两个销售阶段，一个是早鸟折扣阶段，一个是正价阶段。在每个时期，当早到的折扣座位被卖光时，一个早到的顾客要么购买正价座位作为替代(称为买断替代)，要么干脆离开。方法/结果:对于航空公司而言，为早鸟销售预留折扣座位的最优库存水平是实现收益最大化的关键决策。航空公司从历史销售数据中了解到对折扣和正价座位的需求以及购买替代概率，而历史销售数据又受到过去库存分配决策的影响。本文研究了基于销售损失是否可观察的两种信息情景，并提供了相应的贝叶斯更新机制来学习需求参数和替代概率。在此基础上，构建了一个动态规划模型，推导出多周期环境下的贝叶斯最优库存水平决策。文献发现，销售损失的不可观察性促使库存管理者增加库存(即贝叶斯最优库存水平应高于近视库存水平)，以观察和了解更多的需求分布。在此，我们证明了当购买替代概率已知时，人们可能会减少库存，因为人们可以从顾客替代行为中推断出对折扣座位的主要需求的一些信息。我们还发现，了解未知的购买替代概率会促使库存管理者减少库存，从而引发更多的替代试验。最后，我们开发了一个SoftMax算法来解决我们的动态规划问题。我们证明了所得到的库存多(少)结果可以用来加快算法收敛到最优解的速度。管理启示:我们的研究结果揭示了航空公司座位保护水平的决策，了解了需求参数和购买替代概率。与短视优化相比，考虑勘探开发权衡的贝叶斯库存决策可以避免陷入局部最优，提高收益。我们还确定了库存多(少)结果背后的新驱动力，以补充贝叶斯库存管理文献。基金资助:罗铮感谢香港理工大学内部创业基金的资助[Grant P0039035]。郭培平感谢香港研究资助局的资助[拨款号15508518]。Wang Y.的研究得到香港研究资助局[Grant 15505318]和中国国家自然科学基金[Grant 71971184]的资助。补充材料:电子伴侣可在https://doi.org/10.1287/msom.2022.1169上获得。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Manuf. Serv. Oper. Manag.

自引率

0.00%

发文量