{"title":"Manage Inventories with Learning on Demands and Buy-up Substitution Probability","authors":"Zhenwei Luo, Pengfei Guo, Yulan Wang","doi":"10.1287/msom.2022.1169","DOIUrl":null,"url":null,"abstract":"Problem Definition: This paper considers a setting in which an airline company sells seats periodically, and each period consists of two selling phases, an early-bird discount phase and a regular-price phase. In each period, when the early-bird discount seat is stocked out, an early-bird customer who comes for the discounted seat either purchases the regular-price seat as a substitute (called buy-up substitution) or simply leaves. Methodology/Results: The optimal inventory level of the discounted seats reserved for the early-bird sale is a critical decision for the airline company to maximize its revenue. The airline company learns about the demands for both discounted and regular-price seats and the buy-up substitution probability from historical sales data, which, in turn, are affected by past inventory allocation decisions. In this paper, we investigate two information scenarios based on whether lost sales are observable, and we provide the corresponding Bayesian updating mechanism for learning about demand parameters and substitution probability. We then construct a dynamic programming model to derive the Bayesian optimal inventory level decisions in a multiperiod setting. The literature finds that the unobservability of lost sales drives the inventory manager to stock more (i.e., the Bayesian optimal inventory level should be kept higher than the myopic inventory level) to observe and learn more about demand distributions. Here, we show that when the buy-up substitution probability is known, one may stock less, because one can infer some information about the primary demand for the discounted seat from the customer substitution behavior. We also find that to learn about the unknown buy-up substitution probability drives the inventory manager to stock less so as to induce more substitution trials. Finally, we develop a SoftMax algorithm to solve our dynamic programming problem. We show that the obtained stock more (less) result can be utilized to speed up the convergence of the algorithm to the optimal solution. Managerial Implications: Our results shed light on the airline seat protection level decision with learning about demand parameters and buy-up substitution probability. Compared with myopic optimization, Bayesian inventory decisions that consider the exploration-exploitation tradeoff can avoid getting stuck in local optima and improve the revenue. We also identify new driving forces behind the stock more (less) result that complement the Bayesian inventory management literature. Funding: Z. Luo acknowledges the financial support by the Internal Start-up Fund of The Hong Kong Polytechnic University [Grant P0039035]. P. Guo acknowledges the financial support from the Research Grants Council of Hong Kong [Grant 15508518]. Y. Wang’s work was supported by the Research Grants Council of Hong Kong [Grant 15505318] and the National Natural Science Foundation of China [Grant 71971184]. Supplemental Material: The e-companion is available at https://doi.org/10.1287/msom.2022.1169 .","PeriodicalId":18108,"journal":{"name":"Manuf. Serv. Oper. Manag.","volume":"24 1","pages":"563-580"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Manuf. Serv. Oper. Manag.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1287/msom.2022.1169","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Problem Definition: This paper considers a setting in which an airline company sells seats periodically, and each period consists of two selling phases, an early-bird discount phase and a regular-price phase. In each period, when the early-bird discount seat is stocked out, an early-bird customer who comes for the discounted seat either purchases the regular-price seat as a substitute (called buy-up substitution) or simply leaves. Methodology/Results: The optimal inventory level of the discounted seats reserved for the early-bird sale is a critical decision for the airline company to maximize its revenue. The airline company learns about the demands for both discounted and regular-price seats and the buy-up substitution probability from historical sales data, which, in turn, are affected by past inventory allocation decisions. In this paper, we investigate two information scenarios based on whether lost sales are observable, and we provide the corresponding Bayesian updating mechanism for learning about demand parameters and substitution probability. We then construct a dynamic programming model to derive the Bayesian optimal inventory level decisions in a multiperiod setting. The literature finds that the unobservability of lost sales drives the inventory manager to stock more (i.e., the Bayesian optimal inventory level should be kept higher than the myopic inventory level) to observe and learn more about demand distributions. Here, we show that when the buy-up substitution probability is known, one may stock less, because one can infer some information about the primary demand for the discounted seat from the customer substitution behavior. We also find that to learn about the unknown buy-up substitution probability drives the inventory manager to stock less so as to induce more substitution trials. Finally, we develop a SoftMax algorithm to solve our dynamic programming problem. We show that the obtained stock more (less) result can be utilized to speed up the convergence of the algorithm to the optimal solution. Managerial Implications: Our results shed light on the airline seat protection level decision with learning about demand parameters and buy-up substitution probability. Compared with myopic optimization, Bayesian inventory decisions that consider the exploration-exploitation tradeoff can avoid getting stuck in local optima and improve the revenue. We also identify new driving forces behind the stock more (less) result that complement the Bayesian inventory management literature. Funding: Z. Luo acknowledges the financial support by the Internal Start-up Fund of The Hong Kong Polytechnic University [Grant P0039035]. P. Guo acknowledges the financial support from the Research Grants Council of Hong Kong [Grant 15508518]. Y. Wang’s work was supported by the Research Grants Council of Hong Kong [Grant 15505318] and the National Natural Science Foundation of China [Grant 71971184]. Supplemental Material: The e-companion is available at https://doi.org/10.1287/msom.2022.1169 .