{"title":"AI vs. Human Buyers: A Study of Alibaba’s Inventory Replenishment System","authors":"Jiaxi Liu, Shuyi Lin, Linwei Xin, Yidong Zhang","doi":"10.1287/inte.2023.1160","DOIUrl":null,"url":null,"abstract":"Inventory management is one of the most important components of Alibaba’s business. Traditionally, human buyers make replenishment decisions: although artificial intelligence (AI) algorithms make recommendations, human buyers can choose to ignore these recommendations and make their own decisions. The company has been exploring a new replenishment system in which algorithmic recommendations are final. The algorithms combine state-of-the-art deep reinforcement learning techniques with the framework of fictitious play. By learning the supplier’s behavior, we are able to address the important issues of lead time and fill rate on order quantity, which have been ignored in the extant literature of stochastic inventory control. We present evidence that our algorithms outperform human buyers in terms of reducing out-of-stock rates and inventory levels. More interestingly, we have seen additional benefits amid the pandemic. Over the last two years, cities in China partially and intermittently locked down to mitigate COVID-19 outbreaks. We have observed panic buying from human buyers during lockdowns, leading to the bullwhip effect. By contrast, panic buying and the bullwhip effect can be mitigated using our algorithms due to their ability to recognize changes in the supplier’s behavior during lockdowns. History: This paper has been accepted for the INFORMS Journal on Applied Analytics Special Issue—2022 Daniel H. Wagner Prize for Excellence in the Practice of Advanced Analytics and Operations Research.","PeriodicalId":53206,"journal":{"name":"Informs Journal on Applied Analytics","volume":"25 1","pages":"0"},"PeriodicalIF":1.1000,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Informs Journal on Applied Analytics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1287/inte.2023.1160","RegionNum":4,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"MANAGEMENT","Score":null,"Total":0}
引用次数: 1
Abstract
Inventory management is one of the most important components of Alibaba’s business. Traditionally, human buyers make replenishment decisions: although artificial intelligence (AI) algorithms make recommendations, human buyers can choose to ignore these recommendations and make their own decisions. The company has been exploring a new replenishment system in which algorithmic recommendations are final. The algorithms combine state-of-the-art deep reinforcement learning techniques with the framework of fictitious play. By learning the supplier’s behavior, we are able to address the important issues of lead time and fill rate on order quantity, which have been ignored in the extant literature of stochastic inventory control. We present evidence that our algorithms outperform human buyers in terms of reducing out-of-stock rates and inventory levels. More interestingly, we have seen additional benefits amid the pandemic. Over the last two years, cities in China partially and intermittently locked down to mitigate COVID-19 outbreaks. We have observed panic buying from human buyers during lockdowns, leading to the bullwhip effect. By contrast, panic buying and the bullwhip effect can be mitigated using our algorithms due to their ability to recognize changes in the supplier’s behavior during lockdowns. History: This paper has been accepted for the INFORMS Journal on Applied Analytics Special Issue—2022 Daniel H. Wagner Prize for Excellence in the Practice of Advanced Analytics and Operations Research.
库存管理是阿里巴巴业务最重要的组成部分之一。传统上,人类买家做出补货决策:尽管人工智能(AI)算法会提出建议,但人类买家可以选择忽略这些建议,自己做出决定。该公司一直在探索一种新的补货系统,其中算法推荐是最终的。该算法结合了最先进的深度强化学习技术和虚拟游戏框架。通过对供应商行为的学习,可以解决现有随机库存控制文献中忽略的交货时间和交货率对订单数量的重要影响。我们提供的证据表明,我们的算法在减少缺货率和库存水平方面优于人类买家。更有趣的是,我们在大流行期间看到了额外的好处。在过去两年中,中国的城市部分和间歇性封锁以缓解COVID-19疫情。我们观察到在封锁期间人类买家的恐慌性购买,导致牛鞭效应。相比之下,由于我们的算法能够识别封锁期间供应商行为的变化,因此可以缓解恐慌性购买和牛鞭效应。历史:本文已被INFORMS应用分析杂志特刊- 2022年Daniel H. Wagner高级分析和运筹学实践优秀奖所接受。