Scalable probabilistic forecasting in retail with gradient boosted trees: A practitioner’s approach

IF 9.8 1区工程技术 Q1 ENGINEERING, INDUSTRIAL

International Journal of Production Economics Pub Date : 2024-11-05 DOI:10.1016/j.ijpe.2024.109449

Xueying Long , Quang Bui , Grady Oktavian , Daniel F. Schmidt , Christoph Bergmeir , Rakshitha Godahewa , Seong Per Lee , Kaifeng Zhao , Paul Condylis

{"title":"Scalable probabilistic forecasting in retail with gradient boosted trees: A practitioner’s approach","authors":"Xueying Long , Quang Bui , Grady Oktavian , Daniel F. Schmidt , Christoph Bergmeir , Rakshitha Godahewa , Seong Per Lee , Kaifeng Zhao , Paul Condylis","doi":"10.1016/j.ijpe.2024.109449","DOIUrl":null,"url":null,"abstract":"<div><div>The recent M5 competition has advanced the state-of-the-art in retail forecasting. However, there are important differences between the competition challenge and the challenges we face in a large e-commerce company. The datasets in our scenario are larger (hundreds of thousands of time series), and e-commerce can afford to have a larger stock assortment than brick-and-mortar retailers, leading to more intermittent data. To scale to larger dataset sizes with feasible computational effort, we investigate a two-layer hierarchy, namely the decision level with product unit sales and an aggregated level, e.g., through warehouse-product aggregation, reducing the number of series and degree of intermittency. We propose a top-down approach to forecasting at the aggregated level, and then disaggregate to obtain decision-level forecasts. Probabilistic forecasts are generated under distributional assumptions. The proposed scalable method is evaluated on both a large proprietary dataset, as well as the publicly available Corporación Favorita and M5 datasets. We are able to show the differences in characteristics of the e-commerce and brick-and-mortar retail datasets. Notably, our top-down forecasting framework enters the top 50 of the original M5 competition, even with models trained at a higher level under a much simpler setting.</div></div>","PeriodicalId":14287,"journal":{"name":"International Journal of Production Economics","volume":"279 ","pages":"Article 109449"},"PeriodicalIF":9.8000,"publicationDate":"2024-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Production Economics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925527324003062","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, INDUSTRIAL","Score":null,"Total":0}

引用次数: 0

Abstract

The recent M5 competition has advanced the state-of-the-art in retail forecasting. However, there are important differences between the competition challenge and the challenges we face in a large e-commerce company. The datasets in our scenario are larger (hundreds of thousands of time series), and e-commerce can afford to have a larger stock assortment than brick-and-mortar retailers, leading to more intermittent data. To scale to larger dataset sizes with feasible computational effort, we investigate a two-layer hierarchy, namely the decision level with product unit sales and an aggregated level, e.g., through warehouse-product aggregation, reducing the number of series and degree of intermittency. We propose a top-down approach to forecasting at the aggregated level, and then disaggregate to obtain decision-level forecasts. Probabilistic forecasts are generated under distributional assumptions. The proposed scalable method is evaluated on both a large proprietary dataset, as well as the publicly available Corporación Favorita and M5 datasets. We are able to show the differences in characteristics of the e-commerce and brick-and-mortar retail datasets. Notably, our top-down forecasting framework enters the top 50 of the original M5 competition, even with models trained at a higher level under a much simpler setting.

查看原文本刊更多论文

利用梯度提升树进行零售业可扩展概率预测：实践者的方法

最近的 M5 竞争推动了零售业预测技术的发展。然而，比赛的挑战与我们在大型电子商务公司面临的挑战之间存在重大差异。我们所面临的数据集更大（有数十万个时间序列），而且与实体零售商相比，电子商务可以拥有更多的库存品种，从而产生更多断断续续的数据。为了在可行的计算量下扩展到更大的数据集规模，我们研究了一种双层层次结构，即包含产品单位销售额的决策层和聚合层，例如，通过仓库-产品聚合，减少序列数量和间歇程度。我们提出了一种自上而下的方法，在汇总层进行预测，然后进行分解以获得决策层预测。在分布假设下生成概率预测。我们在大型专有数据集以及公开的 Corporación Favorita 和 M5 数据集上对所提出的可扩展方法进行了评估。我们能够显示出电子商务数据集和实体零售数据集在特征上的差异。值得注意的是，我们的自上而下预测框架在最初的 M5 竞赛中进入了前 50 名，即使是在更简单的设置下以更高水平训练的模型也是如此。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Journal of Production Economics 管理科学-工程：工业

CiteScore

21.40

自引率

7.50%

发文量

266

审稿时长

52 days

期刊介绍： The International Journal of Production Economics focuses on the interface between engineering and management. It covers all aspects of manufacturing and process industries, as well as production in general. The journal is interdisciplinary, considering activities throughout the product life cycle and material flow cycle. It aims to disseminate knowledge for improving industrial practice and strengthening the theoretical base for decision making. The journal serves as a forum for exchanging ideas and presenting new developments in theory and application, combining academic standards with practical value for industrial applications.