在季节变化的情况下,物联网驱动的新鲜农产品动态补充:一种使用奖励塑造的深度强化学习方法

IF 6.7 2区 管理学 Q1 MANAGEMENT
Zihao Wang , Wenlong Wang , Tianjun Liu , Jasmine Chang , Jim Shi
{"title":"在季节变化的情况下,物联网驱动的新鲜农产品动态补充:一种使用奖励塑造的深度强化学习方法","authors":"Zihao Wang ,&nbsp;Wenlong Wang ,&nbsp;Tianjun Liu ,&nbsp;Jasmine Chang ,&nbsp;Jim Shi","doi":"10.1016/j.omega.2025.103299","DOIUrl":null,"url":null,"abstract":"<div><div>Internet of things (IoT) has been transforming inventory management disruptively by linking and synchronizing inventory products together. It is one of the driving forces for the prevailing innovation of AgriTech. For fresh produce replenishment in the presence of its inherent seasonal variations, not only can IoT devices capture bidirectional seasonal information of lead time and demand, but also detect fresh produce loss and waste (FPLW) caused by deterioration. With the aid of the massive data collected by IoT, we propose a data-driven deep reinforcement learning (DRL) approach using reward shaping, called DQN-SV-RS, to optimize the dynamic replenishment policy for a fresh produce wholesaler, specifically addressing the challenge posed by seasonal variations. Experimental results show that our DQN-SV-SR approach yields significant improvements for fresh produce supply chain (FPSC) inventory management, especially achieving a remarkable reduction in FPLW. As a core innovation in our DQN-SV-SR approach, the introduced reward shaping can significantly mitigate lost sales and inventory holding, thereby lowering the total cost. Furthermore, with numerical experiments based on real business data, our proposed approach is demonstrated with plausible robustness and scalable applicability.</div></div>","PeriodicalId":19529,"journal":{"name":"Omega-international Journal of Management Science","volume":"134 ","pages":"Article 103299"},"PeriodicalIF":6.7000,"publicationDate":"2025-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"IoT-driven dynamic replenishment of fresh produce in the presence of seasonal variations: A deep reinforcement learning approach using reward shaping\",\"authors\":\"Zihao Wang ,&nbsp;Wenlong Wang ,&nbsp;Tianjun Liu ,&nbsp;Jasmine Chang ,&nbsp;Jim Shi\",\"doi\":\"10.1016/j.omega.2025.103299\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Internet of things (IoT) has been transforming inventory management disruptively by linking and synchronizing inventory products together. It is one of the driving forces for the prevailing innovation of AgriTech. For fresh produce replenishment in the presence of its inherent seasonal variations, not only can IoT devices capture bidirectional seasonal information of lead time and demand, but also detect fresh produce loss and waste (FPLW) caused by deterioration. With the aid of the massive data collected by IoT, we propose a data-driven deep reinforcement learning (DRL) approach using reward shaping, called DQN-SV-RS, to optimize the dynamic replenishment policy for a fresh produce wholesaler, specifically addressing the challenge posed by seasonal variations. Experimental results show that our DQN-SV-SR approach yields significant improvements for fresh produce supply chain (FPSC) inventory management, especially achieving a remarkable reduction in FPLW. As a core innovation in our DQN-SV-SR approach, the introduced reward shaping can significantly mitigate lost sales and inventory holding, thereby lowering the total cost. Furthermore, with numerical experiments based on real business data, our proposed approach is demonstrated with plausible robustness and scalable applicability.</div></div>\",\"PeriodicalId\":19529,\"journal\":{\"name\":\"Omega-international Journal of Management Science\",\"volume\":\"134 \",\"pages\":\"Article 103299\"},\"PeriodicalIF\":6.7000,\"publicationDate\":\"2025-02-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Omega-international Journal of Management Science\",\"FirstCategoryId\":\"91\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0305048325000258\",\"RegionNum\":2,\"RegionCategory\":\"管理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MANAGEMENT\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Omega-international Journal of Management Science","FirstCategoryId":"91","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0305048325000258","RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MANAGEMENT","Score":null,"Total":0}
引用次数: 0

摘要

物联网(IoT)通过将库存产品连接和同步在一起,正在颠覆性地改变库存管理。它是AgriTech普遍创新的驱动力之一。对于存在固有季节性变化的新鲜农产品补给,物联网设备不仅可以捕获交货时间和需求的双向季节性信息,还可以检测因变质造成的新鲜农产品损失和浪费(FPLW)。在物联网收集的大量数据的帮助下,我们提出了一种数据驱动的深度强化学习(DRL)方法,使用奖励塑造,称为DQN-SV-RS,来优化生鲜农产品批发商的动态补货政策,特别是应对季节变化带来的挑战。实验结果表明,我们的DQN-SV-SR方法显著改善了生鲜农产品供应链(FPSC)库存管理,特别是显著降低了FPLW。作为我们DQN-SV-SR方法的核心创新,引入奖励塑造可以显著减轻销售损失和库存持有,从而降低总成本。此外,基于实际业务数据的数值实验表明,该方法具有良好的鲁棒性和可扩展性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
IoT-driven dynamic replenishment of fresh produce in the presence of seasonal variations: A deep reinforcement learning approach using reward shaping
Internet of things (IoT) has been transforming inventory management disruptively by linking and synchronizing inventory products together. It is one of the driving forces for the prevailing innovation of AgriTech. For fresh produce replenishment in the presence of its inherent seasonal variations, not only can IoT devices capture bidirectional seasonal information of lead time and demand, but also detect fresh produce loss and waste (FPLW) caused by deterioration. With the aid of the massive data collected by IoT, we propose a data-driven deep reinforcement learning (DRL) approach using reward shaping, called DQN-SV-RS, to optimize the dynamic replenishment policy for a fresh produce wholesaler, specifically addressing the challenge posed by seasonal variations. Experimental results show that our DQN-SV-SR approach yields significant improvements for fresh produce supply chain (FPSC) inventory management, especially achieving a remarkable reduction in FPLW. As a core innovation in our DQN-SV-SR approach, the introduced reward shaping can significantly mitigate lost sales and inventory holding, thereby lowering the total cost. Furthermore, with numerical experiments based on real business data, our proposed approach is demonstrated with plausible robustness and scalable applicability.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Omega-international Journal of Management Science
Omega-international Journal of Management Science 管理科学-运筹学与管理科学
CiteScore
13.80
自引率
11.60%
发文量
130
审稿时长
56 days
期刊介绍: Omega reports on developments in management, including the latest research results and applications. Original contributions and review articles describe the state of the art in specific fields or functions of management, while there are shorter critical assessments of particular management techniques. Other features of the journal are the "Memoranda" section for short communications and "Feedback", a correspondence column. Omega is both stimulating reading and an important source for practising managers, specialists in management services, operational research workers and management scientists, management consultants, academics, students and research personnel throughout the world. The material published is of high quality and relevance, written in a manner which makes it accessible to all of this wide-ranging readership. Preference will be given to papers with implications to the practice of management. Submissions of purely theoretical papers are discouraged. The review of material for publication in the journal reflects this aim.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信