Jinlong Wang , Shangzhuo Zhou , Min Li , Guanyu Ren , Xianquan Ren , Xiaoyun Xiong , Yuanyuan Zhang
{"title":"碳税政策下基于强化学习的废旧电器电子设备闭环供应链多级库存优化","authors":"Jinlong Wang , Shangzhuo Zhou , Min Li , Guanyu Ren , Xianquan Ren , Xiaoyun Xiong , Yuanyuan Zhang","doi":"10.1016/j.engappai.2025.110987","DOIUrl":null,"url":null,"abstract":"<div><div>In response to environmental challenges posed by waste electrical and electronic equipment (WEEE), the WEEE closed-loop supply chain (CLSC) has emerged as a crucial means to promote circular economy through the recycling and reuse of WEEE, which not only reduces waste emissions but also improves the efficiency of resource utilization. In practice, both economic and environmental benefits must be considered in sustainable manufacturing within the CLSC to ensure the sustainable development of enterprises. Therefore, a multi-echelon, multi-period inventory model for the CLSC under carbon tax policy is developed in this paper, which innovatively introduces the carbon footprint to assess environmental impact and integrates the impacts of collection planning and the uncertainty of recycling quantity and product demand on the system, aiming to minimize total enterprise costs. To address the uncertainties in the model, the Proximal Policy Optimization (PPO) algorithm is employed to train a reinforcement learning (RL) agent. This agent enables enterprises to dynamically adjust internal strategies such as collection and production, as well as external procurement strategies, based on inventory levels and market conditions. By internalizing carbon emission costs through the carbon tax rate, the RL agent optimizes total costs while achieving a balance between economic and environmental benefits. Numerical experiments demonstrate that the PPO algorithm outperforms the traditional inventory management policy in terms of both cost control and carbon footprint reduction. Moreover, the moderate carbon tax policy on the CLSC appropriately increases the cost of enterprises while significantly reducing their carbon footprint and promoting sustainable development.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"154 ","pages":"Article 110987"},"PeriodicalIF":8.0000,"publicationDate":"2025-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multi-echelon inventory optimization of waste electrical and electronic equipment closed-loop supply chain based on reinforcement learning under carbon tax policy\",\"authors\":\"Jinlong Wang , Shangzhuo Zhou , Min Li , Guanyu Ren , Xianquan Ren , Xiaoyun Xiong , Yuanyuan Zhang\",\"doi\":\"10.1016/j.engappai.2025.110987\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In response to environmental challenges posed by waste electrical and electronic equipment (WEEE), the WEEE closed-loop supply chain (CLSC) has emerged as a crucial means to promote circular economy through the recycling and reuse of WEEE, which not only reduces waste emissions but also improves the efficiency of resource utilization. In practice, both economic and environmental benefits must be considered in sustainable manufacturing within the CLSC to ensure the sustainable development of enterprises. Therefore, a multi-echelon, multi-period inventory model for the CLSC under carbon tax policy is developed in this paper, which innovatively introduces the carbon footprint to assess environmental impact and integrates the impacts of collection planning and the uncertainty of recycling quantity and product demand on the system, aiming to minimize total enterprise costs. To address the uncertainties in the model, the Proximal Policy Optimization (PPO) algorithm is employed to train a reinforcement learning (RL) agent. This agent enables enterprises to dynamically adjust internal strategies such as collection and production, as well as external procurement strategies, based on inventory levels and market conditions. By internalizing carbon emission costs through the carbon tax rate, the RL agent optimizes total costs while achieving a balance between economic and environmental benefits. Numerical experiments demonstrate that the PPO algorithm outperforms the traditional inventory management policy in terms of both cost control and carbon footprint reduction. Moreover, the moderate carbon tax policy on the CLSC appropriately increases the cost of enterprises while significantly reducing their carbon footprint and promoting sustainable development.</div></div>\",\"PeriodicalId\":50523,\"journal\":{\"name\":\"Engineering Applications of Artificial Intelligence\",\"volume\":\"154 \",\"pages\":\"Article 110987\"},\"PeriodicalIF\":8.0000,\"publicationDate\":\"2025-05-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering Applications of Artificial Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S095219762500987X\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S095219762500987X","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Multi-echelon inventory optimization of waste electrical and electronic equipment closed-loop supply chain based on reinforcement learning under carbon tax policy
In response to environmental challenges posed by waste electrical and electronic equipment (WEEE), the WEEE closed-loop supply chain (CLSC) has emerged as a crucial means to promote circular economy through the recycling and reuse of WEEE, which not only reduces waste emissions but also improves the efficiency of resource utilization. In practice, both economic and environmental benefits must be considered in sustainable manufacturing within the CLSC to ensure the sustainable development of enterprises. Therefore, a multi-echelon, multi-period inventory model for the CLSC under carbon tax policy is developed in this paper, which innovatively introduces the carbon footprint to assess environmental impact and integrates the impacts of collection planning and the uncertainty of recycling quantity and product demand on the system, aiming to minimize total enterprise costs. To address the uncertainties in the model, the Proximal Policy Optimization (PPO) algorithm is employed to train a reinforcement learning (RL) agent. This agent enables enterprises to dynamically adjust internal strategies such as collection and production, as well as external procurement strategies, based on inventory levels and market conditions. By internalizing carbon emission costs through the carbon tax rate, the RL agent optimizes total costs while achieving a balance between economic and environmental benefits. Numerical experiments demonstrate that the PPO algorithm outperforms the traditional inventory management policy in terms of both cost control and carbon footprint reduction. Moreover, the moderate carbon tax policy on the CLSC appropriately increases the cost of enterprises while significantly reducing their carbon footprint and promoting sustainable development.
期刊介绍:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.