{"title":"基于策略梯度的供应链管理强化学习方法","authors":"Yassine Hachaïchi, Yassine Chemingui, M. Affes","doi":"10.1109/IC_ASET49463.2020.9318258","DOIUrl":null,"url":null,"abstract":"Technological advances of the recent decades have significantly affected the business world, the retail business in particular. Retailers need to innovate to maintain a competitive edge over competitors and ensure business sustainability. Therefore Research and Development is a crucial component of businesses growth. Intuition based approaches are replaced by supply chain computerized solutions such as inventory management, warehousing, allocation and replenishment. This paper aims at building a reinforcement learning agent capable of placing optimal orders for the sake of constructing a replenishment plan for next period. The goal is to develop a novel method of inventory replenishment. We base the developed module on the recent breakthroughs of reinforcement learning research in using deep neural networks for control. We compare the classical RL methods to the recently introduced Proximal policy optimization algorithm. As far as we know, this is the first time PPO is used in Supply Chain Management.","PeriodicalId":250315,"journal":{"name":"2020 4th International Conference on Advanced Systems and Emergent Technologies (IC_ASET)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"A Policy Gradient Based Reinforcement Learning Method for Supply Chain Management\",\"authors\":\"Yassine Hachaïchi, Yassine Chemingui, M. Affes\",\"doi\":\"10.1109/IC_ASET49463.2020.9318258\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Technological advances of the recent decades have significantly affected the business world, the retail business in particular. Retailers need to innovate to maintain a competitive edge over competitors and ensure business sustainability. Therefore Research and Development is a crucial component of businesses growth. Intuition based approaches are replaced by supply chain computerized solutions such as inventory management, warehousing, allocation and replenishment. This paper aims at building a reinforcement learning agent capable of placing optimal orders for the sake of constructing a replenishment plan for next period. The goal is to develop a novel method of inventory replenishment. We base the developed module on the recent breakthroughs of reinforcement learning research in using deep neural networks for control. We compare the classical RL methods to the recently introduced Proximal policy optimization algorithm. As far as we know, this is the first time PPO is used in Supply Chain Management.\",\"PeriodicalId\":250315,\"journal\":{\"name\":\"2020 4th International Conference on Advanced Systems and Emergent Technologies (IC_ASET)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-12-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 4th International Conference on Advanced Systems and Emergent Technologies (IC_ASET)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IC_ASET49463.2020.9318258\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 4th International Conference on Advanced Systems and Emergent Technologies (IC_ASET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IC_ASET49463.2020.9318258","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Policy Gradient Based Reinforcement Learning Method for Supply Chain Management
Technological advances of the recent decades have significantly affected the business world, the retail business in particular. Retailers need to innovate to maintain a competitive edge over competitors and ensure business sustainability. Therefore Research and Development is a crucial component of businesses growth. Intuition based approaches are replaced by supply chain computerized solutions such as inventory management, warehousing, allocation and replenishment. This paper aims at building a reinforcement learning agent capable of placing optimal orders for the sake of constructing a replenishment plan for next period. The goal is to develop a novel method of inventory replenishment. We base the developed module on the recent breakthroughs of reinforcement learning research in using deep neural networks for control. We compare the classical RL methods to the recently introduced Proximal policy optimization algorithm. As far as we know, this is the first time PPO is used in Supply Chain Management.