{"title":"机器学习在非稳定需求生产者最优控制问题中的应用","authors":"Aleksandr Delev, A. Zhukova, A. Flerova","doi":"10.1109/CoDIT55151.2022.9803997","DOIUrl":null,"url":null,"abstract":"Profit is one of the key performance indicators of the company and each company needs to allocate resources in such a way that it achieves the greatest possible profit. The profit maximization problem is typically a dynamic optimization problem. This paper considers an approach to solving the problem of expanding production, using methods of Reinforcement Learning. The goal is to choose the firm's strategy for the long term: how to use the resources to maximize profits in the long-term period. To make the problem realistic, we consider the case of unstable demand. We study applicability of machine learning methods to optimal control problems, in particular Reinforcement Learning (RL), to optimal control problems arising in economics. To confirm or refute this possibility, we compare the analytical solution to the problem and the estimate obtained using the RL algorithms.","PeriodicalId":185510,"journal":{"name":"2022 8th International Conference on Control, Decision and Information Technologies (CoDIT)","volume":"174 2","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Application of Machine Learning in the producer's optimal control problem with non-stable demand\",\"authors\":\"Aleksandr Delev, A. Zhukova, A. Flerova\",\"doi\":\"10.1109/CoDIT55151.2022.9803997\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Profit is one of the key performance indicators of the company and each company needs to allocate resources in such a way that it achieves the greatest possible profit. The profit maximization problem is typically a dynamic optimization problem. This paper considers an approach to solving the problem of expanding production, using methods of Reinforcement Learning. The goal is to choose the firm's strategy for the long term: how to use the resources to maximize profits in the long-term period. To make the problem realistic, we consider the case of unstable demand. We study applicability of machine learning methods to optimal control problems, in particular Reinforcement Learning (RL), to optimal control problems arising in economics. To confirm or refute this possibility, we compare the analytical solution to the problem and the estimate obtained using the RL algorithms.\",\"PeriodicalId\":185510,\"journal\":{\"name\":\"2022 8th International Conference on Control, Decision and Information Technologies (CoDIT)\",\"volume\":\"174 2\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-05-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 8th International Conference on Control, Decision and Information Technologies (CoDIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CoDIT55151.2022.9803997\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 8th International Conference on Control, Decision and Information Technologies (CoDIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CoDIT55151.2022.9803997","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Application of Machine Learning in the producer's optimal control problem with non-stable demand
Profit is one of the key performance indicators of the company and each company needs to allocate resources in such a way that it achieves the greatest possible profit. The profit maximization problem is typically a dynamic optimization problem. This paper considers an approach to solving the problem of expanding production, using methods of Reinforcement Learning. The goal is to choose the firm's strategy for the long term: how to use the resources to maximize profits in the long-term period. To make the problem realistic, we consider the case of unstable demand. We study applicability of machine learning methods to optimal control problems, in particular Reinforcement Learning (RL), to optimal control problems arising in economics. To confirm or refute this possibility, we compare the analytical solution to the problem and the estimate obtained using the RL algorithms.