采矿复合体同步随机优化的学习调度启发式

Comput. Oper. Res. Pub Date : 2022-02-25 DOI:10.2139/ssrn.4229477

Yassine Yaakoubi, R. Dimitrakopoulos

{"title":"采矿复合体同步随机优化的学习调度启发式","authors":"Yassine Yaakoubi, R. Dimitrakopoulos","doi":"10.2139/ssrn.4229477","DOIUrl":null,"url":null,"abstract":"The simultaneous stochastic optimization of mining complexes (SSOMC) is a large-scale stochastic combinatorial optimization problem that simultaneously manages the extraction of materials from multiple mines and their processing using interconnected facilities to generate a set of final products, while taking into account material supply (geological) uncertainty to manage the associated risk. Although simulated annealing has been shown to outperform comparing methods for solving the SSOMC, early performance might dominate recent performance in that a combination of the heuristics' performance is used to determine which perturbations to apply. This work proposes a data-driven framework for heuristic scheduling in a fully self-managed hyper-heuristic to solve the SSOMC. The proposed learn-to-perturb (L2P) hyper-heuristic is a multi-neighborhood simulated annealing algorithm. The L2P selects the heuristic (perturbation) to be applied in a self-adaptive manner using reinforcement learning to efficiently explore which local search is best suited for a particular search point. Several state-of-the-art agents have been incorporated into L2P to better adapt the search and guide it towards better solutions. By learning from data describing the performance of the heuristics, a problem-specific ordering of heuristics that collectively finds better solutions faster is obtained. L2P is tested on several real-world mining complexes, with an emphasis on efficiency, robustness, and generalization capacity. Results show a reduction in the number of iterations by 30-50% and in the computational time by 30-45%.","PeriodicalId":10582,"journal":{"name":"Comput. Oper. Res.","volume":"54 1","pages":"106349"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Learning to Schedule Heuristics for the Simultaneous Stochastic Optimization of Mining Complexes\",\"authors\":\"Yassine Yaakoubi, R. Dimitrakopoulos\",\"doi\":\"10.2139/ssrn.4229477\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The simultaneous stochastic optimization of mining complexes (SSOMC) is a large-scale stochastic combinatorial optimization problem that simultaneously manages the extraction of materials from multiple mines and their processing using interconnected facilities to generate a set of final products, while taking into account material supply (geological) uncertainty to manage the associated risk. Although simulated annealing has been shown to outperform comparing methods for solving the SSOMC, early performance might dominate recent performance in that a combination of the heuristics' performance is used to determine which perturbations to apply. This work proposes a data-driven framework for heuristic scheduling in a fully self-managed hyper-heuristic to solve the SSOMC. The proposed learn-to-perturb (L2P) hyper-heuristic is a multi-neighborhood simulated annealing algorithm. The L2P selects the heuristic (perturbation) to be applied in a self-adaptive manner using reinforcement learning to efficiently explore which local search is best suited for a particular search point. Several state-of-the-art agents have been incorporated into L2P to better adapt the search and guide it towards better solutions. By learning from data describing the performance of the heuristics, a problem-specific ordering of heuristics that collectively finds better solutions faster is obtained. L2P is tested on several real-world mining complexes, with an emphasis on efficiency, robustness, and generalization capacity. Results show a reduction in the number of iterations by 30-50% and in the computational time by 30-45%.\",\"PeriodicalId\":10582,\"journal\":{\"name\":\"Comput. Oper. Res.\",\"volume\":\"54 1\",\"pages\":\"106349\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-02-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Comput. Oper. Res.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2139/ssrn.4229477\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Comput. Oper. Res.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2139/ssrn.4229477","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

矿山复合体同步随机优化(SSOMC)是一个大规模随机组合优化问题，它同时管理从多个矿山中提取材料并使用相互关联的设施进行加工以产生一组最终产品，同时考虑材料供应(地质)的不确定性以管理相关风险。虽然模拟退火在解决SSOMC方面的表现优于比较方法，但早期的性能可能会主导最近的性能，因为启发式性能的组合用于确定应用哪种扰动。本文提出了一个数据驱动的启发式调度框架，在一个完全自我管理的超启发式中解决SSOMC问题。提出的L2P超启发式算法是一种多邻域模拟退火算法。L2P选择启发式(扰动)以自适应的方式应用，使用强化学习来有效地探索最适合特定搜索点的局部搜索。几个最先进的代理已被纳入L2P，以更好地适应搜索并引导其找到更好的解决方案。通过从描述启发式性能的数据中学习，可以获得特定于问题的启发式排序，从而更快地找到更好的解决方案。L2P在几个现实世界的采矿复合体上进行了测试，重点是效率、鲁棒性和泛化能力。结果表明，迭代次数减少了30-50%，计算时间减少了30-45%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Learning to Schedule Heuristics for the Simultaneous Stochastic Optimization of Mining Complexes

The simultaneous stochastic optimization of mining complexes (SSOMC) is a large-scale stochastic combinatorial optimization problem that simultaneously manages the extraction of materials from multiple mines and their processing using interconnected facilities to generate a set of final products, while taking into account material supply (geological) uncertainty to manage the associated risk. Although simulated annealing has been shown to outperform comparing methods for solving the SSOMC, early performance might dominate recent performance in that a combination of the heuristics' performance is used to determine which perturbations to apply. This work proposes a data-driven framework for heuristic scheduling in a fully self-managed hyper-heuristic to solve the SSOMC. The proposed learn-to-perturb (L2P) hyper-heuristic is a multi-neighborhood simulated annealing algorithm. The L2P selects the heuristic (perturbation) to be applied in a self-adaptive manner using reinforcement learning to efficiently explore which local search is best suited for a particular search point. Several state-of-the-art agents have been incorporated into L2P to better adapt the search and guide it towards better solutions. By learning from data describing the performance of the heuristics, a problem-specific ordering of heuristics that collectively finds better solutions faster is obtained. L2P is tested on several real-world mining complexes, with an emphasis on efficiency, robustness, and generalization capacity. Results show a reduction in the number of iterations by 30-50% and in the computational time by 30-45%.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Comput. Oper. Res.

自引率

0.00%

发文量