基于近似动态规划的竞争物流仿真输出分析:一种提出的方法

Q3 Decision Sciences

Journal of Defense Analytics and Logistics Pub Date : 2022-11-15 DOI:10.1108/jdal-07-2022-0004

Matthew Powers, Brian O'Flynn

{"title":"基于近似动态规划的竞争物流仿真输出分析:一种提出的方法","authors":"Matthew Powers, Brian O'Flynn","doi":"10.1108/jdal-07-2022-0004","DOIUrl":null,"url":null,"abstract":"PurposeRapid sensitivity analysis and near-optimal decision-making in contested environments are valuable requirements when providing military logistics support. Port of debarkation denial motivates maneuver from strategic operational locations, further complicating logistics support. Simulations enable rapid concept design, experiment and testing that meet these complicated logistic support demands. However, simulation model analyses are time consuming as output data complexity grows with simulation input. This paper proposes a methodology that leverages the benefits of simulation-based insight and the computational speed of approximate dynamic programming (ADP).Design/methodology/approachThis paper describes a simulated contested logistics environment and demonstrates how output data informs the parameters required for the ADP dialect of reinforcement learning (aka Q-learning). Q-learning output includes a near-optimal policy that prescribes decisions for each state modeled in the simulation. This paper's methods conform to DoD simulation modeling practices complemented with AI-enabled decision-making.FindingsThis study demonstrates simulation output data as a means of state–space reduction to mitigate the curse of dimensionality. Furthermore, massive amounts of simulation output data become unwieldy. This work demonstrates how Q-learning parameters reflect simulation inputs so that simulation model behavior can compare to near-optimal policies.Originality/valueFast computation is attractive for sensitivity analysis while divorcing evaluation from scenario-based limitations. The United States military is eager to embrace emerging AI analytic techniques to inform decision-making but is hesitant to abandon simulation modeling. This paper proposes Q-learning as an aid to overcome cognitive limitations in a way that satisfies the desire to wield AI-enabled decision-making combined with modeling and simulation.","PeriodicalId":32838,"journal":{"name":"Journal of Defense Analytics and Logistics","volume":"17 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Contested logistics simulation output analysis with approximate dynamic programming: a proposed methodology\",\"authors\":\"Matthew Powers, Brian O'Flynn\",\"doi\":\"10.1108/jdal-07-2022-0004\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"PurposeRapid sensitivity analysis and near-optimal decision-making in contested environments are valuable requirements when providing military logistics support. Port of debarkation denial motivates maneuver from strategic operational locations, further complicating logistics support. Simulations enable rapid concept design, experiment and testing that meet these complicated logistic support demands. However, simulation model analyses are time consuming as output data complexity grows with simulation input. This paper proposes a methodology that leverages the benefits of simulation-based insight and the computational speed of approximate dynamic programming (ADP).Design/methodology/approachThis paper describes a simulated contested logistics environment and demonstrates how output data informs the parameters required for the ADP dialect of reinforcement learning (aka Q-learning). Q-learning output includes a near-optimal policy that prescribes decisions for each state modeled in the simulation. This paper's methods conform to DoD simulation modeling practices complemented with AI-enabled decision-making.FindingsThis study demonstrates simulation output data as a means of state–space reduction to mitigate the curse of dimensionality. Furthermore, massive amounts of simulation output data become unwieldy. This work demonstrates how Q-learning parameters reflect simulation inputs so that simulation model behavior can compare to near-optimal policies.Originality/valueFast computation is attractive for sensitivity analysis while divorcing evaluation from scenario-based limitations. The United States military is eager to embrace emerging AI analytic techniques to inform decision-making but is hesitant to abandon simulation modeling. This paper proposes Q-learning as an aid to overcome cognitive limitations in a way that satisfies the desire to wield AI-enabled decision-making combined with modeling and simulation.\",\"PeriodicalId\":32838,\"journal\":{\"name\":\"Journal of Defense Analytics and Logistics\",\"volume\":\"17 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Defense Analytics and Logistics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1108/jdal-07-2022-0004\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"Decision Sciences\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Defense Analytics and Logistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1108/jdal-07-2022-0004","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Decision Sciences","Score":null,"Total":0}

引用次数: 0

摘要

在提供军事后勤支持时，在有争议的环境中快速的灵敏度分析和接近最优的决策是有价值的要求。登陆港拒止促使从战略作战地点进行机动，使后勤支援进一步复杂化。模拟可以实现快速的概念设计、实验和测试，以满足这些复杂的后勤支持需求。然而，仿真模型分析是费时的，因为输出数据的复杂性随着仿真输入的增加而增加。本文提出了一种利用基于仿真的洞察力和近似动态规划(ADP)的计算速度的方法。设计/方法/方法本文描述了一个模拟的有争议的物流环境，并演示了输出数据如何通知强化学习(又名q -学习)的ADP方言所需的参数。q学习的输出包括一个近乎最优的策略，该策略为模拟中建模的每个状态规定了决策。本文的方法符合国防部仿真建模实践，并辅以人工智能支持的决策。本研究展示了模拟输出数据作为一种状态空间缩减的手段，以减轻维数的诅咒。此外，大量的模拟输出数据变得难以处理。这项工作展示了Q-learning参数如何反映模拟输入，以便模拟模型行为可以与接近最优的策略进行比较。独创性/价值快速计算对敏感性分析很有吸引力，同时将评估从基于场景的限制中分离出来。美国军方渴望采用新兴的人工智能分析技术来为决策提供信息，但对放弃仿真建模犹豫不决。本文提出Q-learning作为克服认知限制的一种辅助手段，以满足将ai支持的决策与建模和仿真相结合的愿望。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Contested logistics simulation output analysis with approximate dynamic programming: a proposed methodology

PurposeRapid sensitivity analysis and near-optimal decision-making in contested environments are valuable requirements when providing military logistics support. Port of debarkation denial motivates maneuver from strategic operational locations, further complicating logistics support. Simulations enable rapid concept design, experiment and testing that meet these complicated logistic support demands. However, simulation model analyses are time consuming as output data complexity grows with simulation input. This paper proposes a methodology that leverages the benefits of simulation-based insight and the computational speed of approximate dynamic programming (ADP).Design/methodology/approachThis paper describes a simulated contested logistics environment and demonstrates how output data informs the parameters required for the ADP dialect of reinforcement learning (aka Q-learning). Q-learning output includes a near-optimal policy that prescribes decisions for each state modeled in the simulation. This paper's methods conform to DoD simulation modeling practices complemented with AI-enabled decision-making.FindingsThis study demonstrates simulation output data as a means of state–space reduction to mitigate the curse of dimensionality. Furthermore, massive amounts of simulation output data become unwieldy. This work demonstrates how Q-learning parameters reflect simulation inputs so that simulation model behavior can compare to near-optimal policies.Originality/valueFast computation is attractive for sensitivity analysis while divorcing evaluation from scenario-based limitations. The United States military is eager to embrace emerging AI analytic techniques to inform decision-making but is hesitant to abandon simulation modeling. This paper proposes Q-learning as an aid to overcome cognitive limitations in a way that satisfies the desire to wield AI-enabled decision-making combined with modeling and simulation.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of Defense Analytics and Logistics

CiteScore

0.90

自引率

0.00%

发文量

审稿时长

12 weeks