A Methodology to Build Decision Analysis Tools Applied to Distributed Reinforcement Learning

2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) Pub Date : 2022-05-01 DOI:10.1109/IPDPSW55747.2022.00173

Cèdric Prigent, Loïc Cudennec, Alexandru Costan, Gabriel Antoniu

{"title":"A Methodology to Build Decision Analysis Tools Applied to Distributed Reinforcement Learning","authors":"Cèdric Prigent, Loïc Cudennec, Alexandru Costan, Gabriel Antoniu","doi":"10.1109/IPDPSW55747.2022.00173","DOIUrl":null,"url":null,"abstract":"As Artificial Intelligence-based applications become more and more complex, speeding up the learning phase (which is typically computation-intensive) becomes more and more necessary. Distributed machine learning (ML) appears adequate to address this problem. Unfortunately, ML also brings new development frameworks, methodologies and high-level program-ming languages that do not fit to the regular high-performance computing design flow. This paper introduces a methodology to build a decision making tool that allows ML experts to arbitrate between different frameworks and deployment configurations, in order to fulfill project objectives such as the accuracy of the resulting model, the computing speed or the energy consumption of the learning computation. The proposed methodology is applied to an industrial-grade case study in which reinforcement learning is used to train an autonomous steering model for a cargo airdrop system. Results are presented within a Pareto front that lets ML experts choose an appropriate solution, a framework and a deployment configuration, based on the current operational situation. While the proposed approach can effortlessly be applied to other machine learning problems, as for many decision making systems, the selected solutions involve a trade-off between several antagonist evaluation criteria and require experts from different domains to pick the most efficient solution from the short list. Nevertheless, this methodology speeds up the development process by clearly discarding, or, on the contrary, including combinations of frameworks and configurations, which has a significant impact for time and budget-constrained projects.","PeriodicalId":286968,"journal":{"name":"2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IPDPSW55747.2022.00173","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

As Artificial Intelligence-based applications become more and more complex, speeding up the learning phase (which is typically computation-intensive) becomes more and more necessary. Distributed machine learning (ML) appears adequate to address this problem. Unfortunately, ML also brings new development frameworks, methodologies and high-level program-ming languages that do not fit to the regular high-performance computing design flow. This paper introduces a methodology to build a decision making tool that allows ML experts to arbitrate between different frameworks and deployment configurations, in order to fulfill project objectives such as the accuracy of the resulting model, the computing speed or the energy consumption of the learning computation. The proposed methodology is applied to an industrial-grade case study in which reinforcement learning is used to train an autonomous steering model for a cargo airdrop system. Results are presented within a Pareto front that lets ML experts choose an appropriate solution, a framework and a deployment configuration, based on the current operational situation. While the proposed approach can effortlessly be applied to other machine learning problems, as for many decision making systems, the selected solutions involve a trade-off between several antagonist evaluation criteria and require experts from different domains to pick the most efficient solution from the short list. Nevertheless, this methodology speeds up the development process by clearly discarding, or, on the contrary, including combinations of frameworks and configurations, which has a significant impact for time and budget-constrained projects.

查看原文本刊更多论文

一种构建用于分布式强化学习的决策分析工具的方法

随着基于人工智能的应用程序变得越来越复杂，加快学习阶段(通常是计算密集型的)变得越来越必要。分布式机器学习(ML)似乎足以解决这个问题。不幸的是，ML也带来了新的开发框架、方法和高级编程语言，它们不适合常规的高性能计算设计流程。本文介绍了一种构建决策工具的方法，该工具允许机器学习专家在不同的框架和部署配置之间进行仲裁，以实现项目目标，例如结果模型的准确性，计算速度或学习计算的能耗。提出的方法应用于工业级案例研究，其中使用强化学习来训练货物空投系统的自主转向模型。结果在Pareto front中呈现，让ML专家根据当前的操作情况选择合适的解决方案、框架和部署配置。虽然所提出的方法可以毫不费力地应用于其他机器学习问题，但对于许多决策系统来说，所选的解决方案涉及到几个对抗评估标准之间的权衡，并且需要来自不同领域的专家从短列表中选择最有效的解决方案。然而，这种方法通过明确地放弃，或者相反地，包括框架和配置的组合来加速开发过程，这对时间和预算受限的项目有重大影响。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)

自引率

0.00%

发文量