电力控制中的分布式强化学习智能体集成

2022 IEEE International Conference on Omni-layer Intelligent Systems (COINS) Pub Date : 2022-08-01 DOI:10.1109/COINS54846.2022.9854987

Pierrick Pochelu, B. Conche, S. Petiton

{"title":"电力控制中的分布式强化学习智能体集成","authors":"Pierrick Pochelu, B. Conche, S. Petiton","doi":"10.1109/COINS54846.2022.9854987","DOIUrl":null,"url":null,"abstract":"Deep Reinforcement Learning (or just \"RL\") is gaining popularity for industrial and research applications. However, it still suffers from some key limits slowing down its widespread adoption. Its performance is sensitive to initial conditions and non-determinism. To unlock those challenges, we propose a procedure to ensemble of RL agents based to efficiently build better local decisions towards long-term cumulated rewards. For the first time, hundreds of experiments have been done to compare different ensemble constructions procedure on 2 electricity control environments. We discovered an ensemble of 4 agents improves accumulated rewards by 36% in average, improve stability by factor 2.05 and can naturally and efficiently trained and predicted in parallel on GPUs and CPUs.","PeriodicalId":187055,"journal":{"name":"2022 IEEE International Conference on Omni-layer Intelligent Systems (COINS)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Distributed Ensembles of Reinforcement Learning Agents for Electricity Control\",\"authors\":\"Pierrick Pochelu, B. Conche, S. Petiton\",\"doi\":\"10.1109/COINS54846.2022.9854987\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Deep Reinforcement Learning (or just \\\"RL\\\") is gaining popularity for industrial and research applications. However, it still suffers from some key limits slowing down its widespread adoption. Its performance is sensitive to initial conditions and non-determinism. To unlock those challenges, we propose a procedure to ensemble of RL agents based to efficiently build better local decisions towards long-term cumulated rewards. For the first time, hundreds of experiments have been done to compare different ensemble constructions procedure on 2 electricity control environments. We discovered an ensemble of 4 agents improves accumulated rewards by 36% in average, improve stability by factor 2.05 and can naturally and efficiently trained and predicted in parallel on GPUs and CPUs.\",\"PeriodicalId\":187055,\"journal\":{\"name\":\"2022 IEEE International Conference on Omni-layer Intelligent Systems (COINS)\",\"volume\":\"13 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE International Conference on Omni-layer Intelligent Systems (COINS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/COINS54846.2022.9854987\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Omni-layer Intelligent Systems (COINS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/COINS54846.2022.9854987","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

深度强化学习(或简称“RL”)在工业和研究应用中越来越受欢迎。然而，它仍然受到一些阻碍其广泛采用的关键限制。其性能对初始条件和非确定性敏感。为了解决这些挑战，我们提出了一个基于强化学习代理的集成程序，以有效地构建更好的针对长期累积奖励的本地决策。首次进行了数百次实验，比较了两种电控制环境下不同的集成构建过程。我们发现4个智能体的集合平均提高了36%的累积奖励，提高了2.05倍的稳定性，并且可以在gpu和cpu上自然有效地并行训练和预测。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Distributed Ensembles of Reinforcement Learning Agents for Electricity Control

Deep Reinforcement Learning (or just "RL") is gaining popularity for industrial and research applications. However, it still suffers from some key limits slowing down its widespread adoption. Its performance is sensitive to initial conditions and non-determinism. To unlock those challenges, we propose a procedure to ensemble of RL agents based to efficiently build better local decisions towards long-term cumulated rewards. For the first time, hundreds of experiments have been done to compare different ensemble constructions procedure on 2 electricity control environments. We discovered an ensemble of 4 agents improves accumulated rewards by 36% in average, improve stability by factor 2.05 and can naturally and efficiently trained and predicted in parallel on GPUs and CPUs.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 IEEE International Conference on Omni-layer Intelligent Systems (COINS)

自引率

0.00%

发文量