提高多智能体强化学习在暖通空调系统控制中的性能

2022 IEEE Conference on Technologies for Sustainability (SusTech) Pub Date : 2022-04-21 DOI:10.1109/SusTech53338.2022.9794179

D. Bayer, M. Pruckner

{"title":"提高多智能体强化学习在暖通空调系统控制中的性能","authors":"D. Bayer, M. Pruckner","doi":"10.1109/SusTech53338.2022.9794179","DOIUrl":null,"url":null,"abstract":"Systems for heating, ventilation and air-conditioning (HVAC) of buildings are traditionally controlled by a rule-based approach. In order to reduce the energy consumption and the environmental impact of HVAC systems more advanced control methods such as reinforcement learning are promising. Reinforcement learning (RL) strategies offer a good alternative, as user feedback can be integrated more easily and presence can also be incorporated. Moreover, multi-agent RL approaches scale well and can be generalized. In this paper, we propose a multi-agent RL framework based on existing work that learns reducing on one hand energy consumption by optimizing HVAC control and on the other hand user feedback by occupants about uncomfortable room temperatures. Second, we show how to reduce training time required for proper RL-agent-training by using parameter sharing between the multiple agents and apply different pretraining techniques. Results show that our framework is capable of reducing the energy by around 6% when controlling a complete building or 8% for a single room zone. The occupants complaints are acceptable or even better compared to a rule-based baseline. Additionally, our performance analysis show that the training time can be drastically reduced by using parameter sharing.","PeriodicalId":434652,"journal":{"name":"2022 IEEE Conference on Technologies for Sustainability (SusTech)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Enhancing the Performance of Multi-Agent Reinforcement Learning for Controlling HVAC Systems\",\"authors\":\"D. Bayer, M. Pruckner\",\"doi\":\"10.1109/SusTech53338.2022.9794179\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Systems for heating, ventilation and air-conditioning (HVAC) of buildings are traditionally controlled by a rule-based approach. In order to reduce the energy consumption and the environmental impact of HVAC systems more advanced control methods such as reinforcement learning are promising. Reinforcement learning (RL) strategies offer a good alternative, as user feedback can be integrated more easily and presence can also be incorporated. Moreover, multi-agent RL approaches scale well and can be generalized. In this paper, we propose a multi-agent RL framework based on existing work that learns reducing on one hand energy consumption by optimizing HVAC control and on the other hand user feedback by occupants about uncomfortable room temperatures. Second, we show how to reduce training time required for proper RL-agent-training by using parameter sharing between the multiple agents and apply different pretraining techniques. Results show that our framework is capable of reducing the energy by around 6% when controlling a complete building or 8% for a single room zone. The occupants complaints are acceptable or even better compared to a rule-based baseline. Additionally, our performance analysis show that the training time can be drastically reduced by using parameter sharing.\",\"PeriodicalId\":434652,\"journal\":{\"name\":\"2022 IEEE Conference on Technologies for Sustainability (SusTech)\",\"volume\":\"35 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-04-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE Conference on Technologies for Sustainability (SusTech)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SusTech53338.2022.9794179\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE Conference on Technologies for Sustainability (SusTech)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SusTech53338.2022.9794179","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

建筑的供暖、通风和空调(HVAC)系统传统上是由基于规则的方法控制的。为了减少暖通空调系统的能源消耗和环境影响，更先进的控制方法如强化学习是有希望的。强化学习(RL)策略提供了一个很好的选择，因为用户反馈可以更容易地集成，并且也可以整合存在。此外，多智能体强化学习方法具有良好的可扩展性和泛化性。在本文中，我们提出了一个基于现有工作的多智能体强化学习框架，该框架一方面通过优化HVAC控制来学习减少能耗，另一方面通过居住者对不舒服的室温的用户反馈来学习减少能耗。其次，我们展示了如何通过在多个智能体之间使用参数共享和应用不同的预训练技术来减少正确的强化学习智能体训练所需的训练时间。结果表明，我们的框架能够在控制整个建筑时减少约6%的能源，或者在单个房间区域减少8%的能源。与基于规则的基线相比，居住者的投诉是可以接受的，甚至更好。此外，我们的性能分析表明，使用参数共享可以大大减少训练时间。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Enhancing the Performance of Multi-Agent Reinforcement Learning for Controlling HVAC Systems

Systems for heating, ventilation and air-conditioning (HVAC) of buildings are traditionally controlled by a rule-based approach. In order to reduce the energy consumption and the environmental impact of HVAC systems more advanced control methods such as reinforcement learning are promising. Reinforcement learning (RL) strategies offer a good alternative, as user feedback can be integrated more easily and presence can also be incorporated. Moreover, multi-agent RL approaches scale well and can be generalized. In this paper, we propose a multi-agent RL framework based on existing work that learns reducing on one hand energy consumption by optimizing HVAC control and on the other hand user feedback by occupants about uncomfortable room temperatures. Second, we show how to reduce training time required for proper RL-agent-training by using parameter sharing between the multiple agents and apply different pretraining techniques. Results show that our framework is capable of reducing the energy by around 6% when controlling a complete building or 8% for a single room zone. The occupants complaints are acceptable or even better compared to a rule-based baseline. Additionally, our performance analysis show that the training time can be drastically reduced by using parameter sharing.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 IEEE Conference on Technologies for Sustainability (SusTech)

自引率

0.00%

发文量