Multi-Environment Training Against Reward Poisoning Attacks on Deep Reinforcement Learning

Myria Bouhaddi, K. Adi
{"title":"Multi-Environment Training Against Reward Poisoning Attacks on Deep Reinforcement Learning","authors":"Myria Bouhaddi, K. Adi","doi":"10.5220/0012139900003555","DOIUrl":null,"url":null,"abstract":": Our research tackles the critical challenge of defending against poisoning attacks in deep reinforcement learning, which have significant cybersecurity implications. These attacks involve subtle manipulation of rewards, leading the attacker’s policy to appear optimal under the poisoned rewards, thus compromising the integrity and reliability of such systems. Our goal is to develop robust agents resistant to manipulations. We propose an optimization framework with a multi-environment setting, which enhances resilience and generalization. By exposing agents to diverse environments, we mitigate the impact of poisoning attacks. Additionally, we employ a variance-based method to detect reward manipulation effectively. Leveraging this information, our optimization framework derives a defense policy that fortifies agents against attacks, bolstering their resistance to reward manipulation.","PeriodicalId":74779,"journal":{"name":"SECRYPT ... : proceedings of the International Conference on Security and Cryptography. International Conference on Security and Cryptography","volume":"21 1","pages":"870-875"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"SECRYPT ... : proceedings of the International Conference on Security and Cryptography. International Conference on Security and Cryptography","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5220/0012139900003555","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

Abstract

Our research tackles the critical challenge of defending against reward poisoning attacks in deep reinforcement learning, which have significant cybersecurity implications. These attacks subtly manipulate rewards so that the attacker's policy appears optimal under the poisoned rewards, compromising the integrity and reliability of such systems. Our goal is to develop robust agents that resist these manipulations. We propose an optimization framework with a multi-environment setting, which enhances resilience and generalization: by exposing agents to diverse environments, we mitigate the impact of poisoning attacks. Additionally, we employ a variance-based method to detect reward manipulation effectively. Leveraging this information, our optimization framework derives a defense policy that fortifies agents against attacks, bolstering their resistance to reward manipulation.
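The abstract does not detail how the variance-based detection works, but the idea of comparing rewards observed across multiple environments can be sketched as follows. This is a minimal illustration under our own assumptions, not the paper's implementation: we assume rewards for each (state, action) pair are logged across environments, and that a pair whose reward variance is a statistical outlier relative to the rest of the log is flagged as potentially poisoned. The function name, threshold, and data layout are all hypothetical.

```python
import random
import statistics

def detect_poisoned_rewards(reward_log, z_threshold=3.0):
    """Flag (state, action) pairs whose reward variance across
    environments is anomalously high compared to the other pairs.

    reward_log: dict mapping (state, action) -> list of rewards
    observed for that pair in the different training environments.
    Returns the set of pairs flagged as suspicious.
    """
    # Per-pair variance of the rewards seen across environments.
    variances = {
        sa: statistics.pvariance(rs) if len(rs) > 1 else 0.0
        for sa, rs in reward_log.items()
    }
    vals = list(variances.values())
    mean_v = statistics.mean(vals)
    std_v = statistics.pstdev(vals)
    if std_v == 0:
        return set()
    # A pair is suspicious if its variance is a z-score outlier.
    return {sa for sa, v in variances.items()
            if (v - mean_v) / std_v > z_threshold}

# Toy demonstration: clean rewards are nearly identical across
# environments, while one pair's reward has been perturbed in some
# of them, which inflates its cross-environment variance.
random.seed(0)
log = {("s%d" % i, "a"): [1.0 + random.gauss(0, 0.01) for _ in range(5)]
       for i in range(20)}
log[("s_target", "a")] = [1.0, 1.0, 5.0, -3.0, 1.0]  # manipulated rewards

suspicious = detect_poisoned_rewards(log)
print(suspicious)
```

The intuition is that an attacker who perturbs rewards in only some environments (or inconsistently across them) leaves a variance signature that consistent, unpoisoned reward signals lack; the multi-environment setting is what makes this comparison possible.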