Safe Deep Reinforcement Learning Based on Sample Value Evaluation

2022 IEEE 16th International Conference on Anti-counterfeiting, Security, and Identification (ASID) Pub Date : 2022-12-02 DOI:10.1109/ASID56930.2022.9996075

Rongjun Ye, Hao Wang, Huan Li

{"title":"Safe Deep Reinforcement Learning Based on Sample Value Evaluation","authors":"Rongjun Ye, Hao Wang, Huan Li","doi":"10.1109/ASID56930.2022.9996075","DOIUrl":null,"url":null,"abstract":"In recent years, deep reinforcement learning has combined the advantages of reinforcement learning and deep learning, and has made great progress in decision-making tasks. However, the training of deep reinforcement learning requires frequent interactions between the agent and the environment and repeated experiments. Adversaries have chances to poison the sample data collected by the agent by attacking the experimental environment in the training process, thereby bringing security risks and serious consequences to the training process of reinforcement learning. This work is committed to addressing the security risks in the field of deep reinforcement learning. However, this work improves the algorithm from the perspective of sample data filtering, and improves the security performance of deep reinforcement learning algorithm. There are two contributions in this work: one is to defend against adversarial attacks against deep reinforcement learning through cluster analysis and sample value evaluation; the other is to propose a deep reinforcement learning algorithm based on sample value evaluation on the basis of deterministic strategy gradient algorithm. The algorithm uses the clustering method to classify the sample pool, and measures the contribution value and security risk of the sample to the model training through the sample value evaluation. The classic game experiments show that the proposed algorithm is safe and effective. It reduces the threat of the agent falling into the adversarial sample attack and improves the training performance of deep reinforcement learning.","PeriodicalId":183908,"journal":{"name":"2022 IEEE 16th International Conference on Anti-counterfeiting, Security, and Identification (ASID)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 16th International Conference on Anti-counterfeiting, Security, and Identification (ASID)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ASID56930.2022.9996075","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

In recent years, deep reinforcement learning has combined the advantages of reinforcement learning and deep learning, and has made great progress in decision-making tasks. However, the training of deep reinforcement learning requires frequent interactions between the agent and the environment and repeated experiments. Adversaries have chances to poison the sample data collected by the agent by attacking the experimental environment in the training process, thereby bringing security risks and serious consequences to the training process of reinforcement learning. This work is committed to addressing the security risks in the field of deep reinforcement learning. However, this work improves the algorithm from the perspective of sample data filtering, and improves the security performance of deep reinforcement learning algorithm. There are two contributions in this work: one is to defend against adversarial attacks against deep reinforcement learning through cluster analysis and sample value evaluation; the other is to propose a deep reinforcement learning algorithm based on sample value evaluation on the basis of deterministic strategy gradient algorithm. The algorithm uses the clustering method to classify the sample pool, and measures the contribution value and security risk of the sample to the model training through the sample value evaluation. The classic game experiments show that the proposed algorithm is safe and effective. It reduces the threat of the agent falling into the adversarial sample attack and improves the training performance of deep reinforcement learning.

查看原文本刊更多论文

基于样本值评估的安全深度强化学习

近年来，深度强化学习结合了强化学习和深度学习的优点，在决策任务方面取得了很大进展。然而，深度强化学习的训练需要智能体与环境之间频繁的交互以及反复的实验。在训练过程中，攻击者有机会通过攻击实验环境来毒害agent收集的样本数据，从而给强化学习的训练过程带来安全隐患和严重后果。这项工作致力于解决深度强化学习领域的安全风险。然而，本工作从样本数据过滤的角度对算法进行了改进，提高了深度强化学习算法的安全性能。在这项工作中有两个贡献:一是通过聚类分析和样本值评估来防御针对深度强化学习的对抗性攻击;二是在确定性策略梯度算法的基础上提出了一种基于样本值评估的深度强化学习算法。该算法采用聚类方法对样本池进行分类，并通过样本值评价来度量样本对模型训练的贡献值和安全风险。经典博弈实验表明，该算法是安全有效的。降低了智能体陷入对抗性样本攻击的威胁，提高了深度强化学习的训练性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 IEEE 16th International Conference on Anti-counterfeiting, Security, and Identification (ASID)

自引率

0.00%

发文量