基于人类团队的扰动控制深度q学习增强对抗鲁棒性

IF 2 Q2 AUTOMATION & CONTROL SYSTEMS

IEEE Control Systems Letters Pub Date : 2025-07-30 DOI:10.1109/LCSYS.2025.3594257

Sadredin Hokmi;Pegah Moushaee;Mohammad Haeri

{"title":"基于人类团队的扰动控制深度q学习增强对抗鲁棒性","authors":"Sadredin Hokmi;Pegah Moushaee;Mohammad Haeri","doi":"10.1109/LCSYS.2025.3594257","DOIUrl":null,"url":null,"abstract":"In this letter, an integrated framework of perturbation-controlled deep Q-network with human- teaming is proposed to effectively mitigate the impact of adversarial disturbances, specifically false data injection and denial-of-service attacks. Through a convergence and error-compensation mechanism, the proposed integration substantially reduces the effects of such errors. The incorporation of human intervention introduces a favorable trade-off between convergence speed and robustness, which is particularly critical in safety-sensitive applications where robustness must take precedence over fast convergence through adaptive quantized perturbation injection integrating with human-teaming. Consequently, the algorithm enables efficient and reliable recovery while maintaining satisfactory performance levels. Simulation results demonstrate that within adversarial intervals, the proposed method exhibits superior capability in mitigating and compensating for injected errors compared to conventional deep Q-network-based approach.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":"9 ","pages":"2303-2308"},"PeriodicalIF":2.0000,"publicationDate":"2025-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Perturbation-Controlled Deep Q-Learning With Human-Teaming for Enhancing Adversarial Robustness\",\"authors\":\"Sadredin Hokmi;Pegah Moushaee;Mohammad Haeri\",\"doi\":\"10.1109/LCSYS.2025.3594257\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this letter, an integrated framework of perturbation-controlled deep Q-network with human- teaming is proposed to effectively mitigate the impact of adversarial disturbances, specifically false data injection and denial-of-service attacks. Through a convergence and error-compensation mechanism, the proposed integration substantially reduces the effects of such errors. The incorporation of human intervention introduces a favorable trade-off between convergence speed and robustness, which is particularly critical in safety-sensitive applications where robustness must take precedence over fast convergence through adaptive quantized perturbation injection integrating with human-teaming. Consequently, the algorithm enables efficient and reliable recovery while maintaining satisfactory performance levels. Simulation results demonstrate that within adversarial intervals, the proposed method exhibits superior capability in mitigating and compensating for injected errors compared to conventional deep Q-network-based approach.\",\"PeriodicalId\":37235,\"journal\":{\"name\":\"IEEE Control Systems Letters\",\"volume\":\"9 \",\"pages\":\"2303-2308\"},\"PeriodicalIF\":2.0000,\"publicationDate\":\"2025-07-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Control Systems Letters\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11104842/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Control Systems Letters","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/11104842/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

在这封信中，提出了一个具有人类团队的扰动控制深度q网络的集成框架，以有效减轻对抗性干扰的影响，特别是虚假数据注入和拒绝服务攻击。通过收敛和误差补偿机制，所提出的集成大大降低了这些误差的影响。人为干预的结合在收敛速度和鲁棒性之间引入了有利的权衡，这在安全性敏感的应用中尤为重要，因为鲁棒性必须优先于通过自适应量化扰动注入与人类团队相结合的快速收敛。因此，该算法可以在保持令人满意的性能水平的同时实现高效可靠的恢复。仿真结果表明，在对抗区间内，与传统的基于深度q网络的方法相比，该方法具有更好的抑制和补偿注入误差的能力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Perturbation-Controlled Deep Q-Learning With Human-Teaming for Enhancing Adversarial Robustness

In this letter, an integrated framework of perturbation-controlled deep Q-network with human- teaming is proposed to effectively mitigate the impact of adversarial disturbances, specifically false data injection and denial-of-service attacks. Through a convergence and error-compensation mechanism, the proposed integration substantially reduces the effects of such errors. The incorporation of human intervention introduces a favorable trade-off between convergence speed and robustness, which is particularly critical in safety-sensitive applications where robustness must take precedence over fast convergence through adaptive quantized perturbation injection integrating with human-teaming. Consequently, the algorithm enables efficient and reliable recovery while maintaining satisfactory performance levels. Simulation results demonstrate that within adversarial intervals, the proposed method exhibits superior capability in mitigating and compensating for injected errors compared to conventional deep Q-network-based approach.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE Control Systems Letters Mathematics-Control and Optimization

CiteScore

4.40

自引率

13.30%

发文量

471