{"title":"基于人类团队的扰动控制深度q学习增强对抗鲁棒性","authors":"Sadredin Hokmi;Pegah Moushaee;Mohammad Haeri","doi":"10.1109/LCSYS.2025.3594257","DOIUrl":null,"url":null,"abstract":"In this letter, an integrated framework of perturbation-controlled deep Q-network with human- teaming is proposed to effectively mitigate the impact of adversarial disturbances, specifically false data injection and denial-of-service attacks. Through a convergence and error-compensation mechanism, the proposed integration substantially reduces the effects of such errors. The incorporation of human intervention introduces a favorable trade-off between convergence speed and robustness, which is particularly critical in safety-sensitive applications where robustness must take precedence over fast convergence through adaptive quantized perturbation injection integrating with human-teaming. Consequently, the algorithm enables efficient and reliable recovery while maintaining satisfactory performance levels. Simulation results demonstrate that within adversarial intervals, the proposed method exhibits superior capability in mitigating and compensating for injected errors compared to conventional deep Q-network-based approach.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":"9 ","pages":"2303-2308"},"PeriodicalIF":2.0000,"publicationDate":"2025-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Perturbation-Controlled Deep Q-Learning With Human-Teaming for Enhancing Adversarial Robustness\",\"authors\":\"Sadredin Hokmi;Pegah Moushaee;Mohammad Haeri\",\"doi\":\"10.1109/LCSYS.2025.3594257\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this letter, an integrated framework of perturbation-controlled deep Q-network with human- teaming is proposed to effectively mitigate the impact of adversarial disturbances, specifically false data injection and denial-of-service attacks. Through a convergence and error-compensation mechanism, the proposed integration substantially reduces the effects of such errors. The incorporation of human intervention introduces a favorable trade-off between convergence speed and robustness, which is particularly critical in safety-sensitive applications where robustness must take precedence over fast convergence through adaptive quantized perturbation injection integrating with human-teaming. Consequently, the algorithm enables efficient and reliable recovery while maintaining satisfactory performance levels. Simulation results demonstrate that within adversarial intervals, the proposed method exhibits superior capability in mitigating and compensating for injected errors compared to conventional deep Q-network-based approach.\",\"PeriodicalId\":37235,\"journal\":{\"name\":\"IEEE Control Systems Letters\",\"volume\":\"9 \",\"pages\":\"2303-2308\"},\"PeriodicalIF\":2.0000,\"publicationDate\":\"2025-07-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Control Systems Letters\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11104842/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Control Systems Letters","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/11104842/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Perturbation-Controlled Deep Q-Learning With Human-Teaming for Enhancing Adversarial Robustness
In this letter, an integrated framework of perturbation-controlled deep Q-network with human- teaming is proposed to effectively mitigate the impact of adversarial disturbances, specifically false data injection and denial-of-service attacks. Through a convergence and error-compensation mechanism, the proposed integration substantially reduces the effects of such errors. The incorporation of human intervention introduces a favorable trade-off between convergence speed and robustness, which is particularly critical in safety-sensitive applications where robustness must take precedence over fast convergence through adaptive quantized perturbation injection integrating with human-teaming. Consequently, the algorithm enables efficient and reliable recovery while maintaining satisfactory performance levels. Simulation results demonstrate that within adversarial intervals, the proposed method exhibits superior capability in mitigating and compensating for injected errors compared to conventional deep Q-network-based approach.