Deep Reinforcement Learning based Autonomous Air-to-Air Combat using Target Trajectory Prediction
J. Yoo, Donghwi Kim, D. Shim
2021 21st International Conference on Control, Automation and Systems (ICCAS), 2021-10-12
https://doi.org/10.23919/ICCAS52745.2021.9649876
This study designed an intelligent control system for autonomous air-to-air combat and verified it in a real-time flight simulation. Previous studies of aerial combat have required significant effort to design agile control actions for each engagement condition. In this work, flight control under random engagement conditions was learned using reinforcement learning and recurrent neural networks. The target trajectory was predicted with a Sequence-to-Sequence model built on LSTM, so that the agent could occupy an advantageous position relative to an enemy aircraft in a close engagement. In addition, this study proposed an algorithm with improved performance compared to the existing algorithm. The results confirmed that the maneuvers of the trained agent were similar to those of human pilots and that the ownship aircraft tracked the predicted future position of the enemy.
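The trajectory-prediction idea in the abstract can be illustrated with a minimal sketch of a Sequence-to-Sequence forward pass built from LSTM cells. The abstract does not give the network sizes, input features, or training procedure, so everything here is an assumption made for illustration: the class names `LSTMCell` and `Seq2SeqPredictor`, the 3-D position input, the hidden size, the residual (position-increment) readout, and the random, untrained weights. It shows only how an encoder summarizes an observed track and a decoder rolls out future positions, not the authors' model.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


class LSTMCell:
    """One LSTM cell with randomly initialised (untrained) weights."""

    def __init__(self, input_size, hidden_size, rng):
        n = input_size + hidden_size
        # One weight matrix and bias per gate:
        # i = input, f = forget, g = cell candidate, o = output.
        self.W = {g: [[rng.uniform(-0.1, 0.1) for _ in range(n)]
                      for _ in range(hidden_size)] for g in "ifgo"}
        self.b = {g: [0.0] * hidden_size for g in "ifgo"}

    def _gate(self, name, z, activation):
        return [activation(sum(w * v for w, v in zip(row, z)) + b)
                for row, b in zip(self.W[name], self.b[name])]

    def step(self, x, h, c):
        z = list(x) + list(h)  # concatenate input and previous hidden state
        i = self._gate("i", z, sigmoid)
        f = self._gate("f", z, sigmoid)
        g = self._gate("g", z, math.tanh)
        o = self._gate("o", z, sigmoid)
        c_new = [fk * ck + ik * gk for fk, ck, ik, gk in zip(f, c, i, g)]
        h_new = [ok * math.tanh(ck) for ok, ck in zip(o, c_new)]
        return h_new, c_new


class Seq2SeqPredictor:
    """Hypothetical encoder-decoder over normalised 3-D positions."""

    def __init__(self, dim=3, hidden_size=8, seed=0):
        rng = random.Random(seed)
        self.hidden_size = hidden_size
        self.encoder = LSTMCell(dim, hidden_size, rng)
        self.decoder = LSTMCell(dim, hidden_size, rng)
        # Linear readout: decoder hidden state -> position increment (assumed design).
        self.W_out = [[rng.uniform(-0.1, 0.1) for _ in range(hidden_size)]
                      for _ in range(dim)]

    def predict(self, observed, horizon):
        h = [0.0] * self.hidden_size
        c = [0.0] * self.hidden_size
        # Encoder: consume the observed track, keeping only the final state.
        for position in observed:
            h, c = self.encoder.step(position, h, c)
        # Decoder: start from the last observation and feed each
        # prediction back in as the next input.
        current = list(observed[-1])
        future = []
        for _ in range(horizon):
            h, c = self.decoder.step(current, h, c)
            delta = [sum(w * v for w, v in zip(row, h)) for row in self.W_out]
            current = [p + d for p, d in zip(current, delta)]
            future.append(current)
        return future


# Made-up, normalised target track: five observed (x, y, z) samples.
observed = [[0.1 * t, 0.05 * t, 0.5] for t in range(5)]
model = Seq2SeqPredictor()
future = model.predict(observed, horizon=3)
```

Because the weights are untrained, the predicted positions carry no meaning; a usable predictor would fit the weights to recorded target trajectories, typically with an automatic-differentiation library instead of hand-written cells.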