Lei Wang , Hongyu Yang , Yunxiang Han , Suwan Yin , Yuankai Wu
{"title":"基于几何技术的基于深度强化学习的空中交通管制冲突解决","authors":"Lei Wang , Hongyu Yang , Yunxiang Han , Suwan Yin , Yuankai Wu","doi":"10.1016/j.eswa.2025.127579","DOIUrl":null,"url":null,"abstract":"<div><div>Recent advances in Deep Reinforcement Learning (DRL)-based CR have shown promise; however, the end-to-end nature of DRL systems, which rely on reward-driven mechanisms, poses challenges in adhering to ATC’s stringent regulations. Traditional geometric methods, such as the Modified Voltage Potential (MVP) technique, theoretically meet ATC requirements by minimizing deviations from planned paths during conflicts but struggle to ensure optimal performance in complex, uncertain environments. In this study, we amalgamate DRL with the geometric MVP technique, leveraging DRL’s capacity for intelligent decision-making in complex environments and MVP’s ability to theoretically minimize deviations from planned paths. Within this framework, we utilize a look-ahead DRL agent, in conjunction with MVP conflict detection methods, to foresee potential conflicts. Should a conflict be imminent, a Maneuver DRL agent takes immediate control of the aircraft to adeptly navigate the situation, ensuring conflict avoidance. Once the aircraft is clear of the conflict, a rule-based method is employed to swiftly return it to the planned path. Extensive simulations demonstrate that our proposed framework significantly reduces conflict rates, maintains efficient trajectory adherence with minimal deviations, lowers unnecessary computational overhead, and effectively adapts to dynamic environmental conditions.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"281 ","pages":"Article 127579"},"PeriodicalIF":7.5000,"publicationDate":"2025-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Taming deep reinforcement learning-based conflict resolution in air traffic control using geometric technique\",\"authors\":\"Lei Wang , Hongyu Yang , Yunxiang Han , Suwan Yin , Yuankai Wu\",\"doi\":\"10.1016/j.eswa.2025.127579\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Recent advances in Deep Reinforcement Learning (DRL)-based CR have shown promise; however, the end-to-end nature of DRL systems, which rely on reward-driven mechanisms, poses challenges in adhering to ATC’s stringent regulations. Traditional geometric methods, such as the Modified Voltage Potential (MVP) technique, theoretically meet ATC requirements by minimizing deviations from planned paths during conflicts but struggle to ensure optimal performance in complex, uncertain environments. In this study, we amalgamate DRL with the geometric MVP technique, leveraging DRL’s capacity for intelligent decision-making in complex environments and MVP’s ability to theoretically minimize deviations from planned paths. Within this framework, we utilize a look-ahead DRL agent, in conjunction with MVP conflict detection methods, to foresee potential conflicts. Should a conflict be imminent, a Maneuver DRL agent takes immediate control of the aircraft to adeptly navigate the situation, ensuring conflict avoidance. Once the aircraft is clear of the conflict, a rule-based method is employed to swiftly return it to the planned path. Extensive simulations demonstrate that our proposed framework significantly reduces conflict rates, maintains efficient trajectory adherence with minimal deviations, lowers unnecessary computational overhead, and effectively adapts to dynamic environmental conditions.</div></div>\",\"PeriodicalId\":50461,\"journal\":{\"name\":\"Expert Systems with Applications\",\"volume\":\"281 \",\"pages\":\"Article 127579\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2025-04-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Expert Systems with Applications\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0957417425012011\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425012011","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Taming deep reinforcement learning-based conflict resolution in air traffic control using geometric technique
Recent advances in Deep Reinforcement Learning (DRL)-based CR have shown promise; however, the end-to-end nature of DRL systems, which rely on reward-driven mechanisms, poses challenges in adhering to ATC’s stringent regulations. Traditional geometric methods, such as the Modified Voltage Potential (MVP) technique, theoretically meet ATC requirements by minimizing deviations from planned paths during conflicts but struggle to ensure optimal performance in complex, uncertain environments. In this study, we amalgamate DRL with the geometric MVP technique, leveraging DRL’s capacity for intelligent decision-making in complex environments and MVP’s ability to theoretically minimize deviations from planned paths. Within this framework, we utilize a look-ahead DRL agent, in conjunction with MVP conflict detection methods, to foresee potential conflicts. Should a conflict be imminent, a Maneuver DRL agent takes immediate control of the aircraft to adeptly navigate the situation, ensuring conflict avoidance. Once the aircraft is clear of the conflict, a rule-based method is employed to swiftly return it to the planned path. Extensive simulations demonstrate that our proposed framework significantly reduces conflict rates, maintains efficient trajectory adherence with minimal deviations, lowers unnecessary computational overhead, and effectively adapts to dynamic environmental conditions.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.