Haifeng Liu, Dongyue Guo, Shizhong Zhou, Zheng Zhang, Hongyu Yang, Yi Lin
{"title":"学习飞行接近的类人决策:环境和模仿学习方法","authors":"Haifeng Liu, Dongyue Guo, Shizhong Zhou, Zheng Zhang, Hongyu Yang, Yi Lin","doi":"10.1016/j.trc.2025.105142","DOIUrl":null,"url":null,"abstract":"<div><div><span><span><sup>1</sup></span></span> Flight approach in terminal airspace is a challenging task with high aircraft density and maneuvering in air traffic control decisions. Existing reinforcement learning methods were only studied based on simulation environments, and also suffered from sparse reward and state-space explosion problems. In this work, an imitation learning-based autonomous framework, AppGAIL, is proposed to achieve the flight approach decision based on human expert demonstrations, which has the ability to eliminate the requirement of designing handcrafted rewards. To cope with the state-space explosion problem, a cylindrical grid airspace model is designed to convert the earth space to discrete airspace, obtaining the transformation of the real-time traffic situation by near distance–identical cells. The generative adversarial mechanism is applied to achieve imitation learning by distinguishing the source of the input observations (state and action sequences with a sliding window), i.e., from the generator or expert demonstrations. Since all human expert demonstrations are safe operations, limiting the model to learn knowledge of flight conflicts and confusing the generator to plan conflict trajectories, a conflict-aware discriminator is proposed to detect possible conflicts by a multi-task framework with learnable weights, which further supports the adversarial training. The real-world traffic dataset is applied to validate the proposed method, in which several custom metrics are proposed to support the real-world air traffic control. The experimental results demonstrate that the AppGAIL outperforms other baseline methods, achieving only 0.67% potential conflict rate and 3.732 kilometers dynamic time wrapping distance. Most importantly, all proposed technical modules contribute the desired performance improvement. Additionally, multi-aircraft planning and real-time factors can also be resolved to improve the applicability of the proposed method.</div></div>","PeriodicalId":54417,"journal":{"name":"Transportation Research Part C-Emerging Technologies","volume":"176 ","pages":"Article 105142"},"PeriodicalIF":7.6000,"publicationDate":"2025-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Learning human-like decisions for flight approach: Environment and an imitation learning method\",\"authors\":\"Haifeng Liu, Dongyue Guo, Shizhong Zhou, Zheng Zhang, Hongyu Yang, Yi Lin\",\"doi\":\"10.1016/j.trc.2025.105142\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div><span><span><sup>1</sup></span></span> Flight approach in terminal airspace is a challenging task with high aircraft density and maneuvering in air traffic control decisions. Existing reinforcement learning methods were only studied based on simulation environments, and also suffered from sparse reward and state-space explosion problems. In this work, an imitation learning-based autonomous framework, AppGAIL, is proposed to achieve the flight approach decision based on human expert demonstrations, which has the ability to eliminate the requirement of designing handcrafted rewards. To cope with the state-space explosion problem, a cylindrical grid airspace model is designed to convert the earth space to discrete airspace, obtaining the transformation of the real-time traffic situation by near distance–identical cells. The generative adversarial mechanism is applied to achieve imitation learning by distinguishing the source of the input observations (state and action sequences with a sliding window), i.e., from the generator or expert demonstrations. Since all human expert demonstrations are safe operations, limiting the model to learn knowledge of flight conflicts and confusing the generator to plan conflict trajectories, a conflict-aware discriminator is proposed to detect possible conflicts by a multi-task framework with learnable weights, which further supports the adversarial training. The real-world traffic dataset is applied to validate the proposed method, in which several custom metrics are proposed to support the real-world air traffic control. The experimental results demonstrate that the AppGAIL outperforms other baseline methods, achieving only 0.67% potential conflict rate and 3.732 kilometers dynamic time wrapping distance. Most importantly, all proposed technical modules contribute the desired performance improvement. Additionally, multi-aircraft planning and real-time factors can also be resolved to improve the applicability of the proposed method.</div></div>\",\"PeriodicalId\":54417,\"journal\":{\"name\":\"Transportation Research Part C-Emerging Technologies\",\"volume\":\"176 \",\"pages\":\"Article 105142\"},\"PeriodicalIF\":7.6000,\"publicationDate\":\"2025-05-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Transportation Research Part C-Emerging Technologies\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0968090X25001469\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"TRANSPORTATION SCIENCE & TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transportation Research Part C-Emerging Technologies","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0968090X25001469","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"TRANSPORTATION SCIENCE & TECHNOLOGY","Score":null,"Total":0}
Learning human-like decisions for flight approach: Environment and an imitation learning method
1 Flight approach in terminal airspace is a challenging task with high aircraft density and maneuvering in air traffic control decisions. Existing reinforcement learning methods were only studied based on simulation environments, and also suffered from sparse reward and state-space explosion problems. In this work, an imitation learning-based autonomous framework, AppGAIL, is proposed to achieve the flight approach decision based on human expert demonstrations, which has the ability to eliminate the requirement of designing handcrafted rewards. To cope with the state-space explosion problem, a cylindrical grid airspace model is designed to convert the earth space to discrete airspace, obtaining the transformation of the real-time traffic situation by near distance–identical cells. The generative adversarial mechanism is applied to achieve imitation learning by distinguishing the source of the input observations (state and action sequences with a sliding window), i.e., from the generator or expert demonstrations. Since all human expert demonstrations are safe operations, limiting the model to learn knowledge of flight conflicts and confusing the generator to plan conflict trajectories, a conflict-aware discriminator is proposed to detect possible conflicts by a multi-task framework with learnable weights, which further supports the adversarial training. The real-world traffic dataset is applied to validate the proposed method, in which several custom metrics are proposed to support the real-world air traffic control. The experimental results demonstrate that the AppGAIL outperforms other baseline methods, achieving only 0.67% potential conflict rate and 3.732 kilometers dynamic time wrapping distance. Most importantly, all proposed technical modules contribute the desired performance improvement. Additionally, multi-aircraft planning and real-time factors can also be resolved to improve the applicability of the proposed method.
期刊介绍:
Transportation Research: Part C (TR_C) is dedicated to showcasing high-quality, scholarly research that delves into the development, applications, and implications of transportation systems and emerging technologies. Our focus lies not solely on individual technologies, but rather on their broader implications for the planning, design, operation, control, maintenance, and rehabilitation of transportation systems, services, and components. In essence, the intellectual core of the journal revolves around the transportation aspect rather than the technology itself. We actively encourage the integration of quantitative methods from diverse fields such as operations research, control systems, complex networks, computer science, and artificial intelligence. Join us in exploring the intersection of transportation systems and emerging technologies to drive innovation and progress in the field.