Jie Yuan , Yang Pei , Yan Xu , Yuxue Ge , Zhiqiang Wei
{"title":"Autonomous interval management of multi-aircraft based on multi-agent reinforcement learning considering fuel consumption","authors":"Jie Yuan , Yang Pei , Yan Xu , Yuxue Ge , Zhiqiang Wei","doi":"10.1016/j.trc.2024.104729","DOIUrl":null,"url":null,"abstract":"<div><p>Real-time autonomous interval management in multi-aircraft operational scenarios addresses safety, efficiency, and economic issues in air transportation. This study proposes an autonomous interval management supporter (AIMS) prototype system with high scalability potential to address these issues. The system utilizes a multi-agent deep reinforcement learning method, specifically the deep deterministic policy gradient (DDPG) algorithm, which enables interval management and fuel-saving by providing speed decisions in a continuous action space amidst uncertainty. This study innovatively incorporates aircraft performance-related parameters as observational features. These features are categorized into interval- and performance-related groups as inputs, and trained using a separate reconstructed critic network structure. Experiments are focused on the enroute descent phase to validate the performance of the proposed AIMS. Compared with real flight data based on traffic controller decisions, the AIMS demonstrated superior speed change decision-making regardless of the aircraft type or classification criteria. Simulation results suggest that incorporating aircraft performance-related states and utilizing a separate critic network training structure positively improve the success rate of decision-making and reduce fuel consumption. By utilizing aircraft performance-related states, the success rate increases by an average of 49.64%, with a corresponding average fuel consumption decrease of 4.42%. Additionally, employing a separate critic network training structure results in an average success rate increase of 16.10%, with an average fuel reduction of 1.09%. To further reduce fuel consumption and achieve a shortened interval, it is recommended to set the initial altitude of the aircraft sequence appropriately high based on flight altitude constraints.</p></div>","PeriodicalId":54417,"journal":{"name":"Transportation Research Part C-Emerging Technologies","volume":null,"pages":null},"PeriodicalIF":7.6000,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0968090X2400250X/pdfft?md5=c26f5ec62797f36063cee03f673675b5&pid=1-s2.0-S0968090X2400250X-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transportation Research Part C-Emerging Technologies","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0968090X2400250X","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"TRANSPORTATION SCIENCE & TECHNOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Real-time autonomous interval management in multi-aircraft operational scenarios addresses safety, efficiency, and economic issues in air transportation. This study proposes an autonomous interval management supporter (AIMS) prototype system with high scalability potential to address these issues. The system utilizes a multi-agent deep reinforcement learning method, specifically the deep deterministic policy gradient (DDPG) algorithm, which enables interval management and fuel-saving by providing speed decisions in a continuous action space amidst uncertainty. This study innovatively incorporates aircraft performance-related parameters as observational features. These features are categorized into interval- and performance-related groups as inputs, and trained using a separate reconstructed critic network structure. Experiments are focused on the enroute descent phase to validate the performance of the proposed AIMS. Compared with real flight data based on traffic controller decisions, the AIMS demonstrated superior speed change decision-making regardless of the aircraft type or classification criteria. Simulation results suggest that incorporating aircraft performance-related states and utilizing a separate critic network training structure positively improve the success rate of decision-making and reduce fuel consumption. By utilizing aircraft performance-related states, the success rate increases by an average of 49.64%, with a corresponding average fuel consumption decrease of 4.42%. Additionally, employing a separate critic network training structure results in an average success rate increase of 16.10%, with an average fuel reduction of 1.09%. To further reduce fuel consumption and achieve a shortened interval, it is recommended to set the initial altitude of the aircraft sequence appropriately high based on flight altitude constraints.
期刊介绍:
Transportation Research: Part C (TR_C) is dedicated to showcasing high-quality, scholarly research that delves into the development, applications, and implications of transportation systems and emerging technologies. Our focus lies not solely on individual technologies, but rather on their broader implications for the planning, design, operation, control, maintenance, and rehabilitation of transportation systems, services, and components. In essence, the intellectual core of the journal revolves around the transportation aspect rather than the technology itself. We actively encourage the integration of quantitative methods from diverse fields such as operations research, control systems, complex networks, computer science, and artificial intelligence. Join us in exploring the intersection of transportation systems and emerging technologies to drive innovation and progress in the field.