Autonomous interval management of multi-aircraft based on multi-agent reinforcement learning considering fuel consumption

IF 7.6 Zone 1 (Engineering & Technology) Q1 TRANSPORTATION SCIENCE & TECHNOLOGY

Transportation Research Part C-Emerging Technologies Pub Date : 2024-07-01 DOI:10.1016/j.trc.2024.104729

Jie Yuan , Yang Pei , Yan Xu , Yuxue Ge , Zhiqiang Wei

Volume 165, Article 104729

Citations: 0

Abstract

Real-time autonomous interval management in multi-aircraft operational scenarios addresses safety, efficiency, and economic issues in air transportation. This study proposes an autonomous interval management supporter (AIMS) prototype system with high scalability potential to address these issues. The system uses a multi-agent deep reinforcement learning method, specifically the deep deterministic policy gradient (DDPG) algorithm, which enables interval management and fuel saving by providing speed decisions in a continuous action space under uncertainty. A novel element of the study is the use of aircraft performance-related parameters as observational features. The features are split into interval-related and performance-related groups as inputs and trained using a separate, reconstructed critic network structure. Experiments focus on the en-route descent phase to validate the performance of the proposed AIMS. Compared with real flight data based on air traffic controller decisions, the AIMS demonstrated superior speed-change decision-making regardless of the aircraft type or classification criteria. Simulation results suggest that incorporating aircraft performance-related states and using a separate critic network training structure both improve the decision-making success rate and reduce fuel consumption. With aircraft performance-related states, the success rate increases by an average of 49.64%, with a corresponding average fuel consumption decrease of 4.42%. With a separate critic network training structure, the success rate increases by an average of 16.10%, with an average fuel reduction of 1.09%. To further reduce fuel consumption and achieve a shortened interval, it is recommended to set the initial altitude of the aircraft sequence appropriately high, subject to flight altitude constraints.
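The abstract does not give the critic architecture, so the following is only a rough sketch of what a critic with separate branches for interval-related and performance-related features might look like. Everything in it is an assumption made for illustration: the class name `SplitCritic`, the layer sizes, the tanh activations, the way the action is fed into both branches, and the example feature values. It shows a forward pass only, with no DDPG training loop, and is not the authors' implementation.

```python
import math
import random

def dense(x, w, b):
    """Affine layer followed by tanh; returns one output per row of w."""
    return [math.tanh(sum(wi * xi for wi, xi in zip(row, x)) + bi)
            for row, bi in zip(w, b)]

def init_layer(rng, n_in, n_out):
    """Small random weights and zero biases (hypothetical initialisation)."""
    w = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    b = [0.0] * n_out
    return w, b

class SplitCritic:
    """Hypothetical critic: one branch per feature group, merged into a scalar Q."""

    def __init__(self, n_interval, n_perf, n_action, hidden=8, seed=0):
        rng = random.Random(seed)
        # Assumption: each branch sees its own feature group plus the action.
        self.interval_branch = init_layer(rng, n_interval + n_action, hidden)
        self.perf_branch = init_layer(rng, n_perf + n_action, hidden)
        # Linear head over the concatenated branch outputs.
        self.head_w = [rng.uniform(-0.1, 0.1) for _ in range(2 * hidden)]
        self.head_b = 0.0

    def q_value(self, interval_obs, perf_obs, action):
        h_int = dense(interval_obs + action, *self.interval_branch)
        h_perf = dense(perf_obs + action, *self.perf_branch)
        merged = h_int + h_perf  # list concatenation of the two branch outputs
        return sum(w * h for w, h in zip(self.head_w, merged)) + self.head_b

# Illustrative inputs only: 3 interval features, 2 performance features,
# and a 1-dimensional continuous speed-change action.
critic = SplitCritic(n_interval=3, n_perf=2, n_action=1)
q = critic.q_value([0.2, -0.1, 0.5], [0.7, 0.3], [0.05])
```

In a full DDPG setup the actor would output the continuous speed decision and the critic's Q-value would drive the policy gradient; keeping the two feature groups in separate branches is one plausible reading of the "separate critic network structure" described above.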




Source journal

Transportation Research Part C-Emerging Technologies (Engineering & Technology: Transportation Science & Technology)

CiteScore: 15.80
Self-citation rate: 12.00%
Publications: 332
Review time: 64 days

Journal description: Transportation Research: Part C (TR_C) is dedicated to showcasing high-quality, scholarly research that delves into the development, applications, and implications of transportation systems and emerging technologies. Our focus lies not solely on individual technologies, but rather on their broader implications for the planning, design, operation, control, maintenance, and rehabilitation of transportation systems, services, and components. In essence, the intellectual core of the journal revolves around the transportation aspect rather than the technology itself. We actively encourage the integration of quantitative methods from diverse fields such as operations research, control systems, complex networks, computer science, and artificial intelligence. Join us in exploring the intersection of transportation systems and emerging technologies to drive innovation and progress in the field.