AviationCopilot: Building a reliable LLM-based Aviation Copilot inspired by human pilot training

Zhuorui Zhang, Shanshan Feng, Tiance Yang, Ruobing Huang, Hao Wang, Fu Wang, Fan Li

Advanced Engineering Informatics, Volume 69, Article 103806. DOI: 10.1016/j.aei.2025.103806. Published 2025-09-08. Journal Impact Factor 9.9; JCR Q1 (Computer Science, Artificial Intelligence). Full text: https://www.sciencedirect.com/science/article/pii/S1474034625006998
Modern pilots routinely face high cognitive loads during complex flight operations. Although large language models (LLMs) demonstrate exceptional natural language understanding and show tremendous potential as copilots, there is a notable gap in LLMs specifically designed for the knowledge-intensive tasks required of pilots. Inspired by pilots' learning and manual-retrieval patterns, we introduce AviationCopilot, a novel framework that efficiently injects both aviation knowledge content and knowledge structure into LLMs. Specifically, we employ differentiated data fusion and generalization strategies across two training stages: continual pre-training and instruction tuning. This approach equips the model with enhanced domain-specific knowledge retention and instruction-following capabilities, akin to human pilots. During inference, AviationCopilot activates its knowledge-structure memory to adaptively retrieve comprehensive context, improving factual accuracy. To evaluate effectiveness, we construct a comprehensive benchmark named OpenAviation featuring both LLM-synthesized and expert-designed questions. Experimental results show that models with fewer than two billion parameters, trained with the AviationCopilot framework, consistently outperform strong LLM baselines, including those using Retrieval-Augmented Generation (RAG). Additionally, AviationCopilot enhances structured aviation understanding and enables LLMs to serve as retrievers for improving other models, supporting more reliable AI copilots. The dataset and source code are available at https://github.com/zhuorui-zhang/AviationCopilot.
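The abstract's idea of "activating knowledge-structure memory" to retrieve comprehensive context can be illustrated with a toy sketch. This is not the paper's implementation; the manual content, scoring rule, and function name below are all hypothetical. The point is only the contrast with flat passage retrieval: like a pilot consulting a manual's table of contents, a structure-aware retriever returns the best-matching section together with its parent chapter and sibling sections, rather than an isolated paragraph.

```python
# Hypothetical aviation manual organized as chapter -> sections,
# standing in for the "knowledge structure" the framework injects.
MANUAL = {
    "Chapter 3: Engine Operations": {
        "3.1 Startup": "Verify fuel pressure before ignition.",
        "3.2 Shutdown": "Idle the engine for two minutes before shutdown.",
    },
    "Chapter 5: Emergency Procedures": {
        "5.1 Engine Fire": "Shut off fuel supply and discharge extinguisher.",
        "5.2 Electrical Failure": "Switch to the backup bus.",
    },
}

def structured_retrieve(query: str) -> dict:
    """Find the section whose title+text overlaps most with the query,
    then return its ENTIRE parent chapter as comprehensive context."""
    q_words = query.lower().split()
    best_chapter, best_score = None, 0
    for chapter, sections in MANUAL.items():
        for title, text in sections.items():
            haystack = (title + " " + text).lower()
            score = sum(w in haystack for w in q_words)
            if score > best_score:
                best_chapter, best_score = chapter, score
    if best_chapter is None:
        return {}
    # Whole chapter, not just the matching section: the structural
    # context (siblings, chapter heading) travels with the answer.
    return {best_chapter: MANUAL[best_chapter]}

ctx = structured_retrieve("engine fire checklist")
```

A flat retriever would return only the "5.1 Engine Fire" snippet; here the model also sees the chapter heading and the neighboring "5.2 Electrical Failure" procedure, which is the kind of broader context the paper credits with improving factual accuracy.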
About the journal:
Advanced Engineering Informatics is an international Journal that solicits research papers with an emphasis on 'knowledge' and 'engineering applications'. The Journal seeks original papers that report progress in applying methods of engineering informatics. These papers should have engineering relevance and help provide a scientific base for more reliable, spontaneous, and creative engineering decision-making. Additionally, papers should demonstrate the science of supporting knowledge-intensive engineering tasks and validate the generality, power, and scalability of new methods through rigorous evaluation, preferably both qualitatively and quantitatively. Abstracting and indexing for Advanced Engineering Informatics include Science Citation Index Expanded, Scopus and INSPEC.