CELLMEA：一种协同增强的基于大语言模型的飞机故障维修实体对齐方法

IF 7.5 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Expert Systems with Applications Pub Date : 2025-04-21 DOI:10.1016/j.eswa.2025.127630

Xiangzhen Meng , Xiaoxuan Jiao , Jiahui Li , Shenglong Wang , Jinxin Pan , Bo Jing , Xilang Tang

{"title":"CELLMEA：一种协同增强的基于大语言模型的飞机故障维修实体对齐方法","authors":"Xiangzhen Meng , Xiaoxuan Jiao , Jiahui Li , Shenglong Wang , Jinxin Pan , Bo Jing , Xilang Tang","doi":"10.1016/j.eswa.2025.127630","DOIUrl":null,"url":null,"abstract":"<div><div>Aircraft fault knowledge graphs serve as a critical knowledge base for the intelligent maintenance and operations of aviation equipment. However, the entity alignment tasks in their construction remain overly dependent on manual annotation, leading to issues such as inconsistent annotation quality and low annotation efficiency. Unsupervised methods provide a promising solution and have garnered significant research interest. However, existing unsupervised entity alignment approaches often overlook the impact of noisy entities, presenting a significant challenge for aligning entities in aviation fault data. This paper proposes a solution by incorporating a large language model (LLM) into the entity alignment process for aircraft fault knowledge graphs. By leveraging the world knowledge encoded in the LLM, the approach enhances the performance of unsupervised entity alignment models. Specifically, we introduce the Collaboratively Enhanced-based Large Language Model Entity Alignment (CELLMEA), which utilizes data from the aircraft flight control system manual, fault analysis manual, and typical fault cases. The model’s architecture includes a multi-view semantic information embedding that integrates structural, relational, and semantic data. Additionally, we propose an adaptive method for mixing hard negative samples, which generates higher-quality negative entities by combining noisy negative samples with reliable ones. Furthermore, an incremental consistency regularization technique is introduced to progressively refine the robustness of pseudo-labeling within the CELLMEA model. Finally, experimental results on a flight control system entity alignment dataset demonstrate that CELLMEA outperforms all baseline models, achieving an MRR (Mean Reciprocal Rank) value of 0.917 ± 0.011. These results validate the model’s effectiveness in handling unlabeled data and lay the groundwork for the engineering of aircraft fault knowledge graphs.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"282 ","pages":"Article 127630"},"PeriodicalIF":7.5000,"publicationDate":"2025-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"CELLMEA:A Collaboratively Enhanced Large Language Model-based Entity Alignment for aircraft fault maintenance\",\"authors\":\"Xiangzhen Meng , Xiaoxuan Jiao , Jiahui Li , Shenglong Wang , Jinxin Pan , Bo Jing , Xilang Tang\",\"doi\":\"10.1016/j.eswa.2025.127630\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Aircraft fault knowledge graphs serve as a critical knowledge base for the intelligent maintenance and operations of aviation equipment. However, the entity alignment tasks in their construction remain overly dependent on manual annotation, leading to issues such as inconsistent annotation quality and low annotation efficiency. Unsupervised methods provide a promising solution and have garnered significant research interest. However, existing unsupervised entity alignment approaches often overlook the impact of noisy entities, presenting a significant challenge for aligning entities in aviation fault data. This paper proposes a solution by incorporating a large language model (LLM) into the entity alignment process for aircraft fault knowledge graphs. By leveraging the world knowledge encoded in the LLM, the approach enhances the performance of unsupervised entity alignment models. Specifically, we introduce the Collaboratively Enhanced-based Large Language Model Entity Alignment (CELLMEA), which utilizes data from the aircraft flight control system manual, fault analysis manual, and typical fault cases. The model’s architecture includes a multi-view semantic information embedding that integrates structural, relational, and semantic data. Additionally, we propose an adaptive method for mixing hard negative samples, which generates higher-quality negative entities by combining noisy negative samples with reliable ones. Furthermore, an incremental consistency regularization technique is introduced to progressively refine the robustness of pseudo-labeling within the CELLMEA model. Finally, experimental results on a flight control system entity alignment dataset demonstrate that CELLMEA outperforms all baseline models, achieving an MRR (Mean Reciprocal Rank) value of 0.917 ± 0.011. These results validate the model’s effectiveness in handling unlabeled data and lay the groundwork for the engineering of aircraft fault knowledge graphs.</div></div>\",\"PeriodicalId\":50461,\"journal\":{\"name\":\"Expert Systems with Applications\",\"volume\":\"282 \",\"pages\":\"Article 127630\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2025-04-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Expert Systems with Applications\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0957417425012527\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425012527","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

飞机故障知识图是航空设备智能维修与运行的重要知识库。然而，实体对齐任务在其构建过程中仍然过度依赖于手工标注，导致标注质量不一致、标注效率低等问题。无监督方法提供了一个很有前途的解决方案，并获得了重要的研究兴趣。然而，现有的无监督实体对齐方法往往忽略了噪声实体的影响，这对航空故障数据中的实体对齐提出了重大挑战。本文提出了一种将大语言模型（LLM）集成到飞机故障知识图实体对齐过程中的解决方案。通过利用LLM中编码的世界知识，该方法提高了无监督实体对齐模型的性能。具体来说，我们介绍了基于协同增强的大语言模型实体校准（CELLMEA），该方法利用了飞机飞行控制系统手册、故障分析手册和典型故障案例中的数据。该模型的体系结构包括一个多视图语义信息嵌入，它集成了结构、关系和语义数据。此外，我们提出了一种自适应混合硬负样本的方法，该方法通过将有噪声的负样本与可靠的负样本相结合，产生更高质量的负实体。此外，引入了一种增量一致性正则化技术来逐步改进CELLMEA模型中伪标记的鲁棒性。最后，在飞行控制系统实体对齐数据集上的实验结果表明，CELLMEA优于所有基线模型，MRR （Mean Reciprocal Rank）值为0.917±0.011。这些结果验证了该模型在处理未标记数据方面的有效性，为飞机故障知识图的工程化奠定了基础。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

CELLMEA:A Collaboratively Enhanced Large Language Model-based Entity Alignment for aircraft fault maintenance

Aircraft fault knowledge graphs serve as a critical knowledge base for the intelligent maintenance and operations of aviation equipment. However, the entity alignment tasks in their construction remain overly dependent on manual annotation, leading to issues such as inconsistent annotation quality and low annotation efficiency. Unsupervised methods provide a promising solution and have garnered significant research interest. However, existing unsupervised entity alignment approaches often overlook the impact of noisy entities, presenting a significant challenge for aligning entities in aviation fault data. This paper proposes a solution by incorporating a large language model (LLM) into the entity alignment process for aircraft fault knowledge graphs. By leveraging the world knowledge encoded in the LLM, the approach enhances the performance of unsupervised entity alignment models. Specifically, we introduce the Collaboratively Enhanced-based Large Language Model Entity Alignment (CELLMEA), which utilizes data from the aircraft flight control system manual, fault analysis manual, and typical fault cases. The model’s architecture includes a multi-view semantic information embedding that integrates structural, relational, and semantic data. Additionally, we propose an adaptive method for mixing hard negative samples, which generates higher-quality negative entities by combining noisy negative samples with reliable ones. Furthermore, an incremental consistency regularization technique is introduced to progressively refine the robustness of pseudo-labeling within the CELLMEA model. Finally, experimental results on a flight control system entity alignment dataset demonstrate that CELLMEA outperforms all baseline models, achieving an MRR (Mean Reciprocal Rank) value of 0.917 ± 0.011. These results validate the model’s effectiveness in handling unlabeled data and lay the groundwork for the engineering of aircraft fault knowledge graphs.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Expert Systems with Applications 工程技术-工程：电子与电气

CiteScore

13.80

自引率

10.60%

发文量

2045

审稿时长

8.7 months

期刊介绍： Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.