Enhancing aero-engine blade few-shot anomaly detection with visual-language multi-modal models under domain shift conditions

IF 9.9 1区工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Advanced Engineering Informatics Pub Date : 2025-10-06 DOI:10.1016/j.aei.2025.103953

Jiafeng Tang, Kunpeng Tan, Zhibin Zhao, Xingwu Zhang, Chuang Sun, Xuefeng Chen

{"title":"Enhancing aero-engine blade few-shot anomaly detection with visual-language multi-modal models under domain shift conditions","authors":"Jiafeng Tang, Kunpeng Tan, Zhibin Zhao, Xingwu Zhang, Chuang Sun, Xuefeng Chen","doi":"10.1016/j.aei.2025.103953","DOIUrl":null,"url":null,"abstract":"<div><div>The blade is a critical component of the aero-engines, and the regular inspection of blades is essential for the healthy operation of aero-engines. Data-driven deep learning (DL) methods for blade anomaly detection gain significant success. However, the nature of data-hungry hampers the further development of general DL-based methods. In light of the insights garnered from the intricate feature embedding of the recent large-scale models, we discern their aptitude for the extensive potential in blade anomaly detection (BAD), notably within contexts constrained by scant sample sizes. Additionally, we are aware of the real-world domain shift problems caused by complex circumstances in aero-engine BAD highly cripple the performance of large-scale models. Thus, we propose a domain semantic perception for blade anomaly detection (DSP4BAD), a few-shot BAD method based on the visual-language large-scale model (CLIP), for exploring the potential of large-scale models and mitigating the above existing challenges. In general, benefiting from the CLIP’s extensive pre-training knowledge in large-scale datasets, DSP4BAD can obtain excellent anomaly detection performance by embracing the tailored knowledge from few-shot samples. To this end, we develop the domain-state prompt augmentation template (DSPAT) and the attention-guided feature adaptation module (AGFAM) to facilitate the adaptation of CLIP’s general knowledge to the domain-specific ones of BAD. Meanwhile, given the redundancy of in-context information in CLIP, an in-context semantic refinement module (ISRM) is devised for purifying the context semantic about anomaly to further alleviate the domain shift issues. Extensive experiment results demonstrate that our DSP4BAD achieves the state-of-the-art performance of anomaly detection with few-shot samples, which provides a promising tack for applications of large-scale models toward real-world blade inspection.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"69 ","pages":"Article 103953"},"PeriodicalIF":9.9000,"publicationDate":"2025-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Engineering Informatics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1474034625008468","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

The blade is a critical component of the aero-engines, and the regular inspection of blades is essential for the healthy operation of aero-engines. Data-driven deep learning (DL) methods for blade anomaly detection gain significant success. However, the nature of data-hungry hampers the further development of general DL-based methods. In light of the insights garnered from the intricate feature embedding of the recent large-scale models, we discern their aptitude for the extensive potential in blade anomaly detection (BAD), notably within contexts constrained by scant sample sizes. Additionally, we are aware of the real-world domain shift problems caused by complex circumstances in aero-engine BAD highly cripple the performance of large-scale models. Thus, we propose a domain semantic perception for blade anomaly detection (DSP4BAD), a few-shot BAD method based on the visual-language large-scale model (CLIP), for exploring the potential of large-scale models and mitigating the above existing challenges. In general, benefiting from the CLIP’s extensive pre-training knowledge in large-scale datasets, DSP4BAD can obtain excellent anomaly detection performance by embracing the tailored knowledge from few-shot samples. To this end, we develop the domain-state prompt augmentation template (DSPAT) and the attention-guided feature adaptation module (AGFAM) to facilitate the adaptation of CLIP’s general knowledge to the domain-specific ones of BAD. Meanwhile, given the redundancy of in-context information in CLIP, an in-context semantic refinement module (ISRM) is devised for purifying the context semantic about anomaly to further alleviate the domain shift issues. Extensive experiment results demonstrate that our DSP4BAD achieves the state-of-the-art performance of anomaly detection with few-shot samples, which provides a promising tack for applications of large-scale models toward real-world blade inspection.

查看原文本刊更多论文

基于视觉语言多模态模型在域漂移条件下增强航空发动机叶片少弹异常检测

叶片是航空发动机的关键部件，对叶片的定期检查对航空发动机的健康运行至关重要。数据驱动的深度学习（DL）方法在叶片异常检测中取得了显著的成功。然而，数据饥渴的本质阻碍了基于通用dl方法的进一步发展。根据从最近大规模模型的复杂特征嵌入中获得的见解，我们发现它们在叶片异常检测（BAD）中具有广泛的潜力，特别是在样本量有限的情况下。此外，我们意识到航空发动机BAD中复杂环境引起的现实世界的域漂移问题严重削弱了大尺度模型的性能。因此，我们提出了一种基于视觉语言大规模模型（CLIP）的领域语义感知叶片异常检测方法（DSP4BAD），以探索大规模模型的潜力，缓解上述存在的挑战。总的来说，得益于CLIP在大规模数据集中广泛的预训练知识，DSP4BAD可以通过从少量样本中获得定制化的知识来获得出色的异常检测性能。为此，我们开发了领域状态提示增强模板（dspit）和注意引导特征适应模块（AGFAM），以促进CLIP的一般知识适应BAD的领域特定知识。同时，针对CLIP中上下文信息的冗余性，设计了一个上下文语义精炼模块（ISRM）来净化异常的上下文语义，进一步缓解领域转移问题。大量的实验结果表明，我们的DSP4BAD在少量样本的情况下实现了最先进的异常检测性能，这为大规模模型在实际叶片检测中的应用提供了一种有希望的方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Advanced Engineering Informatics 工程技术-工程：综合

CiteScore

12.40

自引率

18.20%

发文量

292

审稿时长

45 days

期刊介绍： Advanced Engineering Informatics is an international Journal that solicits research papers with an emphasis on 'knowledge' and 'engineering applications'. The Journal seeks original papers that report progress in applying methods of engineering informatics. These papers should have engineering relevance and help provide a scientific base for more reliable, spontaneous, and creative engineering decision-making. Additionally, papers should demonstrate the science of supporting knowledge-intensive engineering tasks and validate the generality, power, and scalability of new methods through rigorous evaluation, preferably both qualitatively and quantitatively. Abstracting and indexing for Advanced Engineering Informatics include Science Citation Index Expanded, Scopus and INSPEC.