{"title":"Enhancing aero-engine blade few-shot anomaly detection with visual-language multi-modal models under domain shift conditions","authors":"Jiafeng Tang, Kunpeng Tan, Zhibin Zhao, Xingwu Zhang, Chuang Sun, Xuefeng Chen","doi":"10.1016/j.aei.2025.103953","DOIUrl":null,"url":null,"abstract":"<div><div>The blade is a critical component of the aero-engines, and the regular inspection of blades is essential for the healthy operation of aero-engines. Data-driven deep learning (DL) methods for blade anomaly detection gain significant success. However, the nature of data-hungry hampers the further development of general DL-based methods. In light of the insights garnered from the intricate feature embedding of the recent large-scale models, we discern their aptitude for the extensive potential in blade anomaly detection (BAD), notably within contexts constrained by scant sample sizes. Additionally, we are aware of the real-world domain shift problems caused by complex circumstances in aero-engine BAD highly cripple the performance of large-scale models. Thus, we propose a domain semantic perception for blade anomaly detection (DSP4BAD), a few-shot BAD method based on the visual-language large-scale model (CLIP), for exploring the potential of large-scale models and mitigating the above existing challenges. In general, benefiting from the CLIP’s extensive pre-training knowledge in large-scale datasets, DSP4BAD can obtain excellent anomaly detection performance by embracing the tailored knowledge from few-shot samples. To this end, we develop the domain-state prompt augmentation template (DSPAT) and the attention-guided feature adaptation module (AGFAM) to facilitate the adaptation of CLIP’s general knowledge to the domain-specific ones of BAD. Meanwhile, given the redundancy of in-context information in CLIP, an in-context semantic refinement module (ISRM) is devised for purifying the context semantic about anomaly to further alleviate the domain shift issues. Extensive experiment results demonstrate that our DSP4BAD achieves the state-of-the-art performance of anomaly detection with few-shot samples, which provides a promising tack for applications of large-scale models toward real-world blade inspection.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"69 ","pages":"Article 103953"},"PeriodicalIF":9.9000,"publicationDate":"2025-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Engineering Informatics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1474034625008468","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
The blade is a critical component of the aero-engines, and the regular inspection of blades is essential for the healthy operation of aero-engines. Data-driven deep learning (DL) methods for blade anomaly detection gain significant success. However, the nature of data-hungry hampers the further development of general DL-based methods. In light of the insights garnered from the intricate feature embedding of the recent large-scale models, we discern their aptitude for the extensive potential in blade anomaly detection (BAD), notably within contexts constrained by scant sample sizes. Additionally, we are aware of the real-world domain shift problems caused by complex circumstances in aero-engine BAD highly cripple the performance of large-scale models. Thus, we propose a domain semantic perception for blade anomaly detection (DSP4BAD), a few-shot BAD method based on the visual-language large-scale model (CLIP), for exploring the potential of large-scale models and mitigating the above existing challenges. In general, benefiting from the CLIP’s extensive pre-training knowledge in large-scale datasets, DSP4BAD can obtain excellent anomaly detection performance by embracing the tailored knowledge from few-shot samples. To this end, we develop the domain-state prompt augmentation template (DSPAT) and the attention-guided feature adaptation module (AGFAM) to facilitate the adaptation of CLIP’s general knowledge to the domain-specific ones of BAD. Meanwhile, given the redundancy of in-context information in CLIP, an in-context semantic refinement module (ISRM) is devised for purifying the context semantic about anomaly to further alleviate the domain shift issues. Extensive experiment results demonstrate that our DSP4BAD achieves the state-of-the-art performance of anomaly detection with few-shot samples, which provides a promising tack for applications of large-scale models toward real-world blade inspection.
期刊介绍:
Advanced Engineering Informatics is an international Journal that solicits research papers with an emphasis on 'knowledge' and 'engineering applications'. The Journal seeks original papers that report progress in applying methods of engineering informatics. These papers should have engineering relevance and help provide a scientific base for more reliable, spontaneous, and creative engineering decision-making. Additionally, papers should demonstrate the science of supporting knowledge-intensive engineering tasks and validate the generality, power, and scalability of new methods through rigorous evaluation, preferably both qualitatively and quantitatively. Abstracting and indexing for Advanced Engineering Informatics include Science Citation Index Expanded, Scopus and INSPEC.