{"title":"FSAMLM:一种用于跨域故障诊断的多模态大模型","authors":"Xin Zhang , Shixi Liu , Li Jiang , Yibing Li","doi":"10.1016/j.asoc.2025.113985","DOIUrl":null,"url":null,"abstract":"<div><div>Recent advances in deep learning have demonstrated remarkable performance in mechanical fault diagnosis. However, most existing approaches are designed for a single data modality or a single task, limiting their flexibility and generalization ability. Moreover, common knowledge across different tasks remains largely unexploited. Inspired by the success of large-scale models in natural language processing and computer vision, integrating multimodal data with multi-task learning strategies may significantly improve the performance of fault diagnosis models. Nonetheless, a significant gap exists between general pre-trained knowledge and domain-specific expertise, posing considerable challenges for effective integration. To address these issues, we propose a novel few-shot adaptation multimodal large model (FSAMLM) that efficiently captures shared representations through a parameter-efficient fine-tuning strategy, structured in two stages. Specifically, we first design a low-rank adaptation meta-learning (LoRAML) framework, which employs low-rank decomposition on pre-trained parameters to reduce computational complexity and improve robustness. This approach not only accelerates adaptation to new few-shot tasks but also preserves pre-training knowledge effectively. During the second fine-tuning stage, we implement target-domain training with limited samples and introduce a training-free inference option for real-world deployment. Experimental validation using six public datasets demonstrates FSAMLM’s superior domain generalization capability in few-shot fault diagnosis tasks compared to existing methods. The code repository is publicly available at: <span><span>https://github.com/ohhyeeaah/FSAMLM-for-fault-diagnosis</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50737,"journal":{"name":"Applied Soft Computing","volume":"185 ","pages":"Article 113985"},"PeriodicalIF":6.6000,"publicationDate":"2025-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"FSAMLM: A few-shot adaptation multimodal large model for cross-domain fault diagnosis\",\"authors\":\"Xin Zhang , Shixi Liu , Li Jiang , Yibing Li\",\"doi\":\"10.1016/j.asoc.2025.113985\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Recent advances in deep learning have demonstrated remarkable performance in mechanical fault diagnosis. However, most existing approaches are designed for a single data modality or a single task, limiting their flexibility and generalization ability. Moreover, common knowledge across different tasks remains largely unexploited. Inspired by the success of large-scale models in natural language processing and computer vision, integrating multimodal data with multi-task learning strategies may significantly improve the performance of fault diagnosis models. Nonetheless, a significant gap exists between general pre-trained knowledge and domain-specific expertise, posing considerable challenges for effective integration. To address these issues, we propose a novel few-shot adaptation multimodal large model (FSAMLM) that efficiently captures shared representations through a parameter-efficient fine-tuning strategy, structured in two stages. Specifically, we first design a low-rank adaptation meta-learning (LoRAML) framework, which employs low-rank decomposition on pre-trained parameters to reduce computational complexity and improve robustness. This approach not only accelerates adaptation to new few-shot tasks but also preserves pre-training knowledge effectively. During the second fine-tuning stage, we implement target-domain training with limited samples and introduce a training-free inference option for real-world deployment. Experimental validation using six public datasets demonstrates FSAMLM’s superior domain generalization capability in few-shot fault diagnosis tasks compared to existing methods. The code repository is publicly available at: <span><span>https://github.com/ohhyeeaah/FSAMLM-for-fault-diagnosis</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":50737,\"journal\":{\"name\":\"Applied Soft Computing\",\"volume\":\"185 \",\"pages\":\"Article 113985\"},\"PeriodicalIF\":6.6000,\"publicationDate\":\"2025-09-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Soft Computing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1568494625012980\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1568494625012980","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
FSAMLM: A few-shot adaptation multimodal large model for cross-domain fault diagnosis
Recent advances in deep learning have demonstrated remarkable performance in mechanical fault diagnosis. However, most existing approaches are designed for a single data modality or a single task, limiting their flexibility and generalization ability. Moreover, common knowledge across different tasks remains largely unexploited. Inspired by the success of large-scale models in natural language processing and computer vision, integrating multimodal data with multi-task learning strategies may significantly improve the performance of fault diagnosis models. Nonetheless, a significant gap exists between general pre-trained knowledge and domain-specific expertise, posing considerable challenges for effective integration. To address these issues, we propose a novel few-shot adaptation multimodal large model (FSAMLM) that efficiently captures shared representations through a parameter-efficient fine-tuning strategy, structured in two stages. Specifically, we first design a low-rank adaptation meta-learning (LoRAML) framework, which employs low-rank decomposition on pre-trained parameters to reduce computational complexity and improve robustness. This approach not only accelerates adaptation to new few-shot tasks but also preserves pre-training knowledge effectively. During the second fine-tuning stage, we implement target-domain training with limited samples and introduce a training-free inference option for real-world deployment. Experimental validation using six public datasets demonstrates FSAMLM’s superior domain generalization capability in few-shot fault diagnosis tasks compared to existing methods. The code repository is publicly available at: https://github.com/ohhyeeaah/FSAMLM-for-fault-diagnosis.
期刊介绍:
Applied Soft Computing is an international journal promoting an integrated view of soft computing to solve real life problems.The focus is to publish the highest quality research in application and convergence of the areas of Fuzzy Logic, Neural Networks, Evolutionary Computing, Rough Sets and other similar techniques to address real world complexities.
Applied Soft Computing is a rolling publication: articles are published as soon as the editor-in-chief has accepted them. Therefore, the web site will continuously be updated with new articles and the publication time will be short.