Jian Cen, Weiwei Si, Xi Liu, Bichuang Zhao, Hankun Huang, Junfu Liu
{"title":"Intelligent fault diagnosis method based on data generation and long-patch vision transformer under small samples","authors":"Jian Cen, Weiwei Si, Xi Liu, Bichuang Zhao, Hankun Huang, Junfu Liu","doi":"10.1007/s10489-025-06535-w","DOIUrl":null,"url":null,"abstract":"<div><p>Rotating machinery is an important part of modern industry, and bearings are one of the most important things. However, bearing fault data are difficult to collect, and bearing fault diagnosis under small samples has significant research potential. In this paper, we proposed a fault diagnosis framework that combines diffusion modeling and improved Vision Transformer. First, the short-time Fourier transform is applied to the original one-dimensional vibration signals to convert the data into time-frequency maps. Second, the conditional diffusion model was applied to generate the required samples and expand the dataset. Finally, the Long-patch Vision Transformer (LVT) proposed in this paper is used to classify the mixed samples. LVT designs a long-patch division method for time-frequency maps with dense transverse features. The LVT contains denser features in each patch, and this method is more suitable for time-frequency maps. Validating the method proposed in this paper on two datasets and comparing it with other methods, our method achieved the highest accuracy among the compared methods.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 7","pages":""},"PeriodicalIF":3.4000,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Intelligence","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s10489-025-06535-w","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Rotating machinery is an important part of modern industry, and bearings are one of the most important things. However, bearing fault data are difficult to collect, and bearing fault diagnosis under small samples has significant research potential. In this paper, we proposed a fault diagnosis framework that combines diffusion modeling and improved Vision Transformer. First, the short-time Fourier transform is applied to the original one-dimensional vibration signals to convert the data into time-frequency maps. Second, the conditional diffusion model was applied to generate the required samples and expand the dataset. Finally, the Long-patch Vision Transformer (LVT) proposed in this paper is used to classify the mixed samples. LVT designs a long-patch division method for time-frequency maps with dense transverse features. The LVT contains denser features in each patch, and this method is more suitable for time-frequency maps. Validating the method proposed in this paper on two datasets and comparing it with other methods, our method achieved the highest accuracy among the compared methods.
期刊介绍:
With a focus on research in artificial intelligence and neural networks, this journal addresses issues involving solutions of real-life manufacturing, defense, management, government and industrial problems which are too complex to be solved through conventional approaches and require the simulation of intelligent thought processes, heuristics, applications of knowledge, and distributed and parallel processing. The integration of these multiple approaches in solving complex problems is of particular importance.
The journal presents new and original research and technological developments, addressing real and complex issues applicable to difficult problems. It provides a medium for exchanging scientific research and technological achievements accomplished by the international community.