{"title":"一种轻型视觉变压器框架与流动可视化相结合,用于离心泵早期空化诊断","authors":"Bingyang Shang , Zheming Tong , Hao Liu","doi":"10.1016/j.flowmeasinst.2025.103016","DOIUrl":null,"url":null,"abstract":"<div><div>Long-term incipient cavitation in hydraulic machinery poses significant reliability and safety risks, yet its timely detection remains challenging due to imprecise cavitation criterion, weak signals, and noise interference. To address these issues, this study proposes a lightweight and interpretable diagnostic framework based on Vision Transformer (ViT) for early-stage cavitation identification under complex and noisy conditions. Accurate cavitation labels are obtained through high-speed flow visualization, while the synchronously acquired multi-channel vibration signals are subjected to dimensionality reduction and fusion, and subsequently transformed into time–frequency representations to enhance cavitation-related features. A novel Adaptive Convolution (AC) block is developed to extract global and local information from time-frequency images, and combined with Mobile ViT block to construct the lightweight AM ViT model. The proposed model achieves 100 % accuracy across all eight cavitation states in noiseless conditions, with diagnosis time of 15.4 ms. Compared to the pure Transformer ViT-Base model (86M parameters) and the ResNet-18 model (11.7M parameters), the parameter of the proposed model is 5.8M. Even under signal-to-noise ratio (SNR) of −10 dB, the model improves the accuracy by 19.2 % compared to the baseline model. The model trained under flow rate 25 <span><math><mrow><msup><mi>m</mi><mn>3</mn></msup><mo>/</mo><mi>h</mi></mrow></math></span> generalizes effectively to 20 <span><math><mrow><msup><mi>m</mi><mn>3</mn></msup><mo>/</mo><mi>h</mi></mrow></math></span> and 30 <span><math><mrow><msup><mi>m</mi><mn>3</mn></msup><mo>/</mo><mi>h</mi></mrow></math></span> without retraining. Attention map shows that the model focuses on the 3000–5000 Hz band, which contains key cavitation-related features. Comparative experiments confirms that the proposed model consistently outperforms alternatives under various noise conditions. Sensor contribution analysis further indicates that x- and y-axis sensors near the impeller inlet provide the most informative features, offering guidance for optimal sensor layout. The proposed method establishes a correlation between vibration signals and cavitation visualization, offering potential applications in real-time, noise-resistant diagnostic systems.</div></div>","PeriodicalId":50440,"journal":{"name":"Flow Measurement and Instrumentation","volume":"106 ","pages":"Article 103016"},"PeriodicalIF":2.7000,"publicationDate":"2025-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A lightweight vision transformer framework integrated with flow visualization for incipient cavitation diagnosis in centrifugal pumps\",\"authors\":\"Bingyang Shang , Zheming Tong , Hao Liu\",\"doi\":\"10.1016/j.flowmeasinst.2025.103016\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Long-term incipient cavitation in hydraulic machinery poses significant reliability and safety risks, yet its timely detection remains challenging due to imprecise cavitation criterion, weak signals, and noise interference. To address these issues, this study proposes a lightweight and interpretable diagnostic framework based on Vision Transformer (ViT) for early-stage cavitation identification under complex and noisy conditions. Accurate cavitation labels are obtained through high-speed flow visualization, while the synchronously acquired multi-channel vibration signals are subjected to dimensionality reduction and fusion, and subsequently transformed into time–frequency representations to enhance cavitation-related features. A novel Adaptive Convolution (AC) block is developed to extract global and local information from time-frequency images, and combined with Mobile ViT block to construct the lightweight AM ViT model. The proposed model achieves 100 % accuracy across all eight cavitation states in noiseless conditions, with diagnosis time of 15.4 ms. Compared to the pure Transformer ViT-Base model (86M parameters) and the ResNet-18 model (11.7M parameters), the parameter of the proposed model is 5.8M. Even under signal-to-noise ratio (SNR) of −10 dB, the model improves the accuracy by 19.2 % compared to the baseline model. The model trained under flow rate 25 <span><math><mrow><msup><mi>m</mi><mn>3</mn></msup><mo>/</mo><mi>h</mi></mrow></math></span> generalizes effectively to 20 <span><math><mrow><msup><mi>m</mi><mn>3</mn></msup><mo>/</mo><mi>h</mi></mrow></math></span> and 30 <span><math><mrow><msup><mi>m</mi><mn>3</mn></msup><mo>/</mo><mi>h</mi></mrow></math></span> without retraining. Attention map shows that the model focuses on the 3000–5000 Hz band, which contains key cavitation-related features. Comparative experiments confirms that the proposed model consistently outperforms alternatives under various noise conditions. Sensor contribution analysis further indicates that x- and y-axis sensors near the impeller inlet provide the most informative features, offering guidance for optimal sensor layout. The proposed method establishes a correlation between vibration signals and cavitation visualization, offering potential applications in real-time, noise-resistant diagnostic systems.</div></div>\",\"PeriodicalId\":50440,\"journal\":{\"name\":\"Flow Measurement and Instrumentation\",\"volume\":\"106 \",\"pages\":\"Article 103016\"},\"PeriodicalIF\":2.7000,\"publicationDate\":\"2025-08-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Flow Measurement and Instrumentation\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0955598625002080\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, MECHANICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Flow Measurement and Instrumentation","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0955598625002080","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, MECHANICAL","Score":null,"Total":0}
A lightweight vision transformer framework integrated with flow visualization for incipient cavitation diagnosis in centrifugal pumps
Long-term incipient cavitation in hydraulic machinery poses significant reliability and safety risks, yet its timely detection remains challenging due to imprecise cavitation criterion, weak signals, and noise interference. To address these issues, this study proposes a lightweight and interpretable diagnostic framework based on Vision Transformer (ViT) for early-stage cavitation identification under complex and noisy conditions. Accurate cavitation labels are obtained through high-speed flow visualization, while the synchronously acquired multi-channel vibration signals are subjected to dimensionality reduction and fusion, and subsequently transformed into time–frequency representations to enhance cavitation-related features. A novel Adaptive Convolution (AC) block is developed to extract global and local information from time-frequency images, and combined with Mobile ViT block to construct the lightweight AM ViT model. The proposed model achieves 100 % accuracy across all eight cavitation states in noiseless conditions, with diagnosis time of 15.4 ms. Compared to the pure Transformer ViT-Base model (86M parameters) and the ResNet-18 model (11.7M parameters), the parameter of the proposed model is 5.8M. Even under signal-to-noise ratio (SNR) of −10 dB, the model improves the accuracy by 19.2 % compared to the baseline model. The model trained under flow rate 25 generalizes effectively to 20 and 30 without retraining. Attention map shows that the model focuses on the 3000–5000 Hz band, which contains key cavitation-related features. Comparative experiments confirms that the proposed model consistently outperforms alternatives under various noise conditions. Sensor contribution analysis further indicates that x- and y-axis sensors near the impeller inlet provide the most informative features, offering guidance for optimal sensor layout. The proposed method establishes a correlation between vibration signals and cavitation visualization, offering potential applications in real-time, noise-resistant diagnostic systems.
期刊介绍:
Flow Measurement and Instrumentation is dedicated to disseminating the latest research results on all aspects of flow measurement, in both closed conduits and open channels. The design of flow measurement systems involves a wide variety of multidisciplinary activities including modelling the flow sensor, the fluid flow and the sensor/fluid interactions through the use of computation techniques; the development of advanced transducer systems and their associated signal processing and the laboratory and field assessment of the overall system under ideal and disturbed conditions.
FMI is the essential forum for critical information exchange, and contributions are particularly encouraged in the following areas of interest:
Modelling: the application of mathematical and computational modelling to the interaction of fluid dynamics with flowmeters, including flowmeter behaviour, improved flowmeter design and installation problems. Application of CAD/CAE techniques to flowmeter modelling are eligible.
Design and development: the detailed design of the flowmeter head and/or signal processing aspects of novel flowmeters. Emphasis is given to papers identifying new sensor configurations, multisensor flow measurement systems, non-intrusive flow metering techniques and the application of microelectronic techniques in smart or intelligent systems.
Calibration techniques: including descriptions of new or existing calibration facilities and techniques, calibration data from different flowmeter types, and calibration intercomparison data from different laboratories.
Installation effect data: dealing with the effects of non-ideal flow conditions on flowmeters. Papers combining a theoretical understanding of flowmeter behaviour with experimental work are particularly welcome.