轴承故障诊断的集成框架：基于知识蒸馏的卷积神经网络模型压缩

IF 4.3 2区综合性期刊 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC

IEEE Sensors Journal Pub Date : 2024-10-21 DOI:10.1109/JSEN.2024.3481298

Jun Ma;Wei Cai;Yuhao Shan;Yuting Xia;Runtong Zhang

{"title":"轴承故障诊断的集成框架：基于知识蒸馏的卷积神经网络模型压缩","authors":"Jun Ma;Wei Cai;Yuhao Shan;Yuting Xia;Runtong Zhang","doi":"10.1109/JSEN.2024.3481298","DOIUrl":null,"url":null,"abstract":"The industrial application of rolling bearing fault diagnosis necessitates achieving high classification accuracy while minimizing the number of model parameters to reduce the computational resources and storage space required for the model. To meet this requirement, this study proposes a knowledge distillation convolutional neural network-deep forest (KDCNN-DF) hybrid model framework. The proposed method integrates the continuous wavelet transform (CWT) for signal data processing, a convolutional neural network (CNN) optimized by knowledge distillation (KD) for feature extraction, and a simplified multigranular scanning (MGS) process using deep forest (DF) for fault classification. Besides, during the construction of the student models, this study found that the arrangement order of kernel sizes in the CNN convolutional layers significantly impacts the extraction of bearing fault features. Experimental validation confirmed that architecture with a smaller kernel size preceding a larger kernel size in shallow-level models is more effective. This effect is particularly pronounced after the KD process and adoption in hybrid models, resulting in higher classification accuracy. The proposed KD method reduces the parameter count of the CNN model to 5% of the original number while maintaining relatively high accuracy and significantly reducing computing time. In addition, the modeling architecture of DF has been simplified by adopting a streamlined MGS process. The proposed model achieves the highest accuracy on the original Case Western Reserve University (CWRU) datasets, with 99.75% on the 48-kHz dataset, 99.90% on the 12-kHz dataset, and a perfect 100% on the Ottawa dataset. These results surpass the accuracy of existing methods.","PeriodicalId":447,"journal":{"name":"IEEE Sensors Journal","volume":"24 23","pages":"40083-40095"},"PeriodicalIF":4.3000,"publicationDate":"2024-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An Integrated Framework for Bearing Fault Diagnosis: Convolutional Neural Network Model Compression Through Knowledge Distillation\",\"authors\":\"Jun Ma;Wei Cai;Yuhao Shan;Yuting Xia;Runtong Zhang\",\"doi\":\"10.1109/JSEN.2024.3481298\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The industrial application of rolling bearing fault diagnosis necessitates achieving high classification accuracy while minimizing the number of model parameters to reduce the computational resources and storage space required for the model. To meet this requirement, this study proposes a knowledge distillation convolutional neural network-deep forest (KDCNN-DF) hybrid model framework. The proposed method integrates the continuous wavelet transform (CWT) for signal data processing, a convolutional neural network (CNN) optimized by knowledge distillation (KD) for feature extraction, and a simplified multigranular scanning (MGS) process using deep forest (DF) for fault classification. Besides, during the construction of the student models, this study found that the arrangement order of kernel sizes in the CNN convolutional layers significantly impacts the extraction of bearing fault features. Experimental validation confirmed that architecture with a smaller kernel size preceding a larger kernel size in shallow-level models is more effective. This effect is particularly pronounced after the KD process and adoption in hybrid models, resulting in higher classification accuracy. The proposed KD method reduces the parameter count of the CNN model to 5% of the original number while maintaining relatively high accuracy and significantly reducing computing time. In addition, the modeling architecture of DF has been simplified by adopting a streamlined MGS process. The proposed model achieves the highest accuracy on the original Case Western Reserve University (CWRU) datasets, with 99.75% on the 48-kHz dataset, 99.90% on the 12-kHz dataset, and a perfect 100% on the Ottawa dataset. These results surpass the accuracy of existing methods.\",\"PeriodicalId\":447,\"journal\":{\"name\":\"IEEE Sensors Journal\",\"volume\":\"24 23\",\"pages\":\"40083-40095\"},\"PeriodicalIF\":4.3000,\"publicationDate\":\"2024-10-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Sensors Journal\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10726702/\",\"RegionNum\":2,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Sensors Journal","FirstCategoryId":"103","ListUrlMain":"https://ieeexplore.ieee.org/document/10726702/","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

摘要

滚动轴承故障诊断的工业应用要求在实现高分类精度的同时，尽量减少模型参数的数量，以减少模型所需的计算资源和存储空间。为了满足这一需求，本研究提出了一种知识蒸馏卷积神经网络-深度森林（KDCNN-DF）混合模型框架。该方法将连续小波变换（CWT）用于信号数据处理、知识蒸馏（KD）优化的卷积神经网络（CNN）用于特征提取以及基于深度森林（DF）的简化多颗粒扫描（MGS）过程用于故障分类相结合。此外，在构建学生模型的过程中，本研究发现CNN卷积层核尺寸的排列顺序对轴承故障特征的提取有显著影响。实验验证表明，在浅层模型中，较小的核尺寸先于较大的核尺寸的体系结构更为有效。这种效应在KD处理和混合模型采用后尤为明显，分类精度更高。本文提出的KD方法将CNN模型的参数数量减少到原始数量的5%，同时保持了较高的准确率，显著减少了计算时间。此外，通过采用流线型的MGS流程，简化了DF的建模体系结构。该模型在原始的凯斯西储大学（CWRU）数据集上达到了最高的精度，在48khz数据集上达到99.75%，在12khz数据集上达到99.90%，在渥太华数据集上达到了完美的100%。这些结果超过了现有方法的准确性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

An Integrated Framework for Bearing Fault Diagnosis: Convolutional Neural Network Model Compression Through Knowledge Distillation

The industrial application of rolling bearing fault diagnosis necessitates achieving high classification accuracy while minimizing the number of model parameters to reduce the computational resources and storage space required for the model. To meet this requirement, this study proposes a knowledge distillation convolutional neural network-deep forest (KDCNN-DF) hybrid model framework. The proposed method integrates the continuous wavelet transform (CWT) for signal data processing, a convolutional neural network (CNN) optimized by knowledge distillation (KD) for feature extraction, and a simplified multigranular scanning (MGS) process using deep forest (DF) for fault classification. Besides, during the construction of the student models, this study found that the arrangement order of kernel sizes in the CNN convolutional layers significantly impacts the extraction of bearing fault features. Experimental validation confirmed that architecture with a smaller kernel size preceding a larger kernel size in shallow-level models is more effective. This effect is particularly pronounced after the KD process and adoption in hybrid models, resulting in higher classification accuracy. The proposed KD method reduces the parameter count of the CNN model to 5% of the original number while maintaining relatively high accuracy and significantly reducing computing time. In addition, the modeling architecture of DF has been simplified by adopting a streamlined MGS process. The proposed model achieves the highest accuracy on the original Case Western Reserve University (CWRU) datasets, with 99.75% on the 48-kHz dataset, 99.90% on the 12-kHz dataset, and a perfect 100% on the Ottawa dataset. These results surpass the accuracy of existing methods.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE Sensors Journal 工程技术-工程：电子与电气

CiteScore

7.70

自引率

14.00%

发文量

2058

审稿时长

5.2 months

期刊介绍： The fields of interest of the IEEE Sensors Journal are the theory, design , fabrication, manufacturing and applications of devices for sensing and transducing physical, chemical and biological phenomena, with emphasis on the electronics and physics aspect of sensors and integrated sensors-actuators. IEEE Sensors Journal deals with the following: -Sensor Phenomenology, Modelling, and Evaluation -Sensor Materials, Processing, and Fabrication -Chemical and Gas Sensors -Microfluidics and Biosensors -Optical Sensors -Physical Sensors: Temperature, Mechanical, Magnetic, and others -Acoustic and Ultrasonic Sensors -Sensor Packaging -Sensor Networks -Sensor Applications -Sensor Systems: Signals, Processing, and Interfaces -Actuators and Sensor Power Systems -Sensor Signal Processing for high precision and stability (amplification, filtering, linearization, modulation/demodulation) and under harsh conditions (EMC, radiation, humidity, temperature); energy consumption/harvesting -Sensor Data Processing (soft computing with sensor data, e.g., pattern recognition, machine learning, evolutionary computation; sensor data fusion, processing of wave e.g., electromagnetic and acoustic; and non-wave, e.g., chemical, gravity, particle, thermal, radiative and non-radiative sensor data, detection, estimation and classification based on sensor data) -Sensors in Industrial Practice