Novel imbalanced multi-class fault diagnosis method using transfer learning and oversampling strategies-based multi-layer support vector machines (ML-SVMs)
IF 7.2 1区 计算机科学Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
{"title":"Novel imbalanced multi-class fault diagnosis method using transfer learning and oversampling strategies-based multi-layer support vector machines (ML-SVMs)","authors":"Jianan Wei , Hualin Chen , Yage Yuan , Haisong Huang , Long Wen , Weidong Jiao","doi":"10.1016/j.asoc.2024.112324","DOIUrl":null,"url":null,"abstract":"<div><div>For health monitoring and fault diagnosis of critical mechanical system components, historical data related to equipment failures are often limited and exhibit varying imbalanced multi-class characteristics (e.g., with noisy and time-series data). Moreover, fault diagnosis frameworks based on traditional resampling algorithms (e.g., SMOTE) mostly heavily rely on manual feature extraction, making them difficult to adapt to diverse working conditions or objects. To address these challenges, we propose a novel end-to-end imbalanced multi-class fault diagnosis architecture using transfer learning and oversampling strategies-based multi-layer support vector machines (ML-SVMs). ML-SVMs utilize a VGG-based deep migration feature extraction method to extract features from original time-domain vibration signals, employing natural source domain weights to reduce dependence on human experience and sample size. Then, ML-SVMs introduce ISCOTE (i.e., the first and second layers of ML-SVMs), an improved version of the sample-characteristic over-sampling technique (SCOTE). ISCOTE generates more effective and reasonable samples for each fault class through a scaling factor and iterative optimization mechanism, whether in noisy feature spaces with fuzzy boundaries or in clear boundary feature spaces. Finally, in the third layer of ML-SVMs, multi-class SVMs (e.g., LS-SVMs and standard SVMs) are utilized to train balanced feature samples and derive classification models with strong generalization ability. The effectiveness of ML-SVMs is demonstrated through 16 fault diagnosis instances using CWRU and IMS bearing data, PHM 2010 and TTWD tool wear data. Results indicate that ML-SVMs outperform 8 well-known oversampling-based algorithms in fault diagnosis recognition rates and algorithm robustness. It has offered a feasible architecture for multi-class imbalanced fault scenarios with limited data and multiple adverse features.</div></div>","PeriodicalId":50737,"journal":{"name":"Applied Soft Computing","volume":null,"pages":null},"PeriodicalIF":7.2000,"publicationDate":"2024-10-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1568494624010986","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
For health monitoring and fault diagnosis of critical mechanical system components, historical data related to equipment failures are often limited and exhibit varying imbalanced multi-class characteristics (e.g., with noisy and time-series data). Moreover, fault diagnosis frameworks based on traditional resampling algorithms (e.g., SMOTE) mostly heavily rely on manual feature extraction, making them difficult to adapt to diverse working conditions or objects. To address these challenges, we propose a novel end-to-end imbalanced multi-class fault diagnosis architecture using transfer learning and oversampling strategies-based multi-layer support vector machines (ML-SVMs). ML-SVMs utilize a VGG-based deep migration feature extraction method to extract features from original time-domain vibration signals, employing natural source domain weights to reduce dependence on human experience and sample size. Then, ML-SVMs introduce ISCOTE (i.e., the first and second layers of ML-SVMs), an improved version of the sample-characteristic over-sampling technique (SCOTE). ISCOTE generates more effective and reasonable samples for each fault class through a scaling factor and iterative optimization mechanism, whether in noisy feature spaces with fuzzy boundaries or in clear boundary feature spaces. Finally, in the third layer of ML-SVMs, multi-class SVMs (e.g., LS-SVMs and standard SVMs) are utilized to train balanced feature samples and derive classification models with strong generalization ability. The effectiveness of ML-SVMs is demonstrated through 16 fault diagnosis instances using CWRU and IMS bearing data, PHM 2010 and TTWD tool wear data. Results indicate that ML-SVMs outperform 8 well-known oversampling-based algorithms in fault diagnosis recognition rates and algorithm robustness. It has offered a feasible architecture for multi-class imbalanced fault scenarios with limited data and multiple adverse features.
期刊介绍:
Applied Soft Computing is an international journal promoting an integrated view of soft computing to solve real life problems.The focus is to publish the highest quality research in application and convergence of the areas of Fuzzy Logic, Neural Networks, Evolutionary Computing, Rough Sets and other similar techniques to address real world complexities.
Applied Soft Computing is a rolling publication: articles are published as soon as the editor-in-chief has accepted them. Therefore, the web site will continuously be updated with new articles and the publication time will be short.