Xueyi Li , Daiyou Li , Tianyang Wang , Peng Yuan , Tianyu Yu
{"title":"Digital twin-driven imbalanced fault diagnosis method based on new distribution discrepancy metric and large margin aware focal loss","authors":"Xueyi Li , Daiyou Li , Tianyang Wang , Peng Yuan , Tianyu Yu","doi":"10.1016/j.simpat.2025.103160","DOIUrl":null,"url":null,"abstract":"<div><div>In actual industrial production, the frequency of gear failures is much lower than that in normal conditions. The scarcity of fault samples leads to a severe data imbalance problem, which significantly limits the performance of deep learning-based fault diagnosis methods. To address this issue, this paper proposes a digital twin-driven imbalanced fault diagnosis method based on a New Distribution Discrepancy Metric (NDDM) and Large Margin aware Focal (LMF) loss. First, a fault virtual data generation strategy based on digital twin technology is proposed. By analyzing the nonlinear dynamic characteristics of the gearbox, an effective virtual model of the gearbox is established, generating a large amount of high-quality virtual fault data to mitigate the data imbalance problem. Then, the NDDM is employed to simultaneously align the marginal distribution and subdomain conditional distribution by reducing the distribution discrepancy between the virtual and actual domains. Finally, the LMF is adopted to further enhance the model's fault diagnosis performance by focusing on hard samples and preserving more inclusive decision boundaries for fault categories. Experimental validation on two datasets demonstrates that the proposed method significantly outperforms other approaches in handling imbalanced data, providing a novel solution for effective gear fault diagnosis under data imbalance.</div></div>","PeriodicalId":49518,"journal":{"name":"Simulation Modelling Practice and Theory","volume":"143 ","pages":"Article 103160"},"PeriodicalIF":3.5000,"publicationDate":"2025-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Simulation Modelling Practice and Theory","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1569190X25000954","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
In actual industrial production, the frequency of gear failures is much lower than that in normal conditions. The scarcity of fault samples leads to a severe data imbalance problem, which significantly limits the performance of deep learning-based fault diagnosis methods. To address this issue, this paper proposes a digital twin-driven imbalanced fault diagnosis method based on a New Distribution Discrepancy Metric (NDDM) and Large Margin aware Focal (LMF) loss. First, a fault virtual data generation strategy based on digital twin technology is proposed. By analyzing the nonlinear dynamic characteristics of the gearbox, an effective virtual model of the gearbox is established, generating a large amount of high-quality virtual fault data to mitigate the data imbalance problem. Then, the NDDM is employed to simultaneously align the marginal distribution and subdomain conditional distribution by reducing the distribution discrepancy between the virtual and actual domains. Finally, the LMF is adopted to further enhance the model's fault diagnosis performance by focusing on hard samples and preserving more inclusive decision boundaries for fault categories. Experimental validation on two datasets demonstrates that the proposed method significantly outperforms other approaches in handling imbalanced data, providing a novel solution for effective gear fault diagnosis under data imbalance.
期刊介绍:
The journal Simulation Modelling Practice and Theory provides a forum for original, high-quality papers dealing with any aspect of systems simulation and modelling.
The journal aims at being a reference and a powerful tool to all those professionally active and/or interested in the methods and applications of simulation. Submitted papers will be peer reviewed and must significantly contribute to modelling and simulation in general or use modelling and simulation in application areas.
Paper submission is solicited on:
• theoretical aspects of modelling and simulation including formal modelling, model-checking, random number generators, sensitivity analysis, variance reduction techniques, experimental design, meta-modelling, methods and algorithms for validation and verification, selection and comparison procedures etc.;
• methodology and application of modelling and simulation in any area, including computer systems, networks, real-time and embedded systems, mobile and intelligent agents, manufacturing and transportation systems, management, engineering, biomedical engineering, economics, ecology and environment, education, transaction handling, etc.;
• simulation languages and environments including those, specific to distributed computing, grid computing, high performance computers or computer networks, etc.;
• distributed and real-time simulation, simulation interoperability;
• tools for high performance computing simulation, including dedicated architectures and parallel computing.