Digital twin-driven imbalanced fault diagnosis method based on new distribution discrepancy metric and large margin aware focal loss

IF 3.5 2区 计算机科学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
Xueyi Li , Daiyou Li , Tianyang Wang , Peng Yuan , Tianyu Yu
{"title":"Digital twin-driven imbalanced fault diagnosis method based on new distribution discrepancy metric and large margin aware focal loss","authors":"Xueyi Li ,&nbsp;Daiyou Li ,&nbsp;Tianyang Wang ,&nbsp;Peng Yuan ,&nbsp;Tianyu Yu","doi":"10.1016/j.simpat.2025.103160","DOIUrl":null,"url":null,"abstract":"<div><div>In actual industrial production, the frequency of gear failures is much lower than that in normal conditions. The scarcity of fault samples leads to a severe data imbalance problem, which significantly limits the performance of deep learning-based fault diagnosis methods. To address this issue, this paper proposes a digital twin-driven imbalanced fault diagnosis method based on a New Distribution Discrepancy Metric (NDDM) and Large Margin aware Focal (LMF) loss. First, a fault virtual data generation strategy based on digital twin technology is proposed. By analyzing the nonlinear dynamic characteristics of the gearbox, an effective virtual model of the gearbox is established, generating a large amount of high-quality virtual fault data to mitigate the data imbalance problem. Then, the NDDM is employed to simultaneously align the marginal distribution and subdomain conditional distribution by reducing the distribution discrepancy between the virtual and actual domains. Finally, the LMF is adopted to further enhance the model's fault diagnosis performance by focusing on hard samples and preserving more inclusive decision boundaries for fault categories. Experimental validation on two datasets demonstrates that the proposed method significantly outperforms other approaches in handling imbalanced data, providing a novel solution for effective gear fault diagnosis under data imbalance.</div></div>","PeriodicalId":49518,"journal":{"name":"Simulation Modelling Practice and Theory","volume":"143 ","pages":"Article 103160"},"PeriodicalIF":3.5000,"publicationDate":"2025-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Simulation Modelling Practice and Theory","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1569190X25000954","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

In actual industrial production, the frequency of gear failures is much lower than that in normal conditions. The scarcity of fault samples leads to a severe data imbalance problem, which significantly limits the performance of deep learning-based fault diagnosis methods. To address this issue, this paper proposes a digital twin-driven imbalanced fault diagnosis method based on a New Distribution Discrepancy Metric (NDDM) and Large Margin aware Focal (LMF) loss. First, a fault virtual data generation strategy based on digital twin technology is proposed. By analyzing the nonlinear dynamic characteristics of the gearbox, an effective virtual model of the gearbox is established, generating a large amount of high-quality virtual fault data to mitigate the data imbalance problem. Then, the NDDM is employed to simultaneously align the marginal distribution and subdomain conditional distribution by reducing the distribution discrepancy between the virtual and actual domains. Finally, the LMF is adopted to further enhance the model's fault diagnosis performance by focusing on hard samples and preserving more inclusive decision boundaries for fault categories. Experimental validation on two datasets demonstrates that the proposed method significantly outperforms other approaches in handling imbalanced data, providing a novel solution for effective gear fault diagnosis under data imbalance.
基于新分布差异度量和大裕度感知焦损的数字双驱动不平衡故障诊断方法
在实际工业生产中,齿轮故障的发生频率远低于正常情况下。故障样本的稀缺性导致了严重的数据不平衡问题,极大地限制了基于深度学习的故障诊断方法的性能。针对这一问题,提出了一种基于新分布差异度量(NDDM)和大裕度感知焦点(LMF)损失的双驱动不平衡故障诊断方法。首先,提出了基于数字孪生技术的故障虚拟数据生成策略。通过分析齿轮箱的非线性动态特性,建立了有效的齿轮箱虚拟模型,生成了大量高质量的虚拟故障数据,缓解了数据不平衡问题。然后,通过减小虚拟域与实际域之间的分布差异,利用NDDM对边缘分布和子域条件分布进行同步对齐;最后,通过关注硬样本并为故障类别保留更具包容性的决策边界,采用LMF进一步提高模型的故障诊断性能。在两个数据集上的实验验证表明,该方法在处理不平衡数据方面明显优于其他方法,为数据不平衡下齿轮故障的有效诊断提供了一种新的解决方案。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Simulation Modelling Practice and Theory
Simulation Modelling Practice and Theory 工程技术-计算机:跨学科应用
CiteScore
9.80
自引率
4.80%
发文量
142
审稿时长
21 days
期刊介绍: The journal Simulation Modelling Practice and Theory provides a forum for original, high-quality papers dealing with any aspect of systems simulation and modelling. The journal aims at being a reference and a powerful tool to all those professionally active and/or interested in the methods and applications of simulation. Submitted papers will be peer reviewed and must significantly contribute to modelling and simulation in general or use modelling and simulation in application areas. Paper submission is solicited on: • theoretical aspects of modelling and simulation including formal modelling, model-checking, random number generators, sensitivity analysis, variance reduction techniques, experimental design, meta-modelling, methods and algorithms for validation and verification, selection and comparison procedures etc.; • methodology and application of modelling and simulation in any area, including computer systems, networks, real-time and embedded systems, mobile and intelligent agents, manufacturing and transportation systems, management, engineering, biomedical engineering, economics, ecology and environment, education, transaction handling, etc.; • simulation languages and environments including those, specific to distributed computing, grid computing, high performance computers or computer networks, etc.; • distributed and real-time simulation, simulation interoperability; • tools for high performance computing simulation, including dedicated architectures and parallel computing.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信