{"title":"A Novel Adaptive System-Level Fault Self-Diagnosis Algorithm and Its Applications","authors":"Fuxing Liao;Jiafei Liu;Chia-Wei Lee;Sun-Yuan Hsieh;Jingli Wu","doi":"10.1109/TR.2025.3553903","DOIUrl":null,"url":null,"abstract":"With the application and rapid development of high-performance computing and cloud computing technology, the scale of the interconnection network has appeared to grow exponentially. Network attacks have become increasingly sophisticated and stealthy. To reach a high reliable network system, widespread attention has been paid to fault diagnosis. In this article, we put forward a reliable and adaptive self-diagnosis strategy, the <inline-formula><tex-math>$h$</tex-math></inline-formula>-extra <inline-formula><tex-math>$r$</tex-math></inline-formula>-component conditional diagnosability, denoted by <inline-formula><tex-math>$ct_{r}^{h}(G)$</tex-math></inline-formula>. Then, we provide a theoretical derivation to characterize the <inline-formula><tex-math>$h$</tex-math></inline-formula>-extra <inline-formula><tex-math>$r$</tex-math></inline-formula>-component conditional diagnosability of bubble sort networks <inline-formula><tex-math>$B_{n}$</tex-math></inline-formula> under the PMC model. Furthermore, we develop a fast and adaptive fault self-diagnosis algorithm FAFD-PMC to detect all faulty units. Extensive experiments are implemented and applied to synthetic networks and real networks in terms of accuracy (ACCR), true negative rate, false positive rate, recall, and precision, which demonstrates the ACCR/efficiency of our algorithm.","PeriodicalId":56305,"journal":{"name":"IEEE Transactions on Reliability","volume":"74 3","pages":"4294-4305"},"PeriodicalIF":5.7000,"publicationDate":"2025-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Reliability","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10959718/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0
Abstract
With the application and rapid development of high-performance computing and cloud computing technology, the scale of the interconnection network has appeared to grow exponentially. Network attacks have become increasingly sophisticated and stealthy. To reach a high reliable network system, widespread attention has been paid to fault diagnosis. In this article, we put forward a reliable and adaptive self-diagnosis strategy, the $h$-extra $r$-component conditional diagnosability, denoted by $ct_{r}^{h}(G)$. Then, we provide a theoretical derivation to characterize the $h$-extra $r$-component conditional diagnosability of bubble sort networks $B_{n}$ under the PMC model. Furthermore, we develop a fast and adaptive fault self-diagnosis algorithm FAFD-PMC to detect all faulty units. Extensive experiments are implemented and applied to synthetic networks and real networks in terms of accuracy (ACCR), true negative rate, false positive rate, recall, and precision, which demonstrates the ACCR/efficiency of our algorithm.
期刊介绍:
IEEE Transactions on Reliability is a refereed journal for the reliability and allied disciplines including, but not limited to, maintainability, physics of failure, life testing, prognostics, design and manufacture for reliability, reliability for systems of systems, network availability, mission success, warranty, safety, and various measures of effectiveness. Topics eligible for publication range from hardware to software, from materials to systems, from consumer and industrial devices to manufacturing plants, from individual items to networks, from techniques for making things better to ways of predicting and measuring behavior in the field. As an engineering subject that supports new and existing technologies, we constantly expand into new areas of the assurance sciences.