Zhenyu Liu , Haowen Zheng , Hui Liu , Weiqiang Jia , Jianrong Tan
{"title":"Label-free evaluation for performance of fault diagnosis model on unknown distribution dataset","authors":"Zhenyu Liu , Haowen Zheng , Hui Liu , Weiqiang Jia , Jianrong Tan","doi":"10.1016/j.aei.2024.102912","DOIUrl":null,"url":null,"abstract":"<div><div>Real-time data may undergo distribution drift due to changes in operating conditions and other factors, which can affect the classification accuracy of online fault diagnosis models and potentially lead to serious consequences. Therefore, understanding the classification accuracy of the model on real-time data holds substantial significance. However, the absence of labels in real-time data presents a challenge for evaluating classification accuracy. Furthermore, the real-time nature of fault diagnosis necessitates a swift and straightforward evaluation method. For the above reasons, this paper proposes a method for evaluating the classification accuracy of a model on real-time data, which is done in the absence of labels for the real-time data. The proposed label-free evaluation method transforms the model’s output into a scalar that measures the degree of matching between the classification probabilities, termed the average free energy. It then establishes a mapping between the average free energy and the classification accuracy using an auxiliary dataset. Finally, it predicts the model’s classification accuracy on the real-time data through this mapping and the average free energy of the real-time data. The proposed method is experimentally evaluated on public datasets, demonstrating its superiority in various aspects.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"62 ","pages":"Article 102912"},"PeriodicalIF":8.0000,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Engineering Informatics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1474034624005639","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Real-time data may undergo distribution drift due to changes in operating conditions and other factors, which can affect the classification accuracy of online fault diagnosis models and potentially lead to serious consequences. Therefore, understanding the classification accuracy of the model on real-time data holds substantial significance. However, the absence of labels in real-time data presents a challenge for evaluating classification accuracy. Furthermore, the real-time nature of fault diagnosis necessitates a swift and straightforward evaluation method. For the above reasons, this paper proposes a method for evaluating the classification accuracy of a model on real-time data, which is done in the absence of labels for the real-time data. The proposed label-free evaluation method transforms the model’s output into a scalar that measures the degree of matching between the classification probabilities, termed the average free energy. It then establishes a mapping between the average free energy and the classification accuracy using an auxiliary dataset. Finally, it predicts the model’s classification accuracy on the real-time data through this mapping and the average free energy of the real-time data. The proposed method is experimentally evaluated on public datasets, demonstrating its superiority in various aspects.
期刊介绍:
Advanced Engineering Informatics is an international Journal that solicits research papers with an emphasis on 'knowledge' and 'engineering applications'. The Journal seeks original papers that report progress in applying methods of engineering informatics. These papers should have engineering relevance and help provide a scientific base for more reliable, spontaneous, and creative engineering decision-making. Additionally, papers should demonstrate the science of supporting knowledge-intensive engineering tasks and validate the generality, power, and scalability of new methods through rigorous evaluation, preferably both qualitatively and quantitatively. Abstracting and indexing for Advanced Engineering Informatics include Science Citation Index Expanded, Scopus and INSPEC.