在未知分布数据集上对故障诊断模型的性能进行无标签评估

IF 8 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Zhenyu Liu , Haowen Zheng , Hui Liu , Weiqiang Jia , Jianrong Tan
{"title":"在未知分布数据集上对故障诊断模型的性能进行无标签评估","authors":"Zhenyu Liu ,&nbsp;Haowen Zheng ,&nbsp;Hui Liu ,&nbsp;Weiqiang Jia ,&nbsp;Jianrong Tan","doi":"10.1016/j.aei.2024.102912","DOIUrl":null,"url":null,"abstract":"<div><div>Real-time data may undergo distribution drift due to changes in operating conditions and other factors, which can affect the classification accuracy of online fault diagnosis models and potentially lead to serious consequences. Therefore, understanding the classification accuracy of the model on real-time data holds substantial significance. However, the absence of labels in real-time data presents a challenge for evaluating classification accuracy. Furthermore, the real-time nature of fault diagnosis necessitates a swift and straightforward evaluation method. For the above reasons, this paper proposes a method for evaluating the classification accuracy of a model on real-time data, which is done in the absence of labels for the real-time data. The proposed label-free evaluation method transforms the model’s output into a scalar that measures the degree of matching between the classification probabilities, termed the average free energy. It then establishes a mapping between the average free energy and the classification accuracy using an auxiliary dataset. Finally, it predicts the model’s classification accuracy on the real-time data through this mapping and the average free energy of the real-time data. The proposed method is experimentally evaluated on public datasets, demonstrating its superiority in various aspects.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"62 ","pages":"Article 102912"},"PeriodicalIF":8.0000,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Label-free evaluation for performance of fault diagnosis model on unknown distribution dataset\",\"authors\":\"Zhenyu Liu ,&nbsp;Haowen Zheng ,&nbsp;Hui Liu ,&nbsp;Weiqiang Jia ,&nbsp;Jianrong Tan\",\"doi\":\"10.1016/j.aei.2024.102912\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Real-time data may undergo distribution drift due to changes in operating conditions and other factors, which can affect the classification accuracy of online fault diagnosis models and potentially lead to serious consequences. Therefore, understanding the classification accuracy of the model on real-time data holds substantial significance. However, the absence of labels in real-time data presents a challenge for evaluating classification accuracy. Furthermore, the real-time nature of fault diagnosis necessitates a swift and straightforward evaluation method. For the above reasons, this paper proposes a method for evaluating the classification accuracy of a model on real-time data, which is done in the absence of labels for the real-time data. The proposed label-free evaluation method transforms the model’s output into a scalar that measures the degree of matching between the classification probabilities, termed the average free energy. It then establishes a mapping between the average free energy and the classification accuracy using an auxiliary dataset. Finally, it predicts the model’s classification accuracy on the real-time data through this mapping and the average free energy of the real-time data. The proposed method is experimentally evaluated on public datasets, demonstrating its superiority in various aspects.</div></div>\",\"PeriodicalId\":50941,\"journal\":{\"name\":\"Advanced Engineering Informatics\",\"volume\":\"62 \",\"pages\":\"Article 102912\"},\"PeriodicalIF\":8.0000,\"publicationDate\":\"2024-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Advanced Engineering Informatics\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1474034624005639\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Engineering Informatics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1474034624005639","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

摘要

实时数据可能会因运行条件变化和其他因素而发生分布漂移,从而影响在线故障诊断模型的分类准确性,并可能导致严重后果。因此,了解模型在实时数据上的分类准确性具有重要意义。然而,实时数据中没有标签,这给评估分类准确性带来了挑战。此外,故障诊断的实时性要求采用快速、直接的评估方法。基于上述原因,本文提出了一种在实时数据中评估模型分类准确性的方法,这种方法是在实时数据没有标签的情况下完成的。本文提出的无标签评估方法将模型的输出转化为一个标量,用来衡量分类概率之间的匹配程度,即平均自由能。然后,利用辅助数据集建立平均自由能与分类准确率之间的映射关系。最后,通过该映射和实时数据的平均自由能,预测模型在实时数据上的分类准确率。所提出的方法在公共数据集上进行了实验评估,证明了其在各方面的优越性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Label-free evaluation for performance of fault diagnosis model on unknown distribution dataset
Real-time data may undergo distribution drift due to changes in operating conditions and other factors, which can affect the classification accuracy of online fault diagnosis models and potentially lead to serious consequences. Therefore, understanding the classification accuracy of the model on real-time data holds substantial significance. However, the absence of labels in real-time data presents a challenge for evaluating classification accuracy. Furthermore, the real-time nature of fault diagnosis necessitates a swift and straightforward evaluation method. For the above reasons, this paper proposes a method for evaluating the classification accuracy of a model on real-time data, which is done in the absence of labels for the real-time data. The proposed label-free evaluation method transforms the model’s output into a scalar that measures the degree of matching between the classification probabilities, termed the average free energy. It then establishes a mapping between the average free energy and the classification accuracy using an auxiliary dataset. Finally, it predicts the model’s classification accuracy on the real-time data through this mapping and the average free energy of the real-time data. The proposed method is experimentally evaluated on public datasets, demonstrating its superiority in various aspects.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Advanced Engineering Informatics
Advanced Engineering Informatics 工程技术-工程:综合
CiteScore
12.40
自引率
18.20%
发文量
292
审稿时长
45 days
期刊介绍: Advanced Engineering Informatics is an international Journal that solicits research papers with an emphasis on 'knowledge' and 'engineering applications'. The Journal seeks original papers that report progress in applying methods of engineering informatics. These papers should have engineering relevance and help provide a scientific base for more reliable, spontaneous, and creative engineering decision-making. Additionally, papers should demonstrate the science of supporting knowledge-intensive engineering tasks and validate the generality, power, and scalability of new methods through rigorous evaluation, preferably both qualitatively and quantitatively. Abstracting and indexing for Advanced Engineering Informatics include Science Citation Index Expanded, Scopus and INSPEC.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信