一种同时考虑属性噪声和标签噪声的高维序列数据噪声样本检测方法

IF 6.7 1区 工程技术 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
Liang Chang , Lu-Xin Guan , Enrico Zio , Yan-Hui Lin
{"title":"一种同时考虑属性噪声和标签噪声的高维序列数据噪声样本检测方法","authors":"Liang Chang ,&nbsp;Lu-Xin Guan ,&nbsp;Enrico Zio ,&nbsp;Yan-Hui Lin","doi":"10.1016/j.cie.2025.111064","DOIUrl":null,"url":null,"abstract":"<div><div>The high-dimensional sequential data available across various industrial scenarios may be contaminated with both attribute and label noise, hindering the establishment of accurate deep learning-based prediction models. The existing noise detection methods can only detect one type of noise. Conversely, in this article, a novel noisy samples detection method is proposed to detect both types of noise simultaneously through generative learning. An enhanced variational recurrent prediction model (EVRPM) is proposed to model the log-likelihood of samples, which incorporates a label predictor and an auxiliary task into the variational recurrent neural network. Moreover, an iterative detection process is adopted to refine EVRPM training and enhance noisy sample detection, which is particularly beneficial for low-quality datasets. A prediction model with higher prediction accuracy can be obtained using the refined dataset. The effectiveness and superiority of the proposed method are verified using both public and real experimental datasets.</div></div>","PeriodicalId":55220,"journal":{"name":"Computers & Industrial Engineering","volume":"203 ","pages":"Article 111064"},"PeriodicalIF":6.7000,"publicationDate":"2025-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A novel method of detection of noisy samples in high-dimensional sequential data considering both attribute and label noise\",\"authors\":\"Liang Chang ,&nbsp;Lu-Xin Guan ,&nbsp;Enrico Zio ,&nbsp;Yan-Hui Lin\",\"doi\":\"10.1016/j.cie.2025.111064\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The high-dimensional sequential data available across various industrial scenarios may be contaminated with both attribute and label noise, hindering the establishment of accurate deep learning-based prediction models. The existing noise detection methods can only detect one type of noise. Conversely, in this article, a novel noisy samples detection method is proposed to detect both types of noise simultaneously through generative learning. An enhanced variational recurrent prediction model (EVRPM) is proposed to model the log-likelihood of samples, which incorporates a label predictor and an auxiliary task into the variational recurrent neural network. Moreover, an iterative detection process is adopted to refine EVRPM training and enhance noisy sample detection, which is particularly beneficial for low-quality datasets. A prediction model with higher prediction accuracy can be obtained using the refined dataset. The effectiveness and superiority of the proposed method are verified using both public and real experimental datasets.</div></div>\",\"PeriodicalId\":55220,\"journal\":{\"name\":\"Computers & Industrial Engineering\",\"volume\":\"203 \",\"pages\":\"Article 111064\"},\"PeriodicalIF\":6.7000,\"publicationDate\":\"2025-03-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers & Industrial Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0360835225002104\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Industrial Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0360835225002104","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

摘要

在各种工业场景中可用的高维序列数据可能受到属性和标签噪声的污染,阻碍了基于深度学习的准确预测模型的建立。现有的噪声检测方法只能检测一种类型的噪声。相反,本文提出了一种新的噪声样本检测方法,通过生成学习同时检测两种类型的噪声。提出了一种增强的变分递归预测模型(EVRPM),该模型将标签预测器和辅助任务结合到变分递归神经网络中,对样本的对数似然进行建模。此外,采用迭代检测过程来细化EVRPM训练,增强噪声样本检测能力,尤其有利于低质量数据集的检测。利用改进后的数据集可以得到具有较高预测精度的预测模型。通过公开和真实的实验数据验证了该方法的有效性和优越性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A novel method of detection of noisy samples in high-dimensional sequential data considering both attribute and label noise
The high-dimensional sequential data available across various industrial scenarios may be contaminated with both attribute and label noise, hindering the establishment of accurate deep learning-based prediction models. The existing noise detection methods can only detect one type of noise. Conversely, in this article, a novel noisy samples detection method is proposed to detect both types of noise simultaneously through generative learning. An enhanced variational recurrent prediction model (EVRPM) is proposed to model the log-likelihood of samples, which incorporates a label predictor and an auxiliary task into the variational recurrent neural network. Moreover, an iterative detection process is adopted to refine EVRPM training and enhance noisy sample detection, which is particularly beneficial for low-quality datasets. A prediction model with higher prediction accuracy can be obtained using the refined dataset. The effectiveness and superiority of the proposed method are verified using both public and real experimental datasets.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Computers & Industrial Engineering
Computers & Industrial Engineering 工程技术-工程:工业
CiteScore
12.70
自引率
12.70%
发文量
794
审稿时长
10.6 months
期刊介绍: Computers & Industrial Engineering (CAIE) is dedicated to researchers, educators, and practitioners in industrial engineering and related fields. Pioneering the integration of computers in research, education, and practice, industrial engineering has evolved to make computers and electronic communication integral to its domain. CAIE publishes original contributions focusing on the development of novel computerized methodologies to address industrial engineering problems. It also highlights the applications of these methodologies to issues within the broader industrial engineering and associated communities. The journal actively encourages submissions that push the boundaries of fundamental theories and concepts in industrial engineering techniques.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信