Hybrid Label Noise Correction Algorithm For Medical Auxiliary Diagnosis

Jiwei Xu, Yun Yang, Po Yang
{"title":"Hybrid Label Noise Correction Algorithm For Medical Auxiliary Diagnosis","authors":"Jiwei Xu, Yun Yang, Po Yang","doi":"10.1109/INDIN45582.2020.9442246","DOIUrl":null,"url":null,"abstract":"In the context of the continuous development of Internet of Things (IoT) technology and Machine learning (ML) technology, its application in the medical field is becoming more and more extensive. However, with a dramatic increase in medical data obtained from the IoT-based medical auxiliary diagnosis system, the impact of label noise problems is also increasing. When training a machine learning algorithm for a supervised-learning task in some clinical applications, uncertainty in the labels of some patients may adversely affect the performance of the algorithm. For example, due to ambiguous patient conditions or poor reliability of diagnostic criteria, even clinical experts may lack confidence in making medical diagnoses for some patients. As a result, some samples used in algorithm training may be mislabeled, which adversely affects the performance of the algorithm. In this paper, we study a classification problem of sample labels with random damage. We propose a new hybrid label noise correction model that generalizes many learning problems, including supervised, unsupervised and semi-supervised learning. This hybrid model can withstand the negative effects of random noise and various non-random label noise. Extensive experimental results using real-world datasets from UCI machine learning repository are provided, the empirical study shows that our approach successfully improves data quality in many cases, in terms of classification accuracy, over existing label noise correction methods.","PeriodicalId":185948,"journal":{"name":"2020 IEEE 18th International Conference on Industrial Informatics (INDIN)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 18th International Conference on Industrial Informatics (INDIN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INDIN45582.2020.9442246","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

In the context of the continuous development of Internet of Things (IoT) technology and Machine learning (ML) technology, its application in the medical field is becoming more and more extensive. However, with a dramatic increase in medical data obtained from the IoT-based medical auxiliary diagnosis system, the impact of label noise problems is also increasing. When training a machine learning algorithm for a supervised-learning task in some clinical applications, uncertainty in the labels of some patients may adversely affect the performance of the algorithm. For example, due to ambiguous patient conditions or poor reliability of diagnostic criteria, even clinical experts may lack confidence in making medical diagnoses for some patients. As a result, some samples used in algorithm training may be mislabeled, which adversely affects the performance of the algorithm. In this paper, we study a classification problem of sample labels with random damage. We propose a new hybrid label noise correction model that generalizes many learning problems, including supervised, unsupervised and semi-supervised learning. This hybrid model can withstand the negative effects of random noise and various non-random label noise. Extensive experimental results using real-world datasets from UCI machine learning repository are provided, the empirical study shows that our approach successfully improves data quality in many cases, in terms of classification accuracy, over existing label noise correction methods.
医疗辅助诊断的混合标签噪声校正算法
在物联网(IoT)技术和机器学习(ML)技术不断发展的背景下,其在医疗领域的应用也越来越广泛。然而,随着基于物联网的医疗辅助诊断系统获得的医疗数据的急剧增加,标签噪声问题的影响也越来越大。在一些临床应用中,当训练机器学习算法进行监督学习任务时,一些患者标签的不确定性可能会对算法的性能产生不利影响。例如,由于患者病情不明确或诊断标准可靠性差,即使是临床专家也可能对某些患者的医学诊断缺乏信心。因此,在算法训练中使用的一些样本可能会被错误标记,从而对算法的性能产生不利影响。本文研究了具有随机损伤的样本标签的分类问题。我们提出了一种新的混合标签噪声校正模型,该模型推广了许多学习问题,包括监督学习、无监督学习和半监督学习。该混合模型能够承受随机噪声和各种非随机标签噪声的负面影响。使用UCI机器学习存储库的真实数据集提供了大量的实验结果,实证研究表明,我们的方法在许多情况下成功地提高了数据质量,就分类精度而言,超过了现有的标签噪声校正方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信