Semi-supervised Machine Learning with MixMatch and Equivalence Classes.

Lecture notes-monograph series Pub Date : 2020-01-01 Epub Date: 2020-10-02
Colin B Hansen, Vishwesh Nath, Riqiang Gao, Camilo Bermudez, Yuankai Huo, Kim L Sandler, Pierre P Massion, Jeffrey D Blume, Thomas A Lasko, Bennett A Landman
{"title":"Semi-supervised Machine Learning with MixMatch and Equivalence Classes.","authors":"Colin B Hansen,&nbsp;Vishwesh Nath,&nbsp;Riqiang Gao,&nbsp;Camilo Bermudez,&nbsp;Yuankai Huo,&nbsp;Kim L Sandler,&nbsp;Pierre P Massion,&nbsp;Jeffrey D Blume,&nbsp;Thomas A Lasko,&nbsp;Bennett A Landman","doi":"","DOIUrl":null,"url":null,"abstract":"<p><p>Semi-supervised methods have an increasing impact on computer vision tasks to make use of scarce labels on large datasets, yet these approaches have not been well translated to medical imaging. Of particular interest, the MixMatch method achieves significant performance improvement over popular semi-supervised learning methods with scarce labels in the CIFAR-10 dataset. In a complementary approach, Nullspace Tuning on equivalence classes offers the potential to leverage multiple subject scans when the ground truth for the subject is unknown. This work is the first to (1) explore MixMatch with Nullspace Tuning in the context of medical imaging and (2) characterize the impacts of the methods with diminishing labels. We consider two distinct medical imaging domains: skin lesion diagnosis and lung cancer prediction. In both cases we evaluate models trained with diminishing labeled data using supervised, MixMatch, and Nullspace Tuning methods as well as MixMatch with Nullspace Tuning together. MixMatch with Nullspace Tuning together is able to achieve an AUC of 0.755 in lung cancer diagnosis with only 200 labeled subjects on the National Lung Screening Trial and a balanced multi-class accuracy of 77% with only 779 labeled examples on HAM10000. This performance is similar to that of the fully supervised methods when all labels are available. In advancing data driven methods in medical imaging, it is important to consider the use of current state-of-the-art semi-supervised learning methods from the greater machine learning community and their impact on the limitations of data acquisition and annotation.</p>","PeriodicalId":93329,"journal":{"name":"Lecture notes-monograph series","volume":"12446 ","pages":"112-121"},"PeriodicalIF":0.0000,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8388309/pdf/nihms-1647070.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Lecture notes-monograph series","FirstCategoryId":"1085","ListUrlMain":"","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2020/10/2 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Semi-supervised methods have an increasing impact on computer vision tasks to make use of scarce labels on large datasets, yet these approaches have not been well translated to medical imaging. Of particular interest, the MixMatch method achieves significant performance improvement over popular semi-supervised learning methods with scarce labels in the CIFAR-10 dataset. In a complementary approach, Nullspace Tuning on equivalence classes offers the potential to leverage multiple subject scans when the ground truth for the subject is unknown. This work is the first to (1) explore MixMatch with Nullspace Tuning in the context of medical imaging and (2) characterize the impacts of the methods with diminishing labels. We consider two distinct medical imaging domains: skin lesion diagnosis and lung cancer prediction. In both cases we evaluate models trained with diminishing labeled data using supervised, MixMatch, and Nullspace Tuning methods as well as MixMatch with Nullspace Tuning together. MixMatch with Nullspace Tuning together is able to achieve an AUC of 0.755 in lung cancer diagnosis with only 200 labeled subjects on the National Lung Screening Trial and a balanced multi-class accuracy of 77% with only 779 labeled examples on HAM10000. This performance is similar to that of the fully supervised methods when all labels are available. In advancing data driven methods in medical imaging, it is important to consider the use of current state-of-the-art semi-supervised learning methods from the greater machine learning community and their impact on the limitations of data acquisition and annotation.

半监督机器学习与MixMatch和等价类。
半监督方法对计算机视觉任务的影响越来越大,以利用大型数据集上的稀缺标签,但这些方法尚未很好地转化为医学成像。特别有趣的是,MixMatch方法比流行的CIFAR-10数据集中具有稀缺标签的半监督学习方法实现了显着的性能改进。在一种补充方法中,对等价类的Nullspace调优提供了在主题的基本真相未知时利用多个主题扫描的潜力。这项工作是第一个(1)探索混合匹配与零空间调谐在医学成像的背景下,(2)表征与递减标签的方法的影响。我们考虑两个不同的医学成像领域:皮肤病变诊断和肺癌预测。在这两种情况下,我们使用监督、MixMatch和Nullspace调优方法以及混合匹配和Nullspace调优一起评估用递减标记数据训练的模型。MixMatch和Nullspace Tuning结合在一起,在国家肺筛查试验中,只有200个标记的受试者,肺癌诊断的AUC为0.755;在HAM10000上,只有779个标记的样本,平衡的多类准确率为77%。这种性能类似于所有标签可用时的完全监督方法。在医学成像中推进数据驱动方法的过程中,重要的是要考虑使用来自更大的机器学习社区的当前最先进的半监督学习方法,以及它们对数据采集和注释局限性的影响。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信