Voxel-wise Medical Images Generalization for Eliminating Distribution Shift

IF 4.8 3区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

ACM Transactions on Knowledge Discovery from Data Pub Date : 2024-01-25 DOI:10.1145/3643034

Feifei Li, Yuanbin Wang, Oya Beyan, Mirjam Schöneck, Liliana Lourenco Caldeira

{"title":"Voxel-wise Medical Images Generalization for Eliminating Distribution Shift","authors":"Feifei Li, Yuanbin Wang, Oya Beyan, Mirjam Schöneck, Liliana Lourenco Caldeira","doi":"10.1145/3643034","DOIUrl":null,"url":null,"abstract":"<p>Nowadays, more and more machine learning methods are applied in the medical domain. Supervised Learning methods adopted in classification, prediction, and segmentation tasks for medical images always experience decreased performance when the training and testing datasets do not follow the i.i.d(independent and identically distributed) assumption. These distribution shift situations seriously influence machine learning applications’ robustness, fairness, and trustworthiness in the medical domain. Hence, in this paper, we adopt the CycleGAN(Generative Adversarial Networks) method to cycle train the CT(Computer Tomography) data from different scanners/manufacturers, which aims to eliminate the distribution shift from diverse data terminals, on the basis of our previous work[14]. However, due to the model collapse problem and generative mechanisms of the GAN-based model, the images we generated contained serious artifacts. To remove the boundary marks and artifacts, we adopt score-based diffusion generative models to refine the images voxel-wisely. This innovative combination of two generative models enhances the quality of data providers while maintaining significant features. Meanwhile, we use five paired patients’ medical images to deal with the evaluation experiments with SSIM(structural similarity index measure) metrics and the segmentation model’s performance comparison. We conclude that CycleGAN can be utilized as an efficient data augmentation technique rather than a distribution-shift-eliminating method. While the denoising diffusion model is more suitable for dealing with the distribution shift problem aroused by the different terminal modules. In addition, another limitation of generative methods applied in medical images is the difficulty in obtaining large and diverse datasets that accurately capture the complexity of biological structure and variability. In future works, we will evaluate the original and generative datasets by experimenting with a broader range of supervised methods. We will implement the generative methods under the federated learning architecture, which can preserve their benefits and eliminate the distribution shift problem in a broader range.</p>","PeriodicalId":49249,"journal":{"name":"ACM Transactions on Knowledge Discovery from Data","volume":"1 1","pages":""},"PeriodicalIF":4.8000,"publicationDate":"2024-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Transactions on Knowledge Discovery from Data","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1145/3643034","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Nowadays, more and more machine learning methods are applied in the medical domain. Supervised Learning methods adopted in classification, prediction, and segmentation tasks for medical images always experience decreased performance when the training and testing datasets do not follow the i.i.d(independent and identically distributed) assumption. These distribution shift situations seriously influence machine learning applications’ robustness, fairness, and trustworthiness in the medical domain. Hence, in this paper, we adopt the CycleGAN(Generative Adversarial Networks) method to cycle train the CT(Computer Tomography) data from different scanners/manufacturers, which aims to eliminate the distribution shift from diverse data terminals, on the basis of our previous work[14]. However, due to the model collapse problem and generative mechanisms of the GAN-based model, the images we generated contained serious artifacts. To remove the boundary marks and artifacts, we adopt score-based diffusion generative models to refine the images voxel-wisely. This innovative combination of two generative models enhances the quality of data providers while maintaining significant features. Meanwhile, we use five paired patients’ medical images to deal with the evaluation experiments with SSIM(structural similarity index measure) metrics and the segmentation model’s performance comparison. We conclude that CycleGAN can be utilized as an efficient data augmentation technique rather than a distribution-shift-eliminating method. While the denoising diffusion model is more suitable for dealing with the distribution shift problem aroused by the different terminal modules. In addition, another limitation of generative methods applied in medical images is the difficulty in obtaining large and diverse datasets that accurately capture the complexity of biological structure and variability. In future works, we will evaluate the original and generative datasets by experimenting with a broader range of supervised methods. We will implement the generative methods under the federated learning architecture, which can preserve their benefits and eliminate the distribution shift problem in a broader range.

查看原文本刊更多论文

消除分布偏移的体素医学影像泛化技术

如今，越来越多的机器学习方法被应用于医学领域。当训练数据集和测试数据集不符合 i.i.d（独立且同分布）假设时，用于医学图像分类、预测和分割任务的监督学习方法总是会出现性能下降的情况。这些分布偏移情况严重影响了机器学习应用在医学领域的鲁棒性、公平性和可信度。因此，本文在前期工作[14]的基础上，采用CycleGAN（生成对抗网络）方法对来自不同扫描仪/制造商的CT（计算机断层扫描）数据进行循环训练，旨在消除来自不同数据终端的分布偏移。然而，由于基于 GAN 模型的模型崩溃问题和生成机制，我们生成的图像包含严重的伪影。为了去除边界标记和伪影，我们采用了基于分数的扩散生成模型来对图像进行体素细化。这种将两种生成模型相结合的创新方法既提高了数据提供者的质量，又保留了重要特征。同时，我们使用五张配对的患者医学图像进行了 SSIM（结构相似性指数测量）指标的评估实验和分割模型的性能比较。我们得出结论：CycleGAN 可以作为一种高效的数据增强技术，而不是一种消除分布偏移的方法。而去噪扩散模型更适合处理不同终端模块引起的分布偏移问题。此外，生成方法在医学影像中应用的另一个局限是难以获得能准确捕捉生物结构和变异性复杂性的大型多样化数据集。在未来的工作中，我们将通过试验更广泛的监督方法来评估原始数据集和生成数据集。我们将在联合学习架构下实施生成方法，这样可以在更大范围内保留其优点并消除分布偏移问题。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

ACM Transactions on Knowledge Discovery from Data COMPUTER SCIENCE, INFORMATION SYSTEMS-COMPUTER SCIENCE, SOFTWARE ENGINEERING

CiteScore

6.70

自引率

5.60%

发文量

172

审稿时长

3 months

期刊介绍： TKDD welcomes papers on a full range of research in the knowledge discovery and analysis of diverse forms of data. Such subjects include, but are not limited to: scalable and effective algorithms for data mining and big data analysis, mining brain networks, mining data streams, mining multi-media data, mining high-dimensional data, mining text, Web, and semi-structured data, mining spatial and temporal data, data mining for community generation, social network analysis, and graph structured data, security and privacy issues in data mining, visual, interactive and online data mining, pre-processing and post-processing for data mining, robust and scalable statistical methods, data mining languages, foundations of data mining, KDD framework and process, and novel applications and infrastructures exploiting data mining technology including massively parallel processing and cloud computing platforms. TKDD encourages papers that explore the above subjects in the context of large distributed networks of computers, parallel or multiprocessing computers, or new data devices. TKDD also encourages papers that describe emerging data mining applications that cannot be satisfied by the current data mining technology.