Generative AI Enables Medical Image Segmentation in Ultra Low-Data Regimes

medRxiv - Radiology and Imaging Pub Date : 2024-08-23 DOI:10.1101/2024.08.23.24312461

Li Zhang, Basu Jindal, Ahmed Alaa, Robert Weinreb, David Wilson, Eran Segal, James Zou, Pengtao Xie

{"title":"Generative AI Enables Medical Image Segmentation in Ultra Low-Data Regimes","authors":"Li Zhang, Basu Jindal, Ahmed Alaa, Robert Weinreb, David Wilson, Eran Segal, James Zou, Pengtao Xie","doi":"10.1101/2024.08.23.24312461","DOIUrl":null,"url":null,"abstract":"Semantic segmentation of medical images is pivotal in applications like disease diagnosis and treatment planning. While deep learning has excelled in automating this task, a major hurdle is the need for numerous annotated segmentation masks, which are resource-intensive to produce due to the required expertise and time. This scenario often leads to ultra low-data regimes, where annotated images are extremely limited, posing significant challenges for the generalization of conventional deep learning methods on test images. To address this, we introduce a generative deep learning framework, which uniquely generates high-quality paired segmentation masks and medical images, serving as auxiliary data for training robust models in data-scarce environments. Unlike traditional generative models that treat data generation and segmentation model training as separate processes, our method employs multi-level optimization for end-to-end data generation. This approach allows segmentation performance to directly influence the data generation process, ensuring that the generated data is specifically tailored to enhance the performance of the segmentation model. Our method demonstrated strong generalization performance across 9 diverse medical image segmentation tasks and on 16 datasets, in ultra-low data regimes, spanning various diseases, organs, and imaging modalities. When applied to various segmentation models, it achieved performance improvements of 10-20\\% (absolute), in both same-domain and out-of-domain scenarios. Notably, it requires 8 to 20 times less training data than existing methods to achieve comparable results. This advancement significantly improves the feasibility and cost-effectiveness of applying deep learning in medical imaging, particularly in scenarios with limited data availability.","PeriodicalId":501358,"journal":{"name":"medRxiv - Radiology and Imaging","volume":"18 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"medRxiv - Radiology and Imaging","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2024.08.23.24312461","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Semantic segmentation of medical images is pivotal in applications like disease diagnosis and treatment planning. While deep learning has excelled in automating this task, a major hurdle is the need for numerous annotated segmentation masks, which are resource-intensive to produce due to the required expertise and time. This scenario often leads to ultra low-data regimes, where annotated images are extremely limited, posing significant challenges for the generalization of conventional deep learning methods on test images. To address this, we introduce a generative deep learning framework, which uniquely generates high-quality paired segmentation masks and medical images, serving as auxiliary data for training robust models in data-scarce environments. Unlike traditional generative models that treat data generation and segmentation model training as separate processes, our method employs multi-level optimization for end-to-end data generation. This approach allows segmentation performance to directly influence the data generation process, ensuring that the generated data is specifically tailored to enhance the performance of the segmentation model. Our method demonstrated strong generalization performance across 9 diverse medical image segmentation tasks and on 16 datasets, in ultra-low data regimes, spanning various diseases, organs, and imaging modalities. When applied to various segmentation models, it achieved performance improvements of 10-20\% (absolute), in both same-domain and out-of-domain scenarios. Notably, it requires 8 to 20 times less training data than existing methods to achieve comparable results. This advancement significantly improves the feasibility and cost-effectiveness of applying deep learning in medical imaging, particularly in scenarios with limited data availability.

查看原文本刊更多论文

生成式人工智能实现超低数据量下的医学图像分割

医学图像的语义分割在疾病诊断和治疗规划等应用中至关重要。虽然深度学习在自动完成这项任务方面表现出色，但一个主要障碍是需要大量带注释的分割掩码，而制作这些掩码需要大量的专业知识和时间，因此是资源密集型的。这种情况往往会导致超低数据状态，在这种状态下，注释图像极其有限，这给传统深度学习方法在测试图像上的泛化带来了巨大挑战。为了解决这个问题，我们引入了一种生成式深度学习框架，它能独特地生成高质量的配对分割掩码和医学图像，作为在数据稀缺环境中训练稳健模型的辅助数据。传统的生成式模型将数据生成和分割模型训练视为两个独立的过程，而我们的方法则不同，它采用多层次优化来实现端到端的数据生成。这种方法允许分割性能直接影响数据生成过程，确保生成的数据专门用于提高分割模型的性能。我们的方法在 9 种不同的医学影像分割任务和 16 个数据集上，在超低数据量条件下，在跨越各种疾病、器官和成像模式的情况下，都表现出了很强的泛化性能。当应用于各种分割模型时，该方法在同域和域外场景中的性能提高了10-20%（绝对值）。值得注意的是，它所需的训练数据比现有方法少 8 到 20 倍，就能获得类似的结果。这一进步大大提高了在医学成像中应用深度学习的可行性和成本效益，尤其是在数据可用性有限的场景中。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

medRxiv - Radiology and Imaging

自引率

0.00%

发文量