How much data do you need? An analysis of pelvic multi-organ segmentation in a limited data context.

IF 2.4 4区医学 Q3 ENGINEERING, BIOMEDICAL

Physical and Engineering Sciences in Medicine Pub Date : 2025-03-01 Epub Date: 2025-03-11 DOI:10.1007/s13246-024-01514-w

Febrio Lunardo, Laura Baker, Alex Tan, John Baines, Timothy Squire, Jason A Dowling, Mostafa Rahimi Azghadi, Ashley G Gillman

{"title":"How much data do you need? An analysis of pelvic multi-organ segmentation in a limited data context.","authors":"Febrio Lunardo, Laura Baker, Alex Tan, John Baines, Timothy Squire, Jason A Dowling, Mostafa Rahimi Azghadi, Ashley G Gillman","doi":"10.1007/s13246-024-01514-w","DOIUrl":null,"url":null,"abstract":"<p><p>Training deep learning models generally requires large, costly datasets which can limit their application towards in-house segmentation tasks. This study investigates the trade-off in dataset size within the context of pelvic multi-organ MR segmentation where we evaluate the performance of nnU-Net, a well-known segmentation model, under conditions of limited domain and data availability. 12 participants undergoing treatment on an Elekta Unity were recruited, acquiring 58 MR images, with 4 participants (12 images) withheld for testing. Prostate, seminal vesicles (SV), bladder and rectum were contoured in each image by a radiation oncologist. Seven models were trained on progressively smaller subsets of the training dataset, simulating a limited dataset setting. To investigate the efficacy of data augmentation, another set of identical models were trained without augmentation. The performance of the networks was evaluated via the Dice Similarity Coefficient, mean surface distance, and 95% Hausdorff distance metrics. When trained with entire training dataset (46 images), the model achieved a mean Dice coefficient of 0.903 (Prostate), 0.851 (SV), 0.884 (Rectum) and 0.967 (Bladder). Segmentation performance remained stable when the number of training sets was > 12 images from 4 participants, but rapidly dropped in smaller data subsets. Data augmentation was found to be influential across all dataset sizes, but especially in very small datasets. This study demonstrated nnU-Net's proficiency in performing male pelvic multi-organ segmentation under a limited domain, a single scanner, and under limited data constraints. We found that the performance degradation was often modest until a threshold is reached (12 images), below which it dropped significantly. Data augmentation improved performance across all data sizes, but especially for very small datasets. We conclude that nnU-Net's low data requirement can be advantageous for in-house cases with consistent protocol and scarce data availability.</p>","PeriodicalId":48490,"journal":{"name":"Physical and Engineering Sciences in Medicine","volume":" ","pages":"409-419"},"PeriodicalIF":2.4000,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11996946/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Physical and Engineering Sciences in Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s13246-024-01514-w","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/11 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}

引用次数: 0

Abstract

Training deep learning models generally requires large, costly datasets which can limit their application towards in-house segmentation tasks. This study investigates the trade-off in dataset size within the context of pelvic multi-organ MR segmentation where we evaluate the performance of nnU-Net, a well-known segmentation model, under conditions of limited domain and data availability. 12 participants undergoing treatment on an Elekta Unity were recruited, acquiring 58 MR images, with 4 participants (12 images) withheld for testing. Prostate, seminal vesicles (SV), bladder and rectum were contoured in each image by a radiation oncologist. Seven models were trained on progressively smaller subsets of the training dataset, simulating a limited dataset setting. To investigate the efficacy of data augmentation, another set of identical models were trained without augmentation. The performance of the networks was evaluated via the Dice Similarity Coefficient, mean surface distance, and 95% Hausdorff distance metrics. When trained with entire training dataset (46 images), the model achieved a mean Dice coefficient of 0.903 (Prostate), 0.851 (SV), 0.884 (Rectum) and 0.967 (Bladder). Segmentation performance remained stable when the number of training sets was > 12 images from 4 participants, but rapidly dropped in smaller data subsets. Data augmentation was found to be influential across all dataset sizes, but especially in very small datasets. This study demonstrated nnU-Net's proficiency in performing male pelvic multi-organ segmentation under a limited domain, a single scanner, and under limited data constraints. We found that the performance degradation was often modest until a threshold is reached (12 images), below which it dropped significantly. Data augmentation improved performance across all data sizes, but especially for very small datasets. We conclude that nnU-Net's low data requirement can be advantageous for in-house cases with consistent protocol and scarce data availability.

查看原文本刊更多论文

您需要多少数据？对有限数据背景下骨盆多器官分割的分析。

训练深度学习模型通常需要庞大、昂贵的数据集，这可能会限制它们在内部分割任务中的应用。本研究调查了骨盆多器官MR分割背景下数据集大小的权衡，在此背景下，我们评估了nnU-Net（一个著名的分割模型）在有限域和数据可用性条件下的性能。招募了12名接受Elekta Unity治疗的参与者，获得了58张MR图像，其中4名参与者（12张图像）保留用于测试。前列腺、精囊（SV）、膀胱和直肠均由放射肿瘤学家绘制。七个模型在训练数据集的逐渐较小的子集上进行训练，模拟有限的数据集设置。为了研究数据增强的有效性，我们对另一组相同的模型进行了不增强的训练。网络的性能通过骰子相似系数、平均表面距离和95%豪斯多夫距离指标进行评估。当使用整个训练数据集（46张图像）进行训练时，该模型的平均Dice系数为0.903（前列腺），0.851 (SV), 0.884（直肠）和0.967（膀胱）。当训练集的数量为4个参与者的12张图像时，分割性能保持稳定，但在较小的数据子集中迅速下降。发现数据扩充对所有数据集大小都有影响，但在非常小的数据集中尤其如此。本研究证明了nnU-Net在有限的领域、单个扫描仪和有限的数据约束下熟练地进行男性盆腔多器官分割。我们发现，在达到一个阈值（12个图像）之前，性能的下降通常是适度的，低于这个阈值，性能就会显著下降。数据增强提高了所有数据大小的性能，特别是对于非常小的数据集。我们得出结论，nnU-Net的低数据需求对于具有一致协议和稀缺数据可用性的内部案例是有利的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Physical and Engineering Sciences in Medicine Multiple-

CiteScore

8.40

自引率

4.50%

发文量

110