Exploring the Trade-Off between generalist and specialized Models: A center-based comparative analysis for glioblastoma segmentation

IF 3.7 2区医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

International Journal of Medical Informatics Pub Date : 2024-08-15 DOI:10.1016/j.ijmedinf.2024.105604

F. Javier Gil-Terrón , Pablo Ferri , Víctor Montosa-i-Micó , María Gómez Mahiques , Carles Lopez-Mateu , Pau Martí , Juan M. García-Gómez , Elies Fuster-Garcia

{"title":"Exploring the Trade-Off between generalist and specialized Models: A center-based comparative analysis for glioblastoma segmentation","authors":"F. Javier Gil-Terrón , Pablo Ferri , Víctor Montosa-i-Micó , María Gómez Mahiques , Carles Lopez-Mateu , Pau Martí , Juan M. García-Gómez , Elies Fuster-Garcia","doi":"10.1016/j.ijmedinf.2024.105604","DOIUrl":null,"url":null,"abstract":"<div><h3>Introduction</h3><p>Inherent variations between inter-center data can undermine the robustness of segmentation models when applied at a specific center (dataset shift). We investigated whether specialized center-specific models are more effective compared to generalist models based on multi-center data, and how center-specific data could enhance the performance of generalist models within a particular center using a fine-tuning transfer learning approach. For this purpose, we studied the dataset shift at center level and conducted a comparative analysis to assess the impact of data source on glioblastoma segmentation models.</p></div><div><h3>Methods & Materials</h3><p>The three key components of dataset shift were studied: prior probability shift—variations in tumor size or tissue distribution among centers; covariate shift—inter-center MRI alterations; and concept shift—different criteria for tumor segmentation. BraTS 2021 dataset was used, which includes 1251 cases from 23 centers. Thereafter, 155 deep-learning models were developed and compared, including 1) generalist models trained with multi-center data, 2) specialized models using only center-specific data, and 3) fine-tuned generalist models using center-specific data.</p></div><div><h3>Results</h3><p>The three key components of dataset shift were characterized. The amount of covariate shift was substantial, indicating large variations in MR imaging between different centers. Glioblastoma segmentation models tend to perform best when using data from the application center. Generalist models, trained with over 700 samples, achieved a median Dice score of 88.98%. Specialized models surpassed this with 200 cases, while fine-tuned models outperformed with 50 cases.</p></div><div><h3>Conclusions</h3><p>The influence of dataset shift on model performance is evident. Fine-tuned and specialized models, utilizing data from the evaluated center, outperform generalist models, which rely on data from other centers. These approaches could encourage medical centers to develop customized models for their local use, enhancing the accuracy and reliability of glioblastoma segmentation in a context where dataset shift is inevitable.</p></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":null,"pages":null},"PeriodicalIF":3.7000,"publicationDate":"2024-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1386505624002673/pdfft?md5=35645a5384badcc390d85f2158697875&pid=1-s2.0-S1386505624002673-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Medical Informatics","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1386505624002673","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Introduction

Inherent variations between inter-center data can undermine the robustness of segmentation models when applied at a specific center (dataset shift). We investigated whether specialized center-specific models are more effective compared to generalist models based on multi-center data, and how center-specific data could enhance the performance of generalist models within a particular center using a fine-tuning transfer learning approach. For this purpose, we studied the dataset shift at center level and conducted a comparative analysis to assess the impact of data source on glioblastoma segmentation models.

Methods & Materials

The three key components of dataset shift were studied: prior probability shift—variations in tumor size or tissue distribution among centers; covariate shift—inter-center MRI alterations; and concept shift—different criteria for tumor segmentation. BraTS 2021 dataset was used, which includes 1251 cases from 23 centers. Thereafter, 155 deep-learning models were developed and compared, including 1) generalist models trained with multi-center data, 2) specialized models using only center-specific data, and 3) fine-tuned generalist models using center-specific data.

Results

The three key components of dataset shift were characterized. The amount of covariate shift was substantial, indicating large variations in MR imaging between different centers. Glioblastoma segmentation models tend to perform best when using data from the application center. Generalist models, trained with over 700 samples, achieved a median Dice score of 88.98%. Specialized models surpassed this with 200 cases, while fine-tuned models outperformed with 50 cases.

Conclusions

The influence of dataset shift on model performance is evident. Fine-tuned and specialized models, utilizing data from the evaluated center, outperform generalist models, which rely on data from other centers. These approaches could encourage medical centers to develop customized models for their local use, enhancing the accuracy and reliability of glioblastoma segmentation in a context where dataset shift is inevitable.

查看原文本刊更多论文

探索通用模型与专用模型之间的权衡：基于中心的胶质母细胞瘤分割比较分析

简介当应用于特定中心（数据集转移）时，中心间数据的固有差异会削弱分割模型的稳健性。我们研究了特定中心的专业模型是否比基于多中心数据的通用模型更有效，以及特定中心的数据如何利用微调迁移学习方法提高特定中心内通用模型的性能。为此，我们研究了中心层面的数据集转移，并进行了比较分析，以评估数据源对胶质母细胞瘤分割模型的影响：我们研究了数据集移动的三个关键组成部分：先验概率移动--中心间肿瘤大小或组织分布的变化；协变量移动--中心间核磁共振成像的改变；概念移动--肿瘤分割标准的不同。BraTS 2021 数据集包括来自 23 个中心的 1251 个病例。之后，开发了155个深度学习模型并进行了比较，其中包括：1）使用多中心数据训练的通用模型；2）仅使用特定中心数据的专业模型；3）使用特定中心数据的微调通用模型：结果：数据集偏移的三个关键部分都有特征。协变量偏移量很大，表明不同中心之间的磁共振成像差异很大。胶质母细胞瘤分割模型在使用应用中心的数据时往往表现最佳。使用 700 多个样本训练的通用模型的中位 Dice 得分为 88.98%。专业模型在使用 200 个案例时超过了这一水平，而微调模型在使用 50 个案例时表现更好：结论：数据集转移对模型性能的影响显而易见。结论：数据集转移对模型性能的影响显而易见。利用被评估中心数据的微调模型和专业模型优于依靠其他中心数据的通用模型。这些方法可以鼓励医疗中心开发适合本地使用的定制模型，在数据集迁移不可避免的情况下提高胶质母细胞瘤分割的准确性和可靠性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Journal of Medical Informatics 医学-计算机：信息系统

CiteScore

8.90

自引率

4.10%

发文量

217

审稿时长

42 days

期刊介绍： International Journal of Medical Informatics provides an international medium for dissemination of original results and interpretative reviews concerning the field of medical informatics. The Journal emphasizes the evaluation of systems in healthcare settings. The scope of journal covers: Information systems, including national or international registration systems, hospital information systems, departmental and/or physician''s office systems, document handling systems, electronic medical record systems, standardization, systems integration etc.; Computer-aided medical decision support systems using heuristic, algorithmic and/or statistical methods as exemplified in decision theory, protocol development, artificial intelligence, etc. Educational computer based programs pertaining to medical informatics or medicine in general; Organizational, economic, social, clinical impact, ethical and cost-benefit aspects of IT applications in health care.