Generalizable Magnetic Resonance Imaging-based Nasopharyngeal Carcinoma Delineation: Bridging Gaps Across Multiple Centers and Raters With Active Learning.
{"title":"Generalizable Magnetic Resonance Imaging-based Nasopharyngeal Carcinoma Delineation: Bridging Gaps Across Multiple Centers and Raters With Active Learning.","authors":"Xiangde Luo, Hongqiu Wang, Jinfeng Xu, Lu Li, Yue Zhao, Yuan He, Hui Huang, Jianghong Xiao, Tao Song, Shichuan Zhang, Shaoting Zhang, Guotai Wang, Wenjun Liao","doi":"10.1016/j.ijrobp.2024.11.064","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>To develop a deep learning method exploiting active learning and source-free domain adaptation for gross tumor volume delineation in nasopharyngeal carcinoma (NPC), addressing the variability and inaccuracy when deploying segmentation models in multicenter and multirater settings.</p><p><strong>Methods and materials: </strong>One thousand fifty-seven magnetic resonance imaging scans of patients with NPC from 5 hospitals were retrospectively collected and annotated by experts from the same medical group with consensus for multicenter adaptation evaluation. One data set was used for model development (source domain), with the remaining 4 for adaptation testing (target domains). Meanwhile, another set of 170 patients with NPC, with annotations delineated by 4 independent experts, was created for multirater adaptation evaluation. We evaluated the pretrained model's migration ability to the 4 multicenter and 4 multirater target domains. Dice similarity coefficient (DSC), 95% Hausdorff distance (HD95), and other metrics were used for quantitative evaluations.</p><p><strong>Results: </strong>In the adaptation of dataset5 to other data sets, our source-free active learning adaptation method only requires limited labeled target samples (only 20%) to achieve a median DSC ranging from 0.70 to 0.86 and a median HD95 ranging from 3.16 to 7.21 mm for 4 target centers, and 0.78 to 0.85 and 3.64 to 6.00 mm for 4 multirater data sets. For DSC, our results for 3 of 4 multicenter data sets and all multirater data sets showed no statistical difference compared to the fully supervised U-Net model (P values > 0.05) and significantly surpassed comparison models for 3 multicenter data sets and all multirater data sets (P values < 0.05). Clinical assessment showed that our method-generated delineations can be used both in multicenter and multirater scenarios after minor refinement (revision ratio <10% and median time <2 minutes).</p><p><strong>Conclusions: </strong>The proposed method effectively minimizes domain gaps and delivers encouraging performance compared with fully supervised learning models with limited labeled training samples, offering a promising and practical solution for accurate and generalizable gross tumor volume segmentation in NPC.</p>","PeriodicalId":14215,"journal":{"name":"International Journal of Radiation Oncology Biology Physics","volume":" ","pages":""},"PeriodicalIF":6.4000,"publicationDate":"2024-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Radiation Oncology Biology Physics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.ijrobp.2024.11.064","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose: To develop a deep learning method exploiting active learning and source-free domain adaptation for gross tumor volume delineation in nasopharyngeal carcinoma (NPC), addressing the variability and inaccuracy when deploying segmentation models in multicenter and multirater settings.
Methods and materials: One thousand fifty-seven magnetic resonance imaging scans of patients with NPC from 5 hospitals were retrospectively collected and annotated by experts from the same medical group with consensus for multicenter adaptation evaluation. One data set was used for model development (source domain), with the remaining 4 for adaptation testing (target domains). Meanwhile, another set of 170 patients with NPC, with annotations delineated by 4 independent experts, was created for multirater adaptation evaluation. We evaluated the pretrained model's migration ability to the 4 multicenter and 4 multirater target domains. Dice similarity coefficient (DSC), 95% Hausdorff distance (HD95), and other metrics were used for quantitative evaluations.
Results: In the adaptation of dataset5 to other data sets, our source-free active learning adaptation method only requires limited labeled target samples (only 20%) to achieve a median DSC ranging from 0.70 to 0.86 and a median HD95 ranging from 3.16 to 7.21 mm for 4 target centers, and 0.78 to 0.85 and 3.64 to 6.00 mm for 4 multirater data sets. For DSC, our results for 3 of 4 multicenter data sets and all multirater data sets showed no statistical difference compared to the fully supervised U-Net model (P values > 0.05) and significantly surpassed comparison models for 3 multicenter data sets and all multirater data sets (P values < 0.05). Clinical assessment showed that our method-generated delineations can be used both in multicenter and multirater scenarios after minor refinement (revision ratio <10% and median time <2 minutes).
Conclusions: The proposed method effectively minimizes domain gaps and delivers encouraging performance compared with fully supervised learning models with limited labeled training samples, offering a promising and practical solution for accurate and generalizable gross tumor volume segmentation in NPC.
期刊介绍:
International Journal of Radiation Oncology • Biology • Physics (IJROBP), known in the field as the Red Journal, publishes original laboratory and clinical investigations related to radiation oncology, radiation biology, medical physics, and both education and health policy as it relates to the field.
This journal has a particular interest in original contributions of the following types: prospective clinical trials, outcomes research, and large database interrogation. In addition, it seeks reports of high-impact innovations in single or combined modality treatment, tumor sensitization, normal tissue protection (including both precision avoidance and pharmacologic means), brachytherapy, particle irradiation, and cancer imaging. Technical advances related to dosimetry and conformal radiation treatment planning are of interest, as are basic science studies investigating tumor physiology and the molecular biology underlying cancer and normal tissue radiation response.