External validation of a multimodality deep-learning normal tissue complication probability model for mandibular osteoradionecrosis trained on 3D radiation distribution maps and clinical variables

IF 3.4 Q2 ONCOLOGY

Physics and Imaging in Radiation Oncology Pub Date : 2024-10-01 DOI:10.1016/j.phro.2024.100668

Laia Humbert-Vidan , Christian R. Hansen , Vinod Patel , Jørgen Johansen , Andrew P. King , Teresa Guerrero Urbano

{"title":"External validation of a multimodality deep-learning normal tissue complication probability model for mandibular osteoradionecrosis trained on 3D radiation distribution maps and clinical variables","authors":"Laia Humbert-Vidan , Christian R. Hansen , Vinod Patel , Jørgen Johansen , Andrew P. King , Teresa Guerrero Urbano","doi":"10.1016/j.phro.2024.100668","DOIUrl":null,"url":null,"abstract":"<div><h3>Background and purpose</h3><div>While the inclusion of spatial dose information in deep learning (DL)-based normal-tissue complication probability (NTCP) models has been the focus of recent research studies, external validation is still lacking. This study aimed to externally validate a DL-based NTCP model for mandibular osteoradionecrosis (ORN) trained on 3D radiation dose distribution maps and clinical variables.</div></div><div><h3>Methods and materials</h3><div>A 3D DenseNet-40 convolutional neural network (3D-mDN40) was trained on clinical and radiation dose distribution maps on a retrospective class-balanced matched cohort of 184 subjects. A second model (3D-DN40) was trained on dose maps only and both DL models were compared to a logistic regression (LR) model trained on DVH metrics and clinical variables. All models were externally validated by means of their discriminative ability and calibration on an independent dataset of 82 subjects.</div></div><div><h3>Results</h3><div>No significant difference in performance was observed between models. In internal validation, these exhibited similar Brier scores around 0.2, Log Loss values of 0.6–0.7 and ROC AUC values around 0.7 (internal) and 0.6 (external). Differences in clinical variable distributions and their effect sizes were observed between internal and external cohorts, such as smoking status (0.6 vs. 0.1) and chemotherapy (0.1 vs. −0.5), respectively.</div></div><div><h3>Conclusion</h3><div>To our knowledge, this is the first study to externally validate a multimodality DL-based ORN NTCP model. Utilising mandible dose distribution maps, these models show promise for enhancing spatial risk assessment and guiding dental and oncological decision-making, though further research is essential to address overfitting and domain shift for reliable clinical use.</div></div>","PeriodicalId":36850,"journal":{"name":"Physics and Imaging in Radiation Oncology","volume":"32 ","pages":"Article 100668"},"PeriodicalIF":3.4000,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Physics and Imaging in Radiation Oncology","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2405631624001386","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Background and purpose

While the inclusion of spatial dose information in deep learning (DL)-based normal-tissue complication probability (NTCP) models has been the focus of recent research studies, external validation is still lacking. This study aimed to externally validate a DL-based NTCP model for mandibular osteoradionecrosis (ORN) trained on 3D radiation dose distribution maps and clinical variables.

Methods and materials

A 3D DenseNet-40 convolutional neural network (3D-mDN40) was trained on clinical and radiation dose distribution maps on a retrospective class-balanced matched cohort of 184 subjects. A second model (3D-DN40) was trained on dose maps only and both DL models were compared to a logistic regression (LR) model trained on DVH metrics and clinical variables. All models were externally validated by means of their discriminative ability and calibration on an independent dataset of 82 subjects.

Results

No significant difference in performance was observed between models. In internal validation, these exhibited similar Brier scores around 0.2, Log Loss values of 0.6–0.7 and ROC AUC values around 0.7 (internal) and 0.6 (external). Differences in clinical variable distributions and their effect sizes were observed between internal and external cohorts, such as smoking status (0.6 vs. 0.1) and chemotherapy (0.1 vs. −0.5), respectively.

Conclusion

To our knowledge, this is the first study to externally validate a multimodality DL-based ORN NTCP model. Utilising mandible dose distribution maps, these models show promise for enhancing spatial risk assessment and guiding dental and oncological decision-making, though further research is essential to address overfitting and domain shift for reliable clinical use.

查看原文本刊更多论文

根据三维辐射分布图和临床变量训练的下颌骨骨坏死多模态深度学习正常组织并发症概率模型的外部验证

背景和目的虽然将空间剂量信息纳入基于深度学习（DL）的正常组织并发症概率（NTCP）模型是近期研究的重点，但仍缺乏外部验证。本研究旨在从外部验证基于三维辐射剂量分布图和临床变量训练的下颌骨骨坏死（ORN）深度学习并发症概率（NTCP）模型。方法和材料在184名受试者的回顾性类平衡匹配队列中，根据临床和辐射剂量分布图训练了一个三维DenseNet-40卷积神经网络（3D-mDN40）。第二个模型（3D-DN40）仅在剂量分布图上进行了训练，两个 DL 模型都与在 DVH 指标和临床变量上训练的逻辑回归 (LR) 模型进行了比较。所有模型都在一个由 82 名受试者组成的独立数据集上通过判别能力和校准进行了外部验证。在内部验证中，这些模型表现出相似的 Brier 分数（0.2 左右）、Log Loss 值（0.6-0.7）和 ROC AUC 值（0.7（内部）和 0.6（外部））。内部和外部队列之间的临床变量分布及其效应大小存在差异，如吸烟状态（0.6 vs. 0.1）和化疗（0.1 vs. -0.5）。利用下颌骨剂量分布图，这些模型有望加强空间风险评估，并指导牙科和肿瘤决策，但为了可靠地应用于临床，解决过拟合和域偏移问题还需要进一步的研究。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊