Prognostic models for radiation-induced complications after radiotherapy in head and neck cancer patients.

IF 8.8 2区 医学 Q1 MEDICINE, GENERAL & INTERNAL
Toshihiko Takada, Makbule Tambas, Enrico Clementel, Artuur Leeuwenberg, Marjan Sharabiani, Johanna Aag Damen, Zoë S Dunias, Jan F Nauta, Demy L Idema, Jungyeon Choi, Lotta M Meijerink, Johannes A Langendijk, Karel Gm Moons, Ewoud Schuit
{"title":"Prognostic models for radiation-induced complications after radiotherapy in head and neck cancer patients.","authors":"Toshihiko Takada, Makbule Tambas, Enrico Clementel, Artuur Leeuwenberg, Marjan Sharabiani, Johanna Aag Damen, Zoë S Dunias, Jan F Nauta, Demy L Idema, Jungyeon Choi, Lotta M Meijerink, Johannes A Langendijk, Karel Gm Moons, Ewoud Schuit","doi":"10.1002/14651858.CD014745.pub2","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Radiotherapy is the mainstay of treatment for head and neck cancer (HNC) but may induce various side effects on surrounding normal tissues. To reach an optimal balance between tumour control and toxicity prevention, normal tissue complication probability (NTCP) models have been reported to predict the risk of radiation-induced side effects in patients with HNC. However, the quality of study design, conduct, and analysis (i.e. risk of bias (ROB)), as well as the predictive performance of these models, remains to be evaluated.</p><p><strong>Objectives: </strong>To identify, describe and appraise NTCP models to predict the risk of radiation-induced side effects in patients with HNC.</p><p><strong>Search methods: </strong>We searched Ovid MEDLINE, Embase and the World Health Organization International Clinical Trials Registry Platform from conception to January 2024. In addition, we screened references cited in the retrieved articles.</p><p><strong>Selection criteria: </strong>Two review authors independently included articles reporting on the development and external validation of NTCP models to predict any type of radiation-induced side effects in patients with HNC.</p><p><strong>Data collection and analysis: </strong>One reviewer extracted data from each article based on the Checklist for Critical Appraisal and Data Extraction for Systematic Reviews of Prediction Modelling Studies and assessed their applicability and ROB using the Prediction model ROB Assessment Tool, while another reviewer carefully verified the results. For models externally validated at least twice for the same outcome as their original developmental study, we performed qualitative analyses of the model performance. The GRADE system was not applied, since it has not been established for reviews of prognostic model studies.</p><p><strong>Main results: </strong>Amongst 592 models developed from 143 articles, including 140,767 HNC patients, only 49 (8%) models from six articles were judged to have low ROB and low concerns for applicability. No external validation was performed for 480 models (81%). For the remaining 112 models and six additional models which were not eligible for the present review, 152 external validations were performed in 34,304 patients with HNC in 41 articles. The results of models externally validated at least twice are discussed below. Models for xerostomia Amongst 275 models for xerostomia, two models were externally validated at least twice. The Beetz 2012b model for xerostomia six months after radiotherapy was validated in two studies. C-statistics ranged from 0.70 to 0.74. Calibration performance was reported in one study. One validation study was rated as having low ROB in all domains, while the other was rated as having high ROB in the analysis domain. The Cavallo 2021 model for acute xerostomia during radiotherapy for patients with nasopharyngeal cancer was externally validated in the same study, using two different types of cohorts. C-statistics ranged from 0.68 to 0.73 and calibration plots were reported in both cohorts. Both validations were rated as having unclear ROB in the participants' domain because no detailed information about recruiting was provided. Models for dysphagia Amongst 86 models for dysphagia, two models were externally validated at least twice. The Christianen 2012 model for dysphagia six months after radiotherapy was validated in five studies. C-statistics ranged from 0.66 to 0.75. Calibration performance was assessed in all of them, while four of them were rated as having high ROB in the analysis domain due to the small sample size. The Wopken 2014b model for tube feeding dependence six months after radiotherapy was validated in three external validation studies. C-statistics ranged from 0.79 to 0.95, while calibration was evaluated in all studies. Due to the small size of the validation datasets, they were judged as having high ROB in the analysis domain. Models for hypothyroidism Of 66 models for hypothyroidism, two models were externally validated at least twice. In addition, there was another model which was not originally developed for patients with HNC, but validated in this domain. The Boomsma 2012 for hypothyroidism within two years after radiotherapy was externally validated in two studies. C-statistics ranged from 0.64 to 0.74, while only one study reported its calibration performance. Both validation studies were rated as having high ROB in the analysis domain. The Ronjom 2013 model for radiation-induced hypothyroidism was validated in three studies. C-statistics ranged from 0.65 to 0.69 and calibration plots were reported in only one study. Two validation studies were judged as having high and the other was rated as having unclear ROB in the analysis domain. The Cella 2012 model was originally developed to predict radiation-induced hypothyroidism in patients with Hodgkin's lymphoma. In two validation studies in patients with HNC, c-statistics ranged from 0.65 to 0.68, but calibration performance was not reported. One validation study was rated as having a high ROB and the other was rated as being unclear in the analysis domain. Models for temporal lobe injury Amongst six models for temporal lobe injury, two were externally validated at least twice. The OuYang 2023 model, using deep learning in patients with nasopharyngeal cancer, was validated in the same paper using two different cohorts. C-statistics ranged from 0.80 to 0.82, while calibration performance was assessed in both cohorts. Both validations were judged as having low ROB in all domains. The Wen 2021 model was developed to predict temporal lobe injury in newly diagnosed nasopharyngeal cancer patients. The model was validated by OuYang 2023 using two cohorts. C-statistics ranged from 0.77 to 0.79, while calibration performance was not reported. Both validations were judged as having unclear ROB in the analysis domain. Models for outcomes related to hoarseness, fatigue, nausea-vomiting, throat pain, aspiration No models were externally validated at least twice.</p><p><strong>Authors' conclusions: </strong>Amongst 592 developed models, a limited number had adequate quality. Only one-fifth were externally validated, of which, only nine models at least twice. These nine models showed acceptable discriminative performance at external validation. However, their calibration performance was not always reported. Furthermore, most validation studies were judged as having high ROB, mainly due to problems in the analysis domain. In conclusion, this review shows the need for more external validation studies before the implementation of developed models in clinical practice and improvement of the quality of conducting and reporting of prediction model studies.</p>","PeriodicalId":10473,"journal":{"name":"Cochrane Database of Systematic Reviews","volume":"9 ","pages":"CD014745"},"PeriodicalIF":8.8000,"publicationDate":"2025-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12421721/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cochrane Database of Systematic Reviews","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1002/14651858.CD014745.pub2","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Radiotherapy is the mainstay of treatment for head and neck cancer (HNC) but may induce various side effects on surrounding normal tissues. To reach an optimal balance between tumour control and toxicity prevention, normal tissue complication probability (NTCP) models have been reported to predict the risk of radiation-induced side effects in patients with HNC. However, the quality of study design, conduct, and analysis (i.e. risk of bias (ROB)), as well as the predictive performance of these models, remains to be evaluated.

Objectives: To identify, describe and appraise NTCP models to predict the risk of radiation-induced side effects in patients with HNC.

Search methods: We searched Ovid MEDLINE, Embase and the World Health Organization International Clinical Trials Registry Platform from conception to January 2024. In addition, we screened references cited in the retrieved articles.

Selection criteria: Two review authors independently included articles reporting on the development and external validation of NTCP models to predict any type of radiation-induced side effects in patients with HNC.

Data collection and analysis: One reviewer extracted data from each article based on the Checklist for Critical Appraisal and Data Extraction for Systematic Reviews of Prediction Modelling Studies and assessed their applicability and ROB using the Prediction model ROB Assessment Tool, while another reviewer carefully verified the results. For models externally validated at least twice for the same outcome as their original developmental study, we performed qualitative analyses of the model performance. The GRADE system was not applied, since it has not been established for reviews of prognostic model studies.

Main results: Amongst 592 models developed from 143 articles, including 140,767 HNC patients, only 49 (8%) models from six articles were judged to have low ROB and low concerns for applicability. No external validation was performed for 480 models (81%). For the remaining 112 models and six additional models which were not eligible for the present review, 152 external validations were performed in 34,304 patients with HNC in 41 articles. The results of models externally validated at least twice are discussed below. Models for xerostomia Amongst 275 models for xerostomia, two models were externally validated at least twice. The Beetz 2012b model for xerostomia six months after radiotherapy was validated in two studies. C-statistics ranged from 0.70 to 0.74. Calibration performance was reported in one study. One validation study was rated as having low ROB in all domains, while the other was rated as having high ROB in the analysis domain. The Cavallo 2021 model for acute xerostomia during radiotherapy for patients with nasopharyngeal cancer was externally validated in the same study, using two different types of cohorts. C-statistics ranged from 0.68 to 0.73 and calibration plots were reported in both cohorts. Both validations were rated as having unclear ROB in the participants' domain because no detailed information about recruiting was provided. Models for dysphagia Amongst 86 models for dysphagia, two models were externally validated at least twice. The Christianen 2012 model for dysphagia six months after radiotherapy was validated in five studies. C-statistics ranged from 0.66 to 0.75. Calibration performance was assessed in all of them, while four of them were rated as having high ROB in the analysis domain due to the small sample size. The Wopken 2014b model for tube feeding dependence six months after radiotherapy was validated in three external validation studies. C-statistics ranged from 0.79 to 0.95, while calibration was evaluated in all studies. Due to the small size of the validation datasets, they were judged as having high ROB in the analysis domain. Models for hypothyroidism Of 66 models for hypothyroidism, two models were externally validated at least twice. In addition, there was another model which was not originally developed for patients with HNC, but validated in this domain. The Boomsma 2012 for hypothyroidism within two years after radiotherapy was externally validated in two studies. C-statistics ranged from 0.64 to 0.74, while only one study reported its calibration performance. Both validation studies were rated as having high ROB in the analysis domain. The Ronjom 2013 model for radiation-induced hypothyroidism was validated in three studies. C-statistics ranged from 0.65 to 0.69 and calibration plots were reported in only one study. Two validation studies were judged as having high and the other was rated as having unclear ROB in the analysis domain. The Cella 2012 model was originally developed to predict radiation-induced hypothyroidism in patients with Hodgkin's lymphoma. In two validation studies in patients with HNC, c-statistics ranged from 0.65 to 0.68, but calibration performance was not reported. One validation study was rated as having a high ROB and the other was rated as being unclear in the analysis domain. Models for temporal lobe injury Amongst six models for temporal lobe injury, two were externally validated at least twice. The OuYang 2023 model, using deep learning in patients with nasopharyngeal cancer, was validated in the same paper using two different cohorts. C-statistics ranged from 0.80 to 0.82, while calibration performance was assessed in both cohorts. Both validations were judged as having low ROB in all domains. The Wen 2021 model was developed to predict temporal lobe injury in newly diagnosed nasopharyngeal cancer patients. The model was validated by OuYang 2023 using two cohorts. C-statistics ranged from 0.77 to 0.79, while calibration performance was not reported. Both validations were judged as having unclear ROB in the analysis domain. Models for outcomes related to hoarseness, fatigue, nausea-vomiting, throat pain, aspiration No models were externally validated at least twice.

Authors' conclusions: Amongst 592 developed models, a limited number had adequate quality. Only one-fifth were externally validated, of which, only nine models at least twice. These nine models showed acceptable discriminative performance at external validation. However, their calibration performance was not always reported. Furthermore, most validation studies were judged as having high ROB, mainly due to problems in the analysis domain. In conclusion, this review shows the need for more external validation studies before the implementation of developed models in clinical practice and improvement of the quality of conducting and reporting of prediction model studies.

头颈癌放疗后放射并发症的预后模型。
背景:放疗是头颈癌(HNC)的主要治疗方法,但可能对周围正常组织产生各种副作用。为了在肿瘤控制和毒性预防之间达到最佳平衡,正常组织并发症概率(NTCP)模型已被报道用于预测HNC患者辐射引起的副作用的风险。然而,研究设计、实施和分析的质量(即偏倚风险(risk of bias, ROB))以及这些模型的预测性能仍有待评估。目的:识别、描述和评估NTCP模型,以预测HNC患者辐射引起的副作用的风险。检索方法:检索Ovid MEDLINE、Embase和世界卫生组织国际临床试验注册平台,检索时间从母体受孕至2024年1月。此外,我们筛选了检索文章中引用的参考文献。选择标准:两位综述作者独立地纳入了报道NTCP模型的发展和外部验证的文章,以预测HNC患者任何类型的辐射引起的副作用。数据收集和分析:一位审稿人根据《预测建模研究系统评价关键评价和数据提取清单》从每篇文章中提取数据,并使用预测模型ROB评估工具评估其适用性和ROB,另一位审稿人仔细验证结果。对于与原始开发研究结果相同的至少两次外部验证的模型,我们对模型性能进行定性分析。GRADE系统没有被应用,因为它还没有被建立来评价预后模型研究。主要结果:在143篇文章的592个模型中,包括140,767例HNC患者,6篇文章中只有49个(8%)模型被认为是低ROB和低关注适用性。480个模型(81%)未进行外部验证。对于剩余的112个模型和另外6个不符合本综述的模型,41篇文章中的34,304例HNC患者进行了152次外部验证。经过至少两次外部验证的模型的结果将在下面讨论。在275个口干症模型中,有两个模型至少进行了两次外部验证。两项研究验证了放疗后6个月口干症的Beetz 2012b模型。c统计量从0.70到0.74不等。一项研究报告了校准性能。一项验证研究被评为在所有领域具有低ROB,而另一项被评为在分析领域具有高ROB。在同一项研究中,使用两种不同类型的队列,对用于鼻咽癌患者放疗期间急性口干的Cavallo 2021模型进行了外部验证。c统计量范围为0.68至0.73,两个队列均报告了校准图。由于没有提供有关招募的详细信息,这两种验证都被认为在参与者的领域中具有不明确的ROB。在86个吞咽困难模型中,有两个模型至少进行了两次外部验证。放疗后6个月吞咽困难的Christianen 2012模型在5项研究中得到验证。c统计量从0.66到0.75不等。对所有的校准性能进行了评估,其中四个由于样本量小,在分析域中被评为具有高ROB。Wopken 2014b放疗后6个月管饲依赖模型在3个外部验证研究中得到验证。c -统计量范围为0.79至0.95,所有研究均对校准进行了评估。由于验证数据集的规模较小,因此在分析域中判断它们具有较高的ROB。66个甲状腺功能减退模型中,有2个模型进行了至少2次外部验证。此外,还有另一个模型,最初不是为HNC患者开发的,但在该领域得到了验证。在两项研究中,Boomsma 2012用于治疗放疗后两年内的甲状腺功能减退。c -统计量范围为0.64 ~ 0.74,只有一项研究报告了其校准性能。两项验证研究在分析领域都被评为具有高ROB。Ronjom 2013放射性甲状腺功能减退模型在三项研究中得到验证。c统计量范围为0.65至0.69,校准图仅在一项研究中报告。两项验证研究在分析领域被判定为具有高的ROB,另一项被评为具有不明确的ROB。Cella 2012模型最初是为了预测霍奇金淋巴瘤患者放射诱导的甲状腺功能减退而开发的。在两项针对HNC患者的验证研究中,c统计量范围为0.65至0.68,但未报道校准性能。 一项验证研究被评为具有高ROB,另一项被评为在分析领域不明确。在6个颞叶损伤模型中,有2个模型至少进行了两次外部验证。欧阳2023模型在鼻咽癌患者中使用深度学习,在同一篇论文中使用两个不同的队列进行了验证。c统计量范围为0.80至0.82,同时对两个队列的校准性能进行评估。两种验证在所有领域都被判定为具有较低的ROB。wen2021模型用于预测新诊断鼻咽癌患者颞叶损伤。欧阳2023使用两个队列验证了该模型。c统计量范围为0.77至0.79,而校准性能未报告。两种验证都被判断为在分析域中具有不明确的ROB。与声音嘶哑、疲劳、恶心呕吐、咽喉痛、误吸相关的结果模型均未进行至少两次外部验证。作者的结论是:在592个已开发的模型中,有限的几个具有足够的质量。只有五分之一的模型被外部验证,其中只有9个模型至少被验证两次。这9个模型在外部验证中表现出可接受的判别性能。然而,它们的校准性能并不总是被报道。此外,大多数验证研究被判定为具有高ROB,主要是由于分析领域的问题。总之,本综述表明,在将开发的模型应用于临床实践之前,需要进行更多的外部验证研究,并提高预测模型研究的指导和报告质量。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
10.60
自引率
2.40%
发文量
173
审稿时长
1-2 weeks
期刊介绍: The Cochrane Database of Systematic Reviews (CDSR) stands as the premier database for systematic reviews in healthcare. It comprises Cochrane Reviews, along with protocols for these reviews, editorials, and supplements. Owned and operated by Cochrane, a worldwide independent network of healthcare stakeholders, the CDSR (ISSN 1469-493X) encompasses a broad spectrum of health-related topics, including health services.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信