Winnie Lay, Ha My Ngoc Nguyen, Elias El-Barhoun, Rory F Kokelaar, Justin M Yeung
{"title":"Artificial Intelligence Models Using Magnetic Resonance Imaging to Predict Response to Chemoradiotherapy in Rectal Cancer: A Systematic Review.","authors":"Winnie Lay, Ha My Ngoc Nguyen, Elias El-Barhoun, Rory F Kokelaar, Justin M Yeung","doi":"10.1111/ans.70724","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Pathological complete response (pCR) following neoadjuvant chemoradiotherapy (nCRT) in locally advanced rectal cancer (LARC) is a key prognostic marker with implications for response-adapted management. Although magnetic resonance imaging (MRI) is central to response assessment, differentiating residual tumour from treatment-related changes remains challenging. Artificial intelligence (AI) and machine learning (ML) models applied to MRI show promise in predicting pCR; however, variability in methodology and performance limits clinical translation.</p><p><strong>Methods: </strong>A search of Embase, Medline, Cochrane and Web of Science was conducted in April 2025 in accordance with Preferred Reporting Items for Reviews and Meta-Analysis (PRISMA) guidelines. Eligible studies used MRI-only AI or ML models to predict pCR following chemoradiotherapy in adults with rectal cancer. Screening, full-text review and data extraction were performed independently by two reviewers. Risk of bias was assessed using Quality Assessment of Diagnostic Accuracy Studies (QUADAS-2).</p><p><strong>Results: </strong>Twenty-two studies comprising 94 predictive models were included. Most studies were retrospective, used T2-weighted MRI and demonstrated variability in MRI protocols, modelling methods and validation strategies. Only five studies conducted external validation. Median AUC was 0.801, with performance ranging from poor to excellent (AUC 0.49-0.997).</p><p><strong>Conclusion: </strong>MRI-based AI models demonstrate moderate discriminative performance for predicting pCR following neoadjuvant therapy in LARC. However, methodological heterogeneity, inconsistent reporting and limited external validation currently hinder generalisability. Greater methodological standardisation and multicentre external validation are required before clinical implementation.</p>","PeriodicalId":8158,"journal":{"name":"ANZ Journal of Surgery","volume":" ","pages":""},"PeriodicalIF":1.6000,"publicationDate":"2026-05-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ANZ Journal of Surgery","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1111/ans.70724","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"SURGERY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Pathological complete response (pCR) following neoadjuvant chemoradiotherapy (nCRT) in locally advanced rectal cancer (LARC) is a key prognostic marker with implications for response-adapted management. Although magnetic resonance imaging (MRI) is central to response assessment, differentiating residual tumour from treatment-related changes remains challenging. Artificial intelligence (AI) and machine learning (ML) models applied to MRI show promise in predicting pCR; however, variability in methodology and performance limits clinical translation.
Methods: A search of Embase, Medline, Cochrane and Web of Science was conducted in April 2025 in accordance with Preferred Reporting Items for Reviews and Meta-Analysis (PRISMA) guidelines. Eligible studies used MRI-only AI or ML models to predict pCR following chemoradiotherapy in adults with rectal cancer. Screening, full-text review and data extraction were performed independently by two reviewers. Risk of bias was assessed using Quality Assessment of Diagnostic Accuracy Studies (QUADAS-2).
Results: Twenty-two studies comprising 94 predictive models were included. Most studies were retrospective, used T2-weighted MRI and demonstrated variability in MRI protocols, modelling methods and validation strategies. Only five studies conducted external validation. Median AUC was 0.801, with performance ranging from poor to excellent (AUC 0.49-0.997).
Conclusion: MRI-based AI models demonstrate moderate discriminative performance for predicting pCR following neoadjuvant therapy in LARC. However, methodological heterogeneity, inconsistent reporting and limited external validation currently hinder generalisability. Greater methodological standardisation and multicentre external validation are required before clinical implementation.
背景:局部晚期直肠癌(LARC)新辅助放化疗(nCRT)后的病理完全缓解(pCR)是一个关键的预后指标,对反应适应管理具有重要意义。尽管磁共振成像(MRI)是反应评估的核心,但区分残余肿瘤与治疗相关的变化仍然具有挑战性。应用于MRI的人工智能(AI)和机器学习(ML)模型有望预测pCR;然而,方法和性能的可变性限制了临床翻译。方法:于2025年4月按照PRISMA (Preferred Reporting Items for Reviews and Meta-Analysis)指南对Embase、Medline、Cochrane和Web of Science进行检索。符合条件的研究仅使用mri AI或ML模型来预测成人直肠癌放化疗后的pCR。筛选、全文审查和数据提取由两名审稿人独立完成。使用诊断准确性研究质量评估(QUADAS-2)评估偏倚风险。结果:共纳入22项研究,共94个预测模型。大多数研究是回顾性的,使用t2加权MRI,并证明MRI方案,建模方法和验证策略的可变性。只有5项研究进行了外部验证。中位AUC为0.801,表现从差到优(AUC 0.49 ~ 0.997)。结论:基于mri的人工智能模型在预测LARC新辅助治疗后的pCR方面表现出适度的判别能力。然而,方法的异质性、不一致的报告和有限的外部验证目前阻碍了推广。在临床实施之前,需要更大的方法标准化和多中心外部验证。
期刊介绍:
ANZ Journal of Surgery is published by Wiley on behalf of the Royal Australasian College of Surgeons to provide a medium for the publication of peer-reviewed original contributions related to clinical practice and/or research in all fields of surgery and related disciplines. It also provides a programme of continuing education for surgeons. All articles are peer-reviewed by at least two researchers expert in the field of the submitted paper.