Clinical prediction models using machine learning in oncology: challenges and recommendations.

BMJ oncology Pub Date : 2025-10-07 eCollection Date: 2025-01-01 DOI:10.1136/bmjonc-2025-000914

Gary S Collins, Mae Chester-Jones, Stephen Gerry, Jie Ma, Joao Matos, Jyoti Sehjal, Biruk Tsegaye, Paula Dhiman

{"title":"Clinical prediction models using machine learning in oncology: challenges and recommendations.","authors":"Gary S Collins, Mae Chester-Jones, Stephen Gerry, Jie Ma, Joao Matos, Jyoti Sehjal, Biruk Tsegaye, Paula Dhiman","doi":"10.1136/bmjonc-2025-000914","DOIUrl":null,"url":null,"abstract":"<p><p>Clinical prediction models are widely developed in the field of oncology, providing individualised risk estimates to aid diagnosis and prognosis. Machine learning methods are increasingly being used to develop prediction models, yet many suffer from methodological flaws limiting clinical implementation. This review outlines key considerations for developing robust, equitable prediction models in cancer care. Critical steps include systematic review of existing models, protocol development, registration, end-user engagement, sample size calculations and ensuring data representativeness across target populations. Technical challenges encompass handling missing data, addressing fairness across demographic groups and managing complex data structures, including censored observations, competing risks or clustering effects. Comprehensive internal and external evaluation requires assessment of both statistical performance (discrimination and calibration) and clinical utility. Implementation barriers include limited stakeholder engagement, insufficient clinical utility evidence, a lack of consideration of workflow integration and the absence of post-deployment monitoring plans. Despite significant potential for personalising cancer care, most prediction models remain unimplemented due to these methodological and translational challenges. Addressing these considerations from study design through post implementation monitoring is essential for developing trustworthy tools that bridge the gap between model development and clinical practice in oncology.</p>","PeriodicalId":72436,"journal":{"name":"BMJ oncology","volume":"4 1","pages":"e000914"},"PeriodicalIF":0.0000,"publicationDate":"2025-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12506039/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMJ oncology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1136/bmjonc-2025-000914","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Clinical prediction models are widely developed in the field of oncology, providing individualised risk estimates to aid diagnosis and prognosis. Machine learning methods are increasingly being used to develop prediction models, yet many suffer from methodological flaws limiting clinical implementation. This review outlines key considerations for developing robust, equitable prediction models in cancer care. Critical steps include systematic review of existing models, protocol development, registration, end-user engagement, sample size calculations and ensuring data representativeness across target populations. Technical challenges encompass handling missing data, addressing fairness across demographic groups and managing complex data structures, including censored observations, competing risks or clustering effects. Comprehensive internal and external evaluation requires assessment of both statistical performance (discrimination and calibration) and clinical utility. Implementation barriers include limited stakeholder engagement, insufficient clinical utility evidence, a lack of consideration of workflow integration and the absence of post-deployment monitoring plans. Despite significant potential for personalising cancer care, most prediction models remain unimplemented due to these methodological and translational challenges. Addressing these considerations from study design through post implementation monitoring is essential for developing trustworthy tools that bridge the gap between model development and clinical practice in oncology.

查看原文本刊更多论文

肿瘤学中使用机器学习的临床预测模型：挑战和建议。

临床预测模型在肿瘤学领域得到了广泛的发展，提供个性化的风险估计，以帮助诊断和预后。机器学习方法越来越多地被用于开发预测模型，但许多方法上的缺陷限制了临床实施。这篇综述概述了在癌症治疗中建立稳健、公平的预测模型的关键考虑因素。关键步骤包括系统地审查现有模型、制定方案、注册、最终用户参与、样本量计算和确保目标人群的数据代表性。技术挑战包括处理缺失数据，解决人口群体之间的公平性问题，以及管理复杂的数据结构，包括审查后的观察结果、竞争风险或聚类效应。全面的内部和外部评估需要评估统计性能（区分和校准）和临床效用。实施障碍包括利益相关者参与有限、临床效用证据不足、缺乏对工作流程集成的考虑以及缺乏部署后监测计划。尽管个性化癌症治疗具有巨大的潜力，但由于这些方法和转化方面的挑战，大多数预测模型仍未实现。从研究设计到实施后监测，解决这些问题对于开发可信赖的工具至关重要，这些工具可以弥合肿瘤模型开发和临床实践之间的差距。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

BMJ oncology

自引率

0.00%

发文量