Allison M Ng, Du Q Huynh, Rebecca A D'Alonzo, Synat Keam, Pejman Rowshanfarzad, Anna K Nowak, Suki Gill, Alistair M Cook, Martin A Ebert
{"title":"Reinforcement learning with mechanistic models to optimise radiotherapy and immunotherapy combinations: a proof of concept.","authors":"Allison M Ng, Du Q Huynh, Rebecca A D'Alonzo, Synat Keam, Pejman Rowshanfarzad, Anna K Nowak, Suki Gill, Alistair M Cook, Martin A Ebert","doi":"10.1088/1361-6560/ae0863","DOIUrl":null,"url":null,"abstract":"<p><p><i>Objective.</i>To investigate the use of reinforcement learning (RL) algorithms to optimise complex combination cancer therapies. The RL algorithm investigated the effect of varying the radiotherapy (RT) dose in each fraction when administered in conjunction with the immune checkpoint inhibitors (ICIs) anti-PD-1 and anti-CTLA-4.<i>Approach.</i>Data were available for BALB/c mice inoculated with a syngeneic mesothelioma tumour on the flank, treated with combination RT and ICI with tumour growth subsequently measured. A deep<i>Q</i>-network (DQN) and a double DQN were trained using a mechanistic model fitted to the mesothelioma volumes to simulate the dynamics of the tumour microenvironment. Two reward functions were created for the RL algorithm to optimise: the first only considered tumour cell killing, while the second penalised treatment schedules with higher total RT dose. Comparison with experimental results was via the tumour control probability (TCP).<i>Main Results.</i>All the TCPs obtained with the RL algorithm exceeded the TCPs obtained with the same mechanistic model when only 1 or 2 fractions of RT were administered. However, the baseline schedule of 2 Gy per fraction outperformed the treatment schedules generated by RL.<i>Significance.</i>This study highlights the potential for RL to explore the vast solution space of possible treatment schedules, conceivably at the individual patient level.</p>","PeriodicalId":20185,"journal":{"name":"Physics in medicine and biology","volume":" ","pages":""},"PeriodicalIF":3.4000,"publicationDate":"2025-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Physics in medicine and biology","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1088/1361-6560/ae0863","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Objective.To investigate the use of reinforcement learning (RL) algorithms to optimise complex combination cancer therapies. The RL algorithm investigated the effect of varying the radiotherapy (RT) dose in each fraction when administered in conjunction with the immune checkpoint inhibitors (ICIs) anti-PD-1 and anti-CTLA-4.Approach.Data were available for BALB/c mice inoculated with a syngeneic mesothelioma tumour on the flank, treated with combination RT and ICI with tumour growth subsequently measured. A deepQ-network (DQN) and a double DQN were trained using a mechanistic model fitted to the mesothelioma volumes to simulate the dynamics of the tumour microenvironment. Two reward functions were created for the RL algorithm to optimise: the first only considered tumour cell killing, while the second penalised treatment schedules with higher total RT dose. Comparison with experimental results was via the tumour control probability (TCP).Main Results.All the TCPs obtained with the RL algorithm exceeded the TCPs obtained with the same mechanistic model when only 1 or 2 fractions of RT were administered. However, the baseline schedule of 2 Gy per fraction outperformed the treatment schedules generated by RL.Significance.This study highlights the potential for RL to explore the vast solution space of possible treatment schedules, conceivably at the individual patient level.
期刊介绍:
The development and application of theoretical, computational and experimental physics to medicine, physiology and biology. Topics covered are: therapy physics (including ionizing and non-ionizing radiation); biomedical imaging (e.g. x-ray, magnetic resonance, ultrasound, optical and nuclear imaging); image-guided interventions; image reconstruction and analysis (including kinetic modelling); artificial intelligence in biomedical physics and analysis; nanoparticles in imaging and therapy; radiobiology; radiation protection and patient dose monitoring; radiation dosimetry