基于大语言模型的宫颈癌放射治疗方案自动规划可行性研究。

IF 3.3 2区医学 Q2 ONCOLOGY

Radiation Oncology Pub Date : 2025-05-15 DOI:10.1186/s13014-025-02660-5

Shuoyang Wei, Ankang Hu, Yongguang Liang, Jingru Yang, Lang Yu, Wenbo Li, Bo Yang, Jie Qiu

{"title":"基于大语言模型的宫颈癌放射治疗方案自动规划可行性研究。","authors":"Shuoyang Wei, Ankang Hu, Yongguang Liang, Jingru Yang, Lang Yu, Wenbo Li, Bo Yang, Jie Qiu","doi":"10.1186/s13014-025-02660-5","DOIUrl":null,"url":null,"abstract":"Background: Radiotherapy treatment planning traditionally involves complex and time-consuming processes, often relying on trial-and-error methods. The emergence of artificial intelligence, particularly Large Language Models (LLMs), surpassing human capabilities and existing algorithms in various domains, presents an opportunity to automate and enhance this optimization process.Purpose: This study seeks to evaluate the capacity of LLMs to generate radiotherapy treatment plans comparable to those crafted by human medical physicists, focusing on target volume conformity and organs-at-risk (OARs) dose sparing. The goal is to automate the optimization process of radiotherapy treatment plans through the utilization of LLMs.Methods: Multiple LLMs were employed to adjust optimization parameters for radiotherapy treatment plans, using a dataset comprising 35 cervical cancer patients treated with volumetric modulated arc therapy (VMAT). Customized prompts were applied to 5 patients to tailor the LLMs, which were subsequently tested on 30 patients. Evaluation metrics included target volume conformity, dose homogeneity, monitor units (MU) value, and OARs dose sparing, comparing plans generated by various LLMs to manual plans.Results: With the exception of Gemini-1.5-flash, which faced challenges due to hallucinations, Qwen-2.5-max and Llama-3.2 produced acceptable VMAT plans in 16.3 ± 5.0 and 9.8 ± 2.1 min, respectively, outperforming an experienced human physicist's time cost of about 20 min. The average conformity index (CI) for Qwen-2.5-max plans, Llama-3.2 plans, and manual plans on the test set were 0.929 ± 0.007, 0.928 ± 0.007, and 0.926 ± 0.007, respectively. The average homogeneity index (HI) was 0.058 ± 0.006, 0.059 ± 0.005, and 0.065 ± 0.006, respectively. While there was a significant difference in target volume conformity between LLM plans and manual plans, OARs dose sparing showed no significant variations. In lateral comparisons among different LLMs, no statistically significant differences were observed in the PTV dose, OARs dose sparing, and target volume conformity between Qwen-2.5-max and Llama-3.2 plans.Conclusions: Through an assessment of LLM-generated plans and clinical plans in terms of target volume conformity and OARs dose sparing, this study provides preliminary evidence supporting the viability of LLMs for optimizing radiotherapy treatment plans. The implementation of LLMs demonstrates the potential for enhancing clinical workflows and reducing the workload associated with treatment planning.","PeriodicalId":49639,"journal":{"name":"Radiation Oncology","volume":"20 1","pages":"77"},"PeriodicalIF":3.3000,"publicationDate":"2025-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12083153/pdf/","citationCount":"0","resultStr":"{\"title\":\"Feasibility study of automatic radiotherapy treatment planning for cervical cancer using a large language model.\",\"authors\":\"Shuoyang Wei, Ankang Hu, Yongguang Liang, Jingru Yang, Lang Yu, Wenbo Li, Bo Yang, Jie Qiu\",\"doi\":\"10.1186/s13014-025-02660-5\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Background: Radiotherapy treatment planning traditionally involves complex and time-consuming processes, often relying on trial-and-error methods. The emergence of artificial intelligence, particularly Large Language Models (LLMs), surpassing human capabilities and existing algorithms in various domains, presents an opportunity to automate and enhance this optimization process.Purpose: This study seeks to evaluate the capacity of LLMs to generate radiotherapy treatment plans comparable to those crafted by human medical physicists, focusing on target volume conformity and organs-at-risk (OARs) dose sparing. The goal is to automate the optimization process of radiotherapy treatment plans through the utilization of LLMs.Methods: Multiple LLMs were employed to adjust optimization parameters for radiotherapy treatment plans, using a dataset comprising 35 cervical cancer patients treated with volumetric modulated arc therapy (VMAT). Customized prompts were applied to 5 patients to tailor the LLMs, which were subsequently tested on 30 patients. Evaluation metrics included target volume conformity, dose homogeneity, monitor units (MU) value, and OARs dose sparing, comparing plans generated by various LLMs to manual plans.Results: With the exception of Gemini-1.5-flash, which faced challenges due to hallucinations, Qwen-2.5-max and Llama-3.2 produced acceptable VMAT plans in 16.3 ± 5.0 and 9.8 ± 2.1 min, respectively, outperforming an experienced human physicist's time cost of about 20 min. The average conformity index (CI) for Qwen-2.5-max plans, Llama-3.2 plans, and manual plans on the test set were 0.929 ± 0.007, 0.928 ± 0.007, and 0.926 ± 0.007, respectively. The average homogeneity index (HI) was 0.058 ± 0.006, 0.059 ± 0.005, and 0.065 ± 0.006, respectively. While there was a significant difference in target volume conformity between LLM plans and manual plans, OARs dose sparing showed no significant variations. In lateral comparisons among different LLMs, no statistically significant differences were observed in the PTV dose, OARs dose sparing, and target volume conformity between Qwen-2.5-max and Llama-3.2 plans.Conclusions: Through an assessment of LLM-generated plans and clinical plans in terms of target volume conformity and OARs dose sparing, this study provides preliminary evidence supporting the viability of LLMs for optimizing radiotherapy treatment plans. The implementation of LLMs demonstrates the potential for enhancing clinical workflows and reducing the workload associated with treatment planning.\",\"PeriodicalId\":49639,\"journal\":{\"name\":\"Radiation Oncology\",\"volume\":\"20 1\",\"pages\":\"77\"},\"PeriodicalIF\":3.3000,\"publicationDate\":\"2025-05-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12083153/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Radiation Oncology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s13014-025-02660-5\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ONCOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Radiation Oncology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s13014-025-02660-5","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}

引用次数: 0

摘要

背景：放射治疗计划传统上涉及复杂和耗时的过程，通常依赖于试错方法。人工智能的出现，特别是大型语言模型（llm）的出现，超越了人类在各个领域的能力和现有算法，为自动化和增强这一优化过程提供了机会。目的：本研究旨在评估LLMs产生与人类医学物理学家制定的放射治疗计划相当的能力，重点关注靶体积一致性和危险器官（OARs）剂量节约。目标是通过利用llm实现放射治疗计划的自动化优化过程。方法：使用多个llm来调整放射治疗计划的优化参数，数据集包括35例接受体积调制电弧治疗（VMAT）的宫颈癌患者。对5名患者应用定制提示来定制llm，随后对30名患者进行了测试。评估指标包括目标体积一致性、剂量均匀性、监测单位（MU）值和OARs剂量节约，将各种llm生成的计划与人工计划进行比较。结果：除Gemini-1.5-flash存在幻觉挑战外，Qwen-2.5-max和Llama-3.2分别在16.3±5.0和9.8±2.1 min内生成可接受的VMAT方案，优于经验丰富的人类物理学家约20 min的时间成本。Qwen-2.5-max方案、Llama-3.2方案和手动方案在测试集上的平均符合指数（CI）分别为0.929±0.007、0.928±0.007和0.926±0.007。平均均匀性指数（HI）分别为0.058±0.006、0.059±0.005和0.065±0.006。虽然LLM计划和手动计划在靶体积一致性方面存在显著差异，但OARs剂量节约没有显着差异。在不同llm的横向比较中，Qwen-2.5-max和Llama-3.2方案在PTV剂量、OARs剂量节约和靶体积一致性方面均无统计学差异。结论：通过对llm生成计划和临床计划在靶体积一致性和OARs剂量节约方面的评估，本研究为llm优化放疗治疗计划的可行性提供了初步证据。llm的实施证明了加强临床工作流程和减少与治疗计划相关的工作量的潜力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Feasibility study of automatic radiotherapy treatment planning for cervical cancer using a large language model.

查看原文本刊更多论文

Feasibility study of automatic radiotherapy treatment planning for cervical cancer using a large language model.

Background: Radiotherapy treatment planning traditionally involves complex and time-consuming processes, often relying on trial-and-error methods. The emergence of artificial intelligence, particularly Large Language Models (LLMs), surpassing human capabilities and existing algorithms in various domains, presents an opportunity to automate and enhance this optimization process.

Purpose: This study seeks to evaluate the capacity of LLMs to generate radiotherapy treatment plans comparable to those crafted by human medical physicists, focusing on target volume conformity and organs-at-risk (OARs) dose sparing. The goal is to automate the optimization process of radiotherapy treatment plans through the utilization of LLMs.

Methods: Multiple LLMs were employed to adjust optimization parameters for radiotherapy treatment plans, using a dataset comprising 35 cervical cancer patients treated with volumetric modulated arc therapy (VMAT). Customized prompts were applied to 5 patients to tailor the LLMs, which were subsequently tested on 30 patients. Evaluation metrics included target volume conformity, dose homogeneity, monitor units (MU) value, and OARs dose sparing, comparing plans generated by various LLMs to manual plans.

Results: With the exception of Gemini-1.5-flash, which faced challenges due to hallucinations, Qwen-2.5-max and Llama-3.2 produced acceptable VMAT plans in 16.3 ± 5.0 and 9.8 ± 2.1 min, respectively, outperforming an experienced human physicist's time cost of about 20 min. The average conformity index (CI) for Qwen-2.5-max plans, Llama-3.2 plans, and manual plans on the test set were 0.929 ± 0.007, 0.928 ± 0.007, and 0.926 ± 0.007, respectively. The average homogeneity index (HI) was 0.058 ± 0.006, 0.059 ± 0.005, and 0.065 ± 0.006, respectively. While there was a significant difference in target volume conformity between LLM plans and manual plans, OARs dose sparing showed no significant variations. In lateral comparisons among different LLMs, no statistically significant differences were observed in the PTV dose, OARs dose sparing, and target volume conformity between Qwen-2.5-max and Llama-3.2 plans.

Conclusions: Through an assessment of LLM-generated plans and clinical plans in terms of target volume conformity and OARs dose sparing, this study provides preliminary evidence supporting the viability of LLMs for optimizing radiotherapy treatment plans. The implementation of LLMs demonstrates the potential for enhancing clinical workflows and reducing the workload associated with treatment planning.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Radiation Oncology ONCOLOGY-RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING

CiteScore

6.50

自引率

2.80%

发文量

181

审稿时长

3-6 weeks

期刊介绍： Radiation Oncology encompasses all aspects of research that impacts on the treatment of cancer using radiation. It publishes findings in molecular and cellular radiation biology, radiation physics, radiation technology, and clinical oncology.