基于多智能体深度强化学习的乳房放射治疗影响绘画。

IF 3.2 2区医学 Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING

Medical physics Pub Date : 2025-01-23 DOI:10.1002/mp.17615

Yang Dongrong, Li Xinyi, Yoo Sua, Blitzblau Rachel, McDuff Susan, Stephens Sarah, Segars Paul, Wu Q Jackie, Sheng Yang

{"title":"基于多智能体深度强化学习的乳房放射治疗影响绘画。","authors":"Yang Dongrong, Li Xinyi, Yoo Sua, Blitzblau Rachel, McDuff Susan, Stephens Sarah, Segars Paul, Wu Q Jackie, Sheng Yang","doi":"10.1002/mp.17615","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Background</h3>\n \n <p>The electronic compensation (ECOMP) technique for breast radiation therapy provides excellent dose conformity and homogeneity. However, the manual fluence painting process presents a challenge for efficient clinical operation.</p>\n </section>\n \n <section>\n \n <h3> Purpose</h3>\n \n <p>To facilitate the clinical treatment planning automation of breast radiation therapy, we utilized reinforcement learning (RL) to develop an auto-planning tool that iteratively edits the fluence maps under the guidance of clinically relevant objectives.</p>\n </section>\n \n <section>\n \n <h3> Methods</h3>\n \n <p>With institutional review board (IRB) approval, 70 patients treated with 6MV tangential photon beams with ECOMP technique were retrospectively collected and included in this study (20/50 for training/testing). Each pixel in the fluence map was assigned a reinforcement learning agent to perform independent action. Beam-eye-view projected dose profiles were generated to form state information as the input of the RL network. By predicting the Q value, pixel-wise actions were selected to modify specific pixel value in the fluence maps to improve overall plan quality. After dose calculation, reward signal calculated from the variation of target coverage and dose homogeneity was fed back to the RL framework and used to update network parameters. The RL generated plans were evaluated with dose distribution and dosimetric endpoints (i.e., Breast PTV V90%, Breast PTV V95%, Breast PTV V105%, Lung V20 Gy, Heart V5 Gy, Dmax) and compared with clinical plans.</p>\n </section>\n \n <section>\n \n <h3> Results</h3>\n \n <p>The RL agent took around 90 s to generate a ECOMP treatment plan. The RL plans exhibited plan quality comparable to clinical plans in terms of isodose distribution and dosimetric endpoints. The mean Breast PTV V95%, Breast PTV V105% of RL plans are <span></span><math>\n <semantics>\n <mrow>\n <mn>77.759</mn>\n <mrow>\n <mspace></mspace>\n <mo>%</mo>\n </mrow>\n <mo>(</mo>\n <mrow>\n <mo>±</mo>\n <mn>8.904</mn>\n <mrow>\n <mspace></mspace>\n <mo>%</mo>\n </mrow>\n </mrow>\n <mo>)</mo>\n </mrow>\n <annotation>$77.759{\\mathrm{\\ \\% }}( { \\pm 8.904{\\mathrm{\\ \\% }}} )$</annotation>\n </semantics></math> and <span></span><math>\n <semantics>\n <mrow>\n <mn>8.522</mn>\n <mrow>\n <mspace></mspace>\n <mi>cc</mi>\n <mspace></mspace>\n </mrow>\n <mo>(</mo>\n <mrow>\n <mo>±</mo>\n <mn>11.469</mn>\n <mrow>\n <mspace></mspace>\n <mi>cc</mi>\n </mrow>\n </mrow>\n <mo>)</mo>\n </mrow>\n <annotation>$8.522{\\mathrm{\\ cc\\ }}( { \\pm 11.469{\\mathrm{\\ cc}}} )$</annotation>\n </semantics></math>, compared to <span></span><math>\n <semantics>\n <mrow>\n <mn>78.568</mn>\n <mrow>\n <mspace></mspace>\n <mo>%</mo>\n </mrow>\n <mo>(</mo>\n <mrow>\n <mo>±</mo>\n <mn>9.094</mn>\n <mrow>\n <mspace></mspace>\n <mo>%</mo>\n </mrow>\n </mrow>\n <mo>)</mo>\n </mrow>\n <annotation>$78.568{\\mathrm{\\ \\% }}( { \\pm 9.094{\\mathrm{\\ \\% }}} )$</annotation>\n </semantics></math> and <span></span><math>\n <semantics>\n <mrow>\n <mn>34.298</mn>\n <mspace></mspace>\n <mi>cc</mi>\n <mspace></mspace>\n <mo>(</mo>\n <mrow>\n <mo>±</mo>\n <mn>36.297</mn>\n <mrow>\n <mspace></mspace>\n <mi>cc</mi>\n </mrow>\n </mrow>\n <mo>)</mo>\n </mrow>\n <annotation>$34.298\\ {\\mathrm{cc}}\\ ( { \\pm 36.297{\\mathrm{\\ cc}}} )$</annotation>\n </semantics></math> cc of clinical plans.</p>\n </section>\n \n <section>\n \n <h3> Conclusion</h3>\n \n <p>The developed RL framework efficiently generates breast ECOMP plans with clinical acceptable plan quality.</p>\n </section>\n </div>","PeriodicalId":18384,"journal":{"name":"Medical physics","volume":"52 4","pages":"2015-2024"},"PeriodicalIF":3.2000,"publicationDate":"2025-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Breast radiation therapy fluence painting with multi-agent deep reinforcement learning\",\"authors\":\"Yang Dongrong, Li Xinyi, Yoo Sua, Blitzblau Rachel, McDuff Susan, Stephens Sarah, Segars Paul, Wu Q Jackie, Sheng Yang\",\"doi\":\"10.1002/mp.17615\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n \\n <section>\\n \\n <h3> Background</h3>\\n \\n <p>The electronic compensation (ECOMP) technique for breast radiation therapy provides excellent dose conformity and homogeneity. However, the manual fluence painting process presents a challenge for efficient clinical operation.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Purpose</h3>\\n \\n <p>To facilitate the clinical treatment planning automation of breast radiation therapy, we utilized reinforcement learning (RL) to develop an auto-planning tool that iteratively edits the fluence maps under the guidance of clinically relevant objectives.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Methods</h3>\\n \\n <p>With institutional review board (IRB) approval, 70 patients treated with 6MV tangential photon beams with ECOMP technique were retrospectively collected and included in this study (20/50 for training/testing). Each pixel in the fluence map was assigned a reinforcement learning agent to perform independent action. Beam-eye-view projected dose profiles were generated to form state information as the input of the RL network. By predicting the Q value, pixel-wise actions were selected to modify specific pixel value in the fluence maps to improve overall plan quality. After dose calculation, reward signal calculated from the variation of target coverage and dose homogeneity was fed back to the RL framework and used to update network parameters. The RL generated plans were evaluated with dose distribution and dosimetric endpoints (i.e., Breast PTV V90%, Breast PTV V95%, Breast PTV V105%, Lung V20 Gy, Heart V5 Gy, Dmax) and compared with clinical plans.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Results</h3>\\n \\n <p>The RL agent took around 90 s to generate a ECOMP treatment plan. The RL plans exhibited plan quality comparable to clinical plans in terms of isodose distribution and dosimetric endpoints. The mean Breast PTV V95%, Breast PTV V105% of RL plans are <span></span><math>\\n <semantics>\\n <mrow>\\n <mn>77.759</mn>\\n <mrow>\\n <mspace></mspace>\\n <mo>%</mo>\\n </mrow>\\n <mo>(</mo>\\n <mrow>\\n <mo>±</mo>\\n <mn>8.904</mn>\\n <mrow>\\n <mspace></mspace>\\n <mo>%</mo>\\n </mrow>\\n </mrow>\\n <mo>)</mo>\\n </mrow>\\n <annotation>$77.759{\\\\mathrm{\\\\ \\\\% }}( { \\\\pm 8.904{\\\\mathrm{\\\\ \\\\% }}} )$</annotation>\\n </semantics></math> and <span></span><math>\\n <semantics>\\n <mrow>\\n <mn>8.522</mn>\\n <mrow>\\n <mspace></mspace>\\n <mi>cc</mi>\\n <mspace></mspace>\\n </mrow>\\n <mo>(</mo>\\n <mrow>\\n <mo>±</mo>\\n <mn>11.469</mn>\\n <mrow>\\n <mspace></mspace>\\n <mi>cc</mi>\\n </mrow>\\n </mrow>\\n <mo>)</mo>\\n </mrow>\\n <annotation>$8.522{\\\\mathrm{\\\\ cc\\\\ }}( { \\\\pm 11.469{\\\\mathrm{\\\\ cc}}} )$</annotation>\\n </semantics></math>, compared to <span></span><math>\\n <semantics>\\n <mrow>\\n <mn>78.568</mn>\\n <mrow>\\n <mspace></mspace>\\n <mo>%</mo>\\n </mrow>\\n <mo>(</mo>\\n <mrow>\\n <mo>±</mo>\\n <mn>9.094</mn>\\n <mrow>\\n <mspace></mspace>\\n <mo>%</mo>\\n </mrow>\\n </mrow>\\n <mo>)</mo>\\n </mrow>\\n <annotation>$78.568{\\\\mathrm{\\\\ \\\\% }}( { \\\\pm 9.094{\\\\mathrm{\\\\ \\\\% }}} )$</annotation>\\n </semantics></math> and <span></span><math>\\n <semantics>\\n <mrow>\\n <mn>34.298</mn>\\n <mspace></mspace>\\n <mi>cc</mi>\\n <mspace></mspace>\\n <mo>(</mo>\\n <mrow>\\n <mo>±</mo>\\n <mn>36.297</mn>\\n <mrow>\\n <mspace></mspace>\\n <mi>cc</mi>\\n </mrow>\\n </mrow>\\n <mo>)</mo>\\n </mrow>\\n <annotation>$34.298\\\\ {\\\\mathrm{cc}}\\\\ ( { \\\\pm 36.297{\\\\mathrm{\\\\ cc}}} )$</annotation>\\n </semantics></math> cc of clinical plans.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Conclusion</h3>\\n \\n <p>The developed RL framework efficiently generates breast ECOMP plans with clinical acceptable plan quality.</p>\\n </section>\\n </div>\",\"PeriodicalId\":18384,\"journal\":{\"name\":\"Medical physics\",\"volume\":\"52 4\",\"pages\":\"2015-2024\"},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2025-01-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Medical physics\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/mp.17615\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical physics","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/mp.17615","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}

引用次数: 0

摘要

背景：电子补偿（ECOMP）技术用于乳腺放射治疗具有良好的剂量一致性和均匀性。然而，手工灌流涂布工艺对临床高效操作提出了挑战。目的：为了促进乳腺放射治疗的临床治疗计划自动化，我们利用强化学习（RL）开发了一个自动计划工具，该工具可以在临床相关目标的指导下迭代编辑影响图。方法：经机构审查委员会（IRB）批准，回顾性收集70例经6MV切向光子束ECOMP技术治疗的患者（20/50为培训/测试）。在影响力图中每个像素被分配一个强化学习代理来执行独立的动作。生成束眼投影剂量分布图，形成状态信息作为RL网络的输入。通过预测Q值，选择逐像素操作来修改影响图中的特定像素值，以提高整体计划质量。剂量计算完成后，根据目标覆盖率和剂量均匀性的变化计算奖励信号反馈给RL框架，用于更新网络参数。通过剂量分布和剂量学终点（即乳腺PTV V90%、乳腺PTV V95%、乳腺PTV V105%、肺V20 Gy、心脏V5 Gy、Dmax）对RL生成方案进行评价，并与临床方案进行比较。结果：RL剂产生ECOMP治疗方案的时间约为90 s。在等剂量分布和剂量学终点方面，RL计划显示出与临床计划相当的计划质量。均值乳房PTV V95%,乳房PTV V105% RL计划的77.759%(±8.904%){\ mathrm 77.759美元 {\ \% }}( { \ {\ mathrm 8.904点 {\ \% }}} )$ 8.522美元和8.522 cc(±11.469 cc) {\ mathrm {\ cc \}} ({\ pm 11.469 {\ mathrm {\ cc }}} )$ , 78.568%(±9.094%){\ mathrm 78.568美元 {\ \% }}( { \ {\ mathrm 9.094点 {\ \% }}} )$ 34.298美元和34.298 cc(±36.297 cc) \ {\ mathrm {cc}} \ ({\ pm 36.297 {\ mathrm {\ cc}}})美元cc临床计划。结论：所建立的RL框架能有效生成临床可接受的乳房ECOMP方案。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Breast radiation therapy fluence painting with multi-agent deep reinforcement learning

Background

The electronic compensation (ECOMP) technique for breast radiation therapy provides excellent dose conformity and homogeneity. However, the manual fluence painting process presents a challenge for efficient clinical operation.

Purpose

To facilitate the clinical treatment planning automation of breast radiation therapy, we utilized reinforcement learning (RL) to develop an auto-planning tool that iteratively edits the fluence maps under the guidance of clinically relevant objectives.

Methods

With institutional review board (IRB) approval, 70 patients treated with 6MV tangential photon beams with ECOMP technique were retrospectively collected and included in this study (20/50 for training/testing). Each pixel in the fluence map was assigned a reinforcement learning agent to perform independent action. Beam-eye-view projected dose profiles were generated to form state information as the input of the RL network. By predicting the Q value, pixel-wise actions were selected to modify specific pixel value in the fluence maps to improve overall plan quality. After dose calculation, reward signal calculated from the variation of target coverage and dose homogeneity was fed back to the RL framework and used to update network parameters. The RL generated plans were evaluated with dose distribution and dosimetric endpoints (i.e., Breast PTV V90%, Breast PTV V95%, Breast PTV V105%, Lung V20 Gy, Heart V5 Gy, Dmax) and compared with clinical plans.

Results

The RL agent took around 90 s to generate a ECOMP treatment plan. The RL plans exhibited plan quality comparable to clinical plans in terms of isodose distribution and dosimetric endpoints. The mean Breast PTV V95%, Breast PTV V105% of RL plans are $77.759 % (\pm 8.904 %)$ and $8.522 cc (\pm 11.469 cc)$ , compared to $78.568 % (\pm 9.094 %)$ and $34.298 cc (\pm 36.297 cc)$ cc of clinical plans.

Conclusion

The developed RL framework efficiently generates breast ECOMP plans with clinical acceptable plan quality.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Medical physics 医学-核医学

CiteScore

6.80

自引率

15.80%

发文量

660

审稿时长

1.7 months

期刊介绍： Medical Physics publishes original, high impact physics, imaging science, and engineering research that advances patient diagnosis and therapy through contributions in 1) Basic science developments with high potential for clinical translation 2) Clinical applications of cutting edge engineering and physics innovations 3) Broadly applicable and innovative clinical physics developments Medical Physics is a journal of global scope and reach. By publishing in Medical Physics your research will reach an international, multidisciplinary audience including practicing medical physicists as well as physics- and engineering based translational scientists. We work closely with authors of promising articles to improve their quality.