Yang Dongrong, Li Xinyi, Yoo Sua, Blitzblau Rachel, McDuff Susan, Stephens Sarah, Segars Paul, Wu Q Jackie, Sheng Yang
{"title":"基于多智能体深度强化学习的乳房放射治疗影响绘画。","authors":"Yang Dongrong, Li Xinyi, Yoo Sua, Blitzblau Rachel, McDuff Susan, Stephens Sarah, Segars Paul, Wu Q Jackie, Sheng Yang","doi":"10.1002/mp.17615","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Background</h3>\n \n <p>The electronic compensation (ECOMP) technique for breast radiation therapy provides excellent dose conformity and homogeneity. However, the manual fluence painting process presents a challenge for efficient clinical operation.</p>\n </section>\n \n <section>\n \n <h3> Purpose</h3>\n \n <p>To facilitate the clinical treatment planning automation of breast radiation therapy, we utilized reinforcement learning (RL) to develop an auto-planning tool that iteratively edits the fluence maps under the guidance of clinically relevant objectives.</p>\n </section>\n \n <section>\n \n <h3> Methods</h3>\n \n <p>With institutional review board (IRB) approval, 70 patients treated with 6MV tangential photon beams with ECOMP technique were retrospectively collected and included in this study (20/50 for training/testing). Each pixel in the fluence map was assigned a reinforcement learning agent to perform independent action. Beam-eye-view projected dose profiles were generated to form state information as the input of the RL network. By predicting the Q value, pixel-wise actions were selected to modify specific pixel value in the fluence maps to improve overall plan quality. After dose calculation, reward signal calculated from the variation of target coverage and dose homogeneity was fed back to the RL framework and used to update network parameters. The RL generated plans were evaluated with dose distribution and dosimetric endpoints (i.e., Breast PTV V90%, Breast PTV V95%, Breast PTV V105%, Lung V20 Gy, Heart V5 Gy, Dmax) and compared with clinical plans.</p>\n </section>\n \n <section>\n \n <h3> Results</h3>\n \n <p>The RL agent took around 90 s to generate a ECOMP treatment plan. The RL plans exhibited plan quality comparable to clinical plans in terms of isodose distribution and dosimetric endpoints. The mean Breast PTV V95%, Breast PTV V105% of RL plans are <span></span><math>\n <semantics>\n <mrow>\n <mn>77.759</mn>\n <mrow>\n <mspace></mspace>\n <mo>%</mo>\n </mrow>\n <mo>(</mo>\n <mrow>\n <mo>±</mo>\n <mn>8.904</mn>\n <mrow>\n <mspace></mspace>\n <mo>%</mo>\n </mrow>\n </mrow>\n <mo>)</mo>\n </mrow>\n <annotation>$77.759{\\mathrm{\\ \\% }}( { \\pm 8.904{\\mathrm{\\ \\% }}} )$</annotation>\n </semantics></math> and <span></span><math>\n <semantics>\n <mrow>\n <mn>8.522</mn>\n <mrow>\n <mspace></mspace>\n <mi>cc</mi>\n <mspace></mspace>\n </mrow>\n <mo>(</mo>\n <mrow>\n <mo>±</mo>\n <mn>11.469</mn>\n <mrow>\n <mspace></mspace>\n <mi>cc</mi>\n </mrow>\n </mrow>\n <mo>)</mo>\n </mrow>\n <annotation>$8.522{\\mathrm{\\ cc\\ }}( { \\pm 11.469{\\mathrm{\\ cc}}} )$</annotation>\n </semantics></math>, compared to <span></span><math>\n <semantics>\n <mrow>\n <mn>78.568</mn>\n <mrow>\n <mspace></mspace>\n <mo>%</mo>\n </mrow>\n <mo>(</mo>\n <mrow>\n <mo>±</mo>\n <mn>9.094</mn>\n <mrow>\n <mspace></mspace>\n <mo>%</mo>\n </mrow>\n </mrow>\n <mo>)</mo>\n </mrow>\n <annotation>$78.568{\\mathrm{\\ \\% }}( { \\pm 9.094{\\mathrm{\\ \\% }}} )$</annotation>\n </semantics></math> and <span></span><math>\n <semantics>\n <mrow>\n <mn>34.298</mn>\n <mspace></mspace>\n <mi>cc</mi>\n <mspace></mspace>\n <mo>(</mo>\n <mrow>\n <mo>±</mo>\n <mn>36.297</mn>\n <mrow>\n <mspace></mspace>\n <mi>cc</mi>\n </mrow>\n </mrow>\n <mo>)</mo>\n </mrow>\n <annotation>$34.298\\ {\\mathrm{cc}}\\ ( { \\pm 36.297{\\mathrm{\\ cc}}} )$</annotation>\n </semantics></math> cc of clinical plans.</p>\n </section>\n \n <section>\n \n <h3> Conclusion</h3>\n \n <p>The developed RL framework efficiently generates breast ECOMP plans with clinical acceptable plan quality.</p>\n </section>\n </div>","PeriodicalId":18384,"journal":{"name":"Medical physics","volume":"52 4","pages":"2015-2024"},"PeriodicalIF":3.2000,"publicationDate":"2025-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Breast radiation therapy fluence painting with multi-agent deep reinforcement learning\",\"authors\":\"Yang Dongrong, Li Xinyi, Yoo Sua, Blitzblau Rachel, McDuff Susan, Stephens Sarah, Segars Paul, Wu Q Jackie, Sheng Yang\",\"doi\":\"10.1002/mp.17615\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n \\n <section>\\n \\n <h3> Background</h3>\\n \\n <p>The electronic compensation (ECOMP) technique for breast radiation therapy provides excellent dose conformity and homogeneity. However, the manual fluence painting process presents a challenge for efficient clinical operation.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Purpose</h3>\\n \\n <p>To facilitate the clinical treatment planning automation of breast radiation therapy, we utilized reinforcement learning (RL) to develop an auto-planning tool that iteratively edits the fluence maps under the guidance of clinically relevant objectives.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Methods</h3>\\n \\n <p>With institutional review board (IRB) approval, 70 patients treated with 6MV tangential photon beams with ECOMP technique were retrospectively collected and included in this study (20/50 for training/testing). Each pixel in the fluence map was assigned a reinforcement learning agent to perform independent action. Beam-eye-view projected dose profiles were generated to form state information as the input of the RL network. By predicting the Q value, pixel-wise actions were selected to modify specific pixel value in the fluence maps to improve overall plan quality. After dose calculation, reward signal calculated from the variation of target coverage and dose homogeneity was fed back to the RL framework and used to update network parameters. The RL generated plans were evaluated with dose distribution and dosimetric endpoints (i.e., Breast PTV V90%, Breast PTV V95%, Breast PTV V105%, Lung V20 Gy, Heart V5 Gy, Dmax) and compared with clinical plans.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Results</h3>\\n \\n <p>The RL agent took around 90 s to generate a ECOMP treatment plan. The RL plans exhibited plan quality comparable to clinical plans in terms of isodose distribution and dosimetric endpoints. The mean Breast PTV V95%, Breast PTV V105% of RL plans are <span></span><math>\\n <semantics>\\n <mrow>\\n <mn>77.759</mn>\\n <mrow>\\n <mspace></mspace>\\n <mo>%</mo>\\n </mrow>\\n <mo>(</mo>\\n <mrow>\\n <mo>±</mo>\\n <mn>8.904</mn>\\n <mrow>\\n <mspace></mspace>\\n <mo>%</mo>\\n </mrow>\\n </mrow>\\n <mo>)</mo>\\n </mrow>\\n <annotation>$77.759{\\\\mathrm{\\\\ \\\\% }}( { \\\\pm 8.904{\\\\mathrm{\\\\ \\\\% }}} )$</annotation>\\n </semantics></math> and <span></span><math>\\n <semantics>\\n <mrow>\\n <mn>8.522</mn>\\n <mrow>\\n <mspace></mspace>\\n <mi>cc</mi>\\n <mspace></mspace>\\n </mrow>\\n <mo>(</mo>\\n <mrow>\\n <mo>±</mo>\\n <mn>11.469</mn>\\n <mrow>\\n <mspace></mspace>\\n <mi>cc</mi>\\n </mrow>\\n </mrow>\\n <mo>)</mo>\\n </mrow>\\n <annotation>$8.522{\\\\mathrm{\\\\ cc\\\\ }}( { \\\\pm 11.469{\\\\mathrm{\\\\ cc}}} )$</annotation>\\n </semantics></math>, compared to <span></span><math>\\n <semantics>\\n <mrow>\\n <mn>78.568</mn>\\n <mrow>\\n <mspace></mspace>\\n <mo>%</mo>\\n </mrow>\\n <mo>(</mo>\\n <mrow>\\n <mo>±</mo>\\n <mn>9.094</mn>\\n <mrow>\\n <mspace></mspace>\\n <mo>%</mo>\\n </mrow>\\n </mrow>\\n <mo>)</mo>\\n </mrow>\\n <annotation>$78.568{\\\\mathrm{\\\\ \\\\% }}( { \\\\pm 9.094{\\\\mathrm{\\\\ \\\\% }}} )$</annotation>\\n </semantics></math> and <span></span><math>\\n <semantics>\\n <mrow>\\n <mn>34.298</mn>\\n <mspace></mspace>\\n <mi>cc</mi>\\n <mspace></mspace>\\n <mo>(</mo>\\n <mrow>\\n <mo>±</mo>\\n <mn>36.297</mn>\\n <mrow>\\n <mspace></mspace>\\n <mi>cc</mi>\\n </mrow>\\n </mrow>\\n <mo>)</mo>\\n </mrow>\\n <annotation>$34.298\\\\ {\\\\mathrm{cc}}\\\\ ( { \\\\pm 36.297{\\\\mathrm{\\\\ cc}}} )$</annotation>\\n </semantics></math> cc of clinical plans.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Conclusion</h3>\\n \\n <p>The developed RL framework efficiently generates breast ECOMP plans with clinical acceptable plan quality.</p>\\n </section>\\n </div>\",\"PeriodicalId\":18384,\"journal\":{\"name\":\"Medical physics\",\"volume\":\"52 4\",\"pages\":\"2015-2024\"},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2025-01-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Medical physics\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/mp.17615\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical physics","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/mp.17615","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
Breast radiation therapy fluence painting with multi-agent deep reinforcement learning
Background
The electronic compensation (ECOMP) technique for breast radiation therapy provides excellent dose conformity and homogeneity. However, the manual fluence painting process presents a challenge for efficient clinical operation.
Purpose
To facilitate the clinical treatment planning automation of breast radiation therapy, we utilized reinforcement learning (RL) to develop an auto-planning tool that iteratively edits the fluence maps under the guidance of clinically relevant objectives.
Methods
With institutional review board (IRB) approval, 70 patients treated with 6MV tangential photon beams with ECOMP technique were retrospectively collected and included in this study (20/50 for training/testing). Each pixel in the fluence map was assigned a reinforcement learning agent to perform independent action. Beam-eye-view projected dose profiles were generated to form state information as the input of the RL network. By predicting the Q value, pixel-wise actions were selected to modify specific pixel value in the fluence maps to improve overall plan quality. After dose calculation, reward signal calculated from the variation of target coverage and dose homogeneity was fed back to the RL framework and used to update network parameters. The RL generated plans were evaluated with dose distribution and dosimetric endpoints (i.e., Breast PTV V90%, Breast PTV V95%, Breast PTV V105%, Lung V20 Gy, Heart V5 Gy, Dmax) and compared with clinical plans.
Results
The RL agent took around 90 s to generate a ECOMP treatment plan. The RL plans exhibited plan quality comparable to clinical plans in terms of isodose distribution and dosimetric endpoints. The mean Breast PTV V95%, Breast PTV V105% of RL plans are and , compared to and cc of clinical plans.
Conclusion
The developed RL framework efficiently generates breast ECOMP plans with clinical acceptable plan quality.
期刊介绍:
Medical Physics publishes original, high impact physics, imaging science, and engineering research that advances patient diagnosis and therapy through contributions in 1) Basic science developments with high potential for clinical translation 2) Clinical applications of cutting edge engineering and physics innovations 3) Broadly applicable and innovative clinical physics developments
Medical Physics is a journal of global scope and reach. By publishing in Medical Physics your research will reach an international, multidisciplinary audience including practicing medical physicists as well as physics- and engineering based translational scientists. We work closely with authors of promising articles to improve their quality.