Rahimeh Rouhi , Stéphane Niyoteka , Alexandre Carré , Samir Achkar , Pierre-Antoine Laurent , Mouhamadou Bachir Ba , Cristina Veres , Théophraste Henry , Maria Vakalopoulou , Roger Sun , Sophie Espenel , Linda Mrissa , Adrien Laville , Cyrus Chargari , Eric Deutsch , Charlotte Robert
{"title":"Automatic gross tumor volume segmentation with failure detection for safe implementation in locally advanced cervical cancer","authors":"Rahimeh Rouhi , Stéphane Niyoteka , Alexandre Carré , Samir Achkar , Pierre-Antoine Laurent , Mouhamadou Bachir Ba , Cristina Veres , Théophraste Henry , Maria Vakalopoulou , Roger Sun , Sophie Espenel , Linda Mrissa , Adrien Laville , Cyrus Chargari , Eric Deutsch , Charlotte Robert","doi":"10.1016/j.phro.2024.100578","DOIUrl":null,"url":null,"abstract":"<div><h3>Background and Purpose</h3><p>Automatic segmentation methods have greatly changed the RadioTherapy (RT) workflow, but still need to be extended to target volumes. In this paper, Deep Learning (DL) models were compared for Gross Tumor Volume (GTV) segmentation in locally advanced cervical cancer, and a novel investigation into failure detection was introduced by utilizing radiomic features.</p></div><div><h3>Methods and materials</h3><p>We trained eight DL models (UNet, VNet, SegResNet, SegResNetVAE) for 2D and 3D segmentation. Ensembling individually trained models during cross-validation generated the final segmentation. To detect failures, binary classifiers were trained using radiomic features extracted from segmented GTVs as inputs, aiming to classify contours based on whether their Dice Similarity Coefficient <span><math><mrow><mo>(</mo><mi>DSC</mi><mo>)</mo><mo><</mo><mi>T</mi></mrow></math></span> and <span><math><mrow><mi>DSC</mi><mo>⩾</mo><mi>T</mi></mrow></math></span>. Two distinct cohorts of T2-Weighted (T2W) pre-RT MR images captured in 2D sequences were used: one retrospective cohort consisting of 115 LACC patients from 30 scanners, and the other prospective cohort, comprising 51 patients from 7 scanners, used for testing.</p></div><div><h3>Results</h3><p>Segmentation by 2D-SegResNet achieved the best DSC, Surface DSC (<span><math><mrow><msub><mrow><mi>SDSC</mi></mrow><mrow><mn>3</mn><mi>mm</mi></mrow></msub></mrow></math></span>), and 95th Hausdorff Distance (95HD): DSC = 0.72 ± 0.16, <span><math><mrow><msub><mrow><mi>SDSC</mi></mrow><mrow><mn>3</mn><mi>mm</mi></mrow></msub></mrow></math></span>=0.66 ± 0.17, and 95HD = 14.6 ± 9.0 mm without missing segmentation (<span><math><mrow><mi>M</mi></mrow></math></span>=0) on the test cohort. Failure detection could generate precision (<span><math><mrow><mi>P</mi><mo>=</mo><mn>0.88</mn></mrow></math></span>), recall (<span><math><mrow><mi>R</mi><mo>=</mo><mn>0.75</mn></mrow></math></span>), F1-score (<span><math><mrow><mi>F</mi><mo>=</mo><mn>0.81</mn></mrow></math></span>), and accuracy (<span><math><mrow><mi>A</mi><mo>=</mo><mn>0.86</mn></mrow></math></span>) using Logistic Regression (LR) classifier on the test cohort with a threshold T = 0.67 on DSC values.</p></div><div><h3>Conclusions</h3><p>Our study revealed that segmentation accuracy varies slightly among different DL methods, with 2D networks outperforming 3D networks in 2D MRI sequences. Doctors found the time-saving aspect advantageous. The proposed failure detection could guide doctors in sensitive cases.</p></div>","PeriodicalId":36850,"journal":{"name":"Physics and Imaging in Radiation Oncology","volume":null,"pages":null},"PeriodicalIF":3.4000,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2405631624000484/pdfft?md5=7b13fafc60565c0e7bc24b1dadfb8ba2&pid=1-s2.0-S2405631624000484-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Physics and Imaging in Radiation Oncology","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2405631624000484","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background and Purpose
Automatic segmentation methods have greatly changed the RadioTherapy (RT) workflow, but still need to be extended to target volumes. In this paper, Deep Learning (DL) models were compared for Gross Tumor Volume (GTV) segmentation in locally advanced cervical cancer, and a novel investigation into failure detection was introduced by utilizing radiomic features.
Methods and materials
We trained eight DL models (UNet, VNet, SegResNet, SegResNetVAE) for 2D and 3D segmentation. Ensembling individually trained models during cross-validation generated the final segmentation. To detect failures, binary classifiers were trained using radiomic features extracted from segmented GTVs as inputs, aiming to classify contours based on whether their Dice Similarity Coefficient and . Two distinct cohorts of T2-Weighted (T2W) pre-RT MR images captured in 2D sequences were used: one retrospective cohort consisting of 115 LACC patients from 30 scanners, and the other prospective cohort, comprising 51 patients from 7 scanners, used for testing.
Results
Segmentation by 2D-SegResNet achieved the best DSC, Surface DSC (), and 95th Hausdorff Distance (95HD): DSC = 0.72 ± 0.16, =0.66 ± 0.17, and 95HD = 14.6 ± 9.0 mm without missing segmentation (=0) on the test cohort. Failure detection could generate precision (), recall (), F1-score (), and accuracy () using Logistic Regression (LR) classifier on the test cohort with a threshold T = 0.67 on DSC values.
Conclusions
Our study revealed that segmentation accuracy varies slightly among different DL methods, with 2D networks outperforming 3D networks in 2D MRI sequences. Doctors found the time-saving aspect advantageous. The proposed failure detection could guide doctors in sensitive cases.