Dual convolution-transformer UNet (DCT-UNet) for organs at risk and clinical target volume segmentation in MRI for cervical cancer brachytherapy.

IF 3.3 3区医学 Q2 ENGINEERING, BIOMEDICAL

Physics in medicine and biology Pub Date : 2024-10-18 DOI:10.1088/1361-6560/ad84b2

Gayoung Kim, Akila N Viswanathan, Rohini Bhatia, Yosef Landman, Michael Roumeliotis, Beth Erickson, Ehud J Schmidt, Junghoon Lee

{"title":"Dual convolution-transformer UNet (DCT-UNet) for organs at risk and clinical target volume segmentation in MRI for cervical cancer brachytherapy.","authors":"Gayoung Kim, Akila N Viswanathan, Rohini Bhatia, Yosef Landman, Michael Roumeliotis, Beth Erickson, Ehud J Schmidt, Junghoon Lee","doi":"10.1088/1361-6560/ad84b2","DOIUrl":null,"url":null,"abstract":"Objective. MRI is the standard imaging modality for high-dose-rate brachytherapy of cervical cancer. Precise contouring of organs at risk (OARs) and high-risk clinical target volume (HR-CTV) from MRI is a crucial step for radiotherapy planning and treatment. However, conventional manual contouring has limitations in terms of accuracy as well as procedural time. To overcome these, we propose a deep learning approach to automatically segment OARs (bladder, rectum, and sigmoid colon) and HR-CTV from female pelvic MRI.Approach. In the proposed pipeline, a coarse multi-organ segmentation model first segments all structures, from which a region of interest is computed for each structure. Then, each organ is segmented using an organ-specific fine segmentation model separately trained for each organ. To account for variable sizes of HR-CTV, a size-adaptive multi-model approach was employed. For coarse and fine segmentations, we designed a dual convolution-transformer UNet (DCT-UNet) which uses dual-path encoder consisting of convolution and transformer blocks. To evaluate our model, OAR segmentations were compared to the clinical contours drawn by the attending radiation oncologist. For HR-CTV, four sets of contours (clinical + three additional sets) were obtained to produce a consensus ground truth as well as for inter/intra-observer variability analysis.Main results. DCT-UNet achieved dice similarity coefficient (mean ± SD) of 0.932 ± 0.032 (bladder), 0.786 ± 0.090 (rectum), 0.663 ± 0.180 (sigmoid colon), and 0.741 ± 0.076 (HR-CTV), outperforming other state-of-the-art models. Notably, the size-adaptive multi-model significantly improved HR-CTV segmentation compared to a single-model. Furthermore, significant inter/intra-observer variability was observed, and our model showed comparable performance to all observers. Computation time for the entire pipeline per subject was 12.59 ± 0.79 s, which is significantly shorter than the typical manual contouring time of >15 min.Significance. These experimental results demonstrate that our model has great utility in cervical cancer brachytherapy by enabling fast and accurate automatic segmentation, and has potential in improving consistency in contouring. DCT-UNet source code is available athttps://github.com/JHU-MICA/DCT-UNet.","PeriodicalId":20185,"journal":{"name":"Physics in medicine and biology","volume":" ","pages":""},"PeriodicalIF":3.3000,"publicationDate":"2024-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Physics in medicine and biology","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1088/1361-6560/ad84b2","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}

引用次数: 0

Abstract

Objective. MRI is the standard imaging modality for high-dose-rate brachytherapy of cervical cancer. Precise contouring of organs at risk (OARs) and high-risk clinical target volume (HR-CTV) from MRI is a crucial step for radiotherapy planning and treatment. However, conventional manual contouring has limitations in terms of accuracy as well as procedural time. To overcome these, we propose a deep learning approach to automatically segment OARs (bladder, rectum, and sigmoid colon) and HR-CTV from female pelvic MRI.Approach. In the proposed pipeline, a coarse multi-organ segmentation model first segments all structures, from which a region of interest is computed for each structure. Then, each organ is segmented using an organ-specific fine segmentation model separately trained for each organ. To account for variable sizes of HR-CTV, a size-adaptive multi-model approach was employed. For coarse and fine segmentations, we designed a dual convolution-transformer UNet (DCT-UNet) which uses dual-path encoder consisting of convolution and transformer blocks. To evaluate our model, OAR segmentations were compared to the clinical contours drawn by the attending radiation oncologist. For HR-CTV, four sets of contours (clinical + three additional sets) were obtained to produce a consensus ground truth as well as for inter/intra-observer variability analysis.Main results. DCT-UNet achieved dice similarity coefficient (mean ± SD) of 0.932 ± 0.032 (bladder), 0.786 ± 0.090 (rectum), 0.663 ± 0.180 (sigmoid colon), and 0.741 ± 0.076 (HR-CTV), outperforming other state-of-the-art models. Notably, the size-adaptive multi-model significantly improved HR-CTV segmentation compared to a single-model. Furthermore, significant inter/intra-observer variability was observed, and our model showed comparable performance to all observers. Computation time for the entire pipeline per subject was 12.59 ± 0.79 s, which is significantly shorter than the typical manual contouring time of >15 min.Significance. These experimental results demonstrate that our model has great utility in cervical cancer brachytherapy by enabling fast and accurate automatic segmentation, and has potential in improving consistency in contouring. DCT-UNet source code is available athttps://github.com/JHU-MICA/DCT-UNet.

查看原文本刊更多论文

用于宫颈癌近距离放射治疗磁共振成像中危险器官和临床目标体积分割的双卷积-变换器 UNet (DCT-UNet)。

目的：核磁共振成像是宫颈癌高剂量率近距离放射治疗的标准成像方式。通过核磁共振成像精确勾画高危器官（OAR）和高危临床靶体积（HR-CTV）是放疗计划和治疗的关键步骤。然而，传统的手动轮廓绘制在准确性和程序时间方面存在局限性。为了克服这些问题，我们提出了一种深度学习方法，用于自动分割女性盆腔 MRI 中的 OAR（膀胱、直肠和乙状结肠）和 HR-CTV：在所提出的管道中，一个粗略的多器官分割模型首先分割所有结构，然后计算每个结构的感兴趣区。然后，使用针对每个器官单独训练的特定器官精细分割模型对每个器官进行分割。为了考虑 HR-CTV 的不同尺寸，我们采用了尺寸自适应多模型方法。对于粗细分割，我们设计了一个双卷积-变换器 UNet（DCT-UNet），它使用由卷积块和变换器块组成的双路径编码器。为了评估我们的模型，我们将 OAR 分割结果与放射肿瘤主治医生绘制的临床轮廓进行了比较。对于 HR-CTV，我们获得了四组轮廓（临床轮廓+三组附加轮廓），以生成一致的基本真相，并进行观察者间/观察者内的差异分析：主要结果：DCT-UNet 的骰子相似系数（平均值±SD）分别为 0.932±0.032（膀胱）、0.786±0.090（直肠）、0.663±0.180（乙状结肠）和 0.741±0.076（HR-CTV），优于其他先进模型。值得注意的是，与单一模型相比，尺寸自适应多模型明显改善了 HR-CTV 的分割。此外，还观察到观察者之间/观察者内部存在明显的差异，而我们的模型与所有观察者的表现相当。每个受试者整个管道的计算时间为（12.59±0.79）秒，明显短于一般的人工轮廓绘制时间（>15 分钟）：这些实验结果表明，我们的模型可以实现快速、准确的自动分割，在宫颈癌近距离治疗中大有用武之地，并有望提高轮廓绘制的一致性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Physics in medicine and biology 医学-工程：生物医学

CiteScore

6.50

自引率

14.30%

发文量

409

审稿时长

2 months

期刊介绍： The development and application of theoretical, computational and experimental physics to medicine, physiology and biology. Topics covered are: therapy physics (including ionizing and non-ionizing radiation); biomedical imaging (e.g. x-ray, magnetic resonance, ultrasound, optical and nuclear imaging); image-guided interventions; image reconstruction and analysis (including kinetic modelling); artificial intelligence in biomedical physics and analysis; nanoparticles in imaging and therapy; radiobiology; radiation protection and patient dose monitoring; radiation dosimetry