3D lymphoma segmentation on PET/CT images via multi-scale information fusion with cross-attention

IF 3.2 2区医学 Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING

Medical physics Pub Date : 2025-03-20 DOI:10.1002/mp.17763

Huan Huang, Liheng Qiu, Shenmiao Yang, Longxi Li, Jiaofen Nan, Yanting Li, Chuang Han, Fubao Zhu, Chen Zhao, Weihua Zhou

{"title":"3D lymphoma segmentation on PET/CT images via multi-scale information fusion with cross-attention","authors":"Huan Huang, Liheng Qiu, Shenmiao Yang, Longxi Li, Jiaofen Nan, Yanting Li, Chuang Han, Fubao Zhu, Chen Zhao, Weihua Zhou","doi":"10.1002/mp.17763","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Background</h3>\n \n <p>Accurate segmentation of diffuse large B-cell lymphoma (DLBCL) lesions is challenging due to their complex patterns in medical imaging. Traditional methods often struggle to delineate these lesions accurately.</p>\n </section>\n \n <section>\n \n <h3> Objective</h3>\n \n <p>This study aims to develop a precise segmentation method for DLBCL using 18F-fluorodeoxyglucose (18F-FDG) positron emission tomography (PET) and computed tomography (CT) images.</p>\n </section>\n \n <section>\n \n <h3> Methods</h3>\n \n <p>We propose a 3D segmentation method based on an encoder-decoder architecture. The encoder incorporates a dual-branch design based on the shifted window transformer to extract features from both PET and CT modalities. To enhance feature integration, we introduce a multi-scale information fusion (MSIF) module that performs multi-scale feature fusion using cross-attention mechanisms with a shifted window framework. A gated neural network within the MSIF module dynamically adjusts feature weights to balance the contributions from each modality. The model is optimized using the dice similarity coefficient (DSC) loss function, minimizing discrepancies between the model prediction and ground truth. Additionally, we computed the total metabolic tumor volume (TMTV) and performed statistical analyses on the results.</p>\n </section>\n \n <section>\n \n <h3> Results</h3>\n \n <p>The model was trained and validated on a private dataset of 165 DLBCL patients and a publicly available dataset (autoPET) containing 145 PET/CT scans of lymphoma patients. Both datasets were analyzed using five-fold cross-validation. On the private dataset, our model achieved a DSC of 0.7512, sensitivity of 0.7548, precision of 0.7611, an average surface distance (ASD) of 3.61 mm, and a Hausdorff distance at the 95th percentile (HD95) of 15.25 mm. On the autoPET dataset, the model achieved a DSC of 0.7441, sensitivity of 0.7573, precision of 0.7427, ASD of 5.83 mm, and HD95 of 21.27 mm, outperforming state-of-the-art methods (<i>p</i> < 0.05, <i>t</i>-test). For TMTV quantification, Pearson correlation coefficients of 0.91 (private dataset) and 0.86 (autoPET) were observed, with <i>R</i><sup>2</sup> values of 0.89 and 0.75, respectively. Extensive ablation studies demonstrated the MSIF module's contribution to enhanced segmentation accuracy.</p>\n </section>\n \n <section>\n \n <h3> Conclusion</h3>\n \n <p>This study presents an effective automatic segmentation method for DLBCL that leverages the complementary strengths of PET and CT imaging. The method demonstrates robust performance on both private and publicly available datasets, ensuring its reliability and generalizability. Our method provides clinicians with more precise tumor delineation, which can improve the accuracy of diagnostic interpretations and assist in treatment planning for DLBCL patients. The code for the proposed method is available at https://github.com/chenzhao2023/lymphoma_seg.</p>\n </section>\n </div>","PeriodicalId":18384,"journal":{"name":"Medical physics","volume":"52 6","pages":"4371-4389"},"PeriodicalIF":3.2000,"publicationDate":"2025-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical physics","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/mp.17763","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}

引用次数: 0

Abstract

Background

Accurate segmentation of diffuse large B-cell lymphoma (DLBCL) lesions is challenging due to their complex patterns in medical imaging. Traditional methods often struggle to delineate these lesions accurately.

Objective

This study aims to develop a precise segmentation method for DLBCL using 18F-fluorodeoxyglucose (18F-FDG) positron emission tomography (PET) and computed tomography (CT) images.

Methods

We propose a 3D segmentation method based on an encoder-decoder architecture. The encoder incorporates a dual-branch design based on the shifted window transformer to extract features from both PET and CT modalities. To enhance feature integration, we introduce a multi-scale information fusion (MSIF) module that performs multi-scale feature fusion using cross-attention mechanisms with a shifted window framework. A gated neural network within the MSIF module dynamically adjusts feature weights to balance the contributions from each modality. The model is optimized using the dice similarity coefficient (DSC) loss function, minimizing discrepancies between the model prediction and ground truth. Additionally, we computed the total metabolic tumor volume (TMTV) and performed statistical analyses on the results.

Results

The model was trained and validated on a private dataset of 165 DLBCL patients and a publicly available dataset (autoPET) containing 145 PET/CT scans of lymphoma patients. Both datasets were analyzed using five-fold cross-validation. On the private dataset, our model achieved a DSC of 0.7512, sensitivity of 0.7548, precision of 0.7611, an average surface distance (ASD) of 3.61 mm, and a Hausdorff distance at the 95th percentile (HD95) of 15.25 mm. On the autoPET dataset, the model achieved a DSC of 0.7441, sensitivity of 0.7573, precision of 0.7427, ASD of 5.83 mm, and HD95 of 21.27 mm, outperforming state-of-the-art methods (p < 0.05, t-test). For TMTV quantification, Pearson correlation coefficients of 0.91 (private dataset) and 0.86 (autoPET) were observed, with R² values of 0.89 and 0.75, respectively. Extensive ablation studies demonstrated the MSIF module's contribution to enhanced segmentation accuracy.

Conclusion

This study presents an effective automatic segmentation method for DLBCL that leverages the complementary strengths of PET and CT imaging. The method demonstrates robust performance on both private and publicly available datasets, ensuring its reliability and generalizability. Our method provides clinicians with more precise tumor delineation, which can improve the accuracy of diagnostic interpretations and assist in treatment planning for DLBCL patients. The code for the proposed method is available at https://github.com/chenzhao2023/lymphoma_seg.

查看原文本刊更多论文

基于交叉关注的多尺度信息融合PET/CT图像三维淋巴瘤分割。

背景：弥漫性大b细胞淋巴瘤（DLBCL）病变的准确分割是具有挑战性的，因为它们在医学成像中具有复杂的模式。传统的方法往往难以准确地描绘这些病变。目的：利用18f -氟脱氧葡萄糖（18F-FDG）正电子发射断层扫描（PET）和计算机断层扫描（CT）图像建立DLBCL的精确分割方法。方法：提出了一种基于编码器-解码器结构的三维分割方法。编码器结合了基于移位窗口变压器的双支路设计，以提取PET和CT模式的特征。为了增强特征集成，我们引入了一个多尺度信息融合（MSIF）模块，该模块使用跨注意机制和移位窗口框架进行多尺度特征融合。MSIF模块内的门控神经网络动态调整特征权重以平衡每个模态的贡献。使用骰子相似系数（DSC）损失函数对模型进行优化，使模型预测与实际情况之间的差异最小化。此外，我们计算了总代谢肿瘤体积（TMTV），并对结果进行了统计分析。结果：该模型在包含165名DLBCL患者的私人数据集和包含145名淋巴瘤患者PET/CT扫描的公开数据集（autoPET）上进行了训练和验证。两个数据集采用五重交叉验证进行分析。在私有数据集上，模型的DSC为0.7512，灵敏度为0.7548，精度为0.7611，平均表面距离（ASD）为3.61 mm，第95百分位（HD95）的豪斯多夫距离（Hausdorff distance）为15.25 mm。在autoPET数据集上，该模型的DSC为0.7441，灵敏度为0.7573，精度为0.7427，ASD为5.83 mm， HD95为21.27 mm，优于现有方法（p 2值分别为0.89和0.75）。广泛的消融研究证明了MSIF模块对提高分割精度的贡献。结论：本研究提出了一种有效的DLBCL自动分割方法，利用PET和CT成像的互补优势。该方法在私有数据集和公共数据集上都表现出鲁棒性，确保了其可靠性和泛化性。我们的方法为临床医生提供了更精确的肿瘤描绘，这可以提高诊断解释的准确性，并有助于DLBCL患者的治疗计划。所建议的方法的代码可在https://github.com/chenzhao2023/lymphoma_seg上获得。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Medical physics 医学-核医学

CiteScore

6.80

自引率

15.80%

发文量

660

审稿时长

1.7 months

期刊介绍： Medical Physics publishes original, high impact physics, imaging science, and engineering research that advances patient diagnosis and therapy through contributions in 1) Basic science developments with high potential for clinical translation 2) Clinical applications of cutting edge engineering and physics innovations 3) Broadly applicable and innovative clinical physics developments Medical Physics is a journal of global scope and reach. By publishing in Medical Physics your research will reach an international, multidisciplinary audience including practicing medical physicists as well as physics- and engineering based translational scientists. We work closely with authors of promising articles to improve their quality.