Semi-supervised temporal attention network for lung 4D CT ventilation estimation

IF 5.4 2区医学 Q1 ENGINEERING, BIOMEDICAL

Computerized Medical Imaging and Graphics Pub Date : 2025-04-10 DOI:10.1016/j.compmedimag.2025.102551

Peng Xue , Jingyang Zhang , Lei Ma , Yixuan Li , Huizhong Ji , Tonglong Ren , Zhanming Hu , Meirong Ren , Zhili Zhang , Enqing Dong

{"title":"Semi-supervised temporal attention network for lung 4D CT ventilation estimation","authors":"Peng Xue , Jingyang Zhang , Lei Ma , Yixuan Li , Huizhong Ji , Tonglong Ren , Zhanming Hu , Meirong Ren , Zhili Zhang , Enqing Dong","doi":"10.1016/j.compmedimag.2025.102551","DOIUrl":null,"url":null,"abstract":"<div><div>Computed tomography (CT)-derived ventilation estimation, also known as CT ventilation imaging (CTVI), is emerging as a potentially crucial tool for designing functional avoidance radiotherapy treatment plans and evaluating therapy responses. However, most conventional CTVI methods are highly dependent on deformation fields from image registration to track volume variations, making them susceptible to registration errors and resulting in low estimation accuracy. In addition, existing deep learning-based CTVI methods typically have the issue of requiring a large amount of labeled data and cannot fully utilize temporal characteristics of 4D CT images. To address these issues, we propose a semi-supervised temporal attention (S<sup>2</sup>TA) network for lung 4D CT ventilation estimation. Specifically, the semi-supervised learning framework involves a teacher model for generating pseudo-labels from unlabeled 4D CT images, to train a student model that takes both labeled and unlabeled 4D CT images as input. The teacher model is updated as the moving average of the instantly trained student, to prevent it from being abruptly impacted by incorrect pseudo-labels. Furthermore, to fully exploit the temporal information of 4D CT images, a temporal attention architecture is designed to effectively capture the temporal relationships across multiple phases in 4D CT image sequence. Extensive experiments on three publicly available thoracic 4D CT datasets show that our proposed method can achieve higher estimation accuracy than state-of-the-art methods, which could potentially be used for lung functional avoidance radiotherapy and treatment response modeling.</div></div>","PeriodicalId":50631,"journal":{"name":"Computerized Medical Imaging and Graphics","volume":"123 ","pages":"Article 102551"},"PeriodicalIF":5.4000,"publicationDate":"2025-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computerized Medical Imaging and Graphics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0895611125000606","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}

引用次数: 0

Abstract

Computed tomography (CT)-derived ventilation estimation, also known as CT ventilation imaging (CTVI), is emerging as a potentially crucial tool for designing functional avoidance radiotherapy treatment plans and evaluating therapy responses. However, most conventional CTVI methods are highly dependent on deformation fields from image registration to track volume variations, making them susceptible to registration errors and resulting in low estimation accuracy. In addition, existing deep learning-based CTVI methods typically have the issue of requiring a large amount of labeled data and cannot fully utilize temporal characteristics of 4D CT images. To address these issues, we propose a semi-supervised temporal attention (S²TA) network for lung 4D CT ventilation estimation. Specifically, the semi-supervised learning framework involves a teacher model for generating pseudo-labels from unlabeled 4D CT images, to train a student model that takes both labeled and unlabeled 4D CT images as input. The teacher model is updated as the moving average of the instantly trained student, to prevent it from being abruptly impacted by incorrect pseudo-labels. Furthermore, to fully exploit the temporal information of 4D CT images, a temporal attention architecture is designed to effectively capture the temporal relationships across multiple phases in 4D CT image sequence. Extensive experiments on three publicly available thoracic 4D CT datasets show that our proposed method can achieve higher estimation accuracy than state-of-the-art methods, which could potentially be used for lung functional avoidance radiotherapy and treatment response modeling.

查看原文本刊更多论文

肺4D CT通气估计的半监督时间注意网络

计算机断层扫描（CT）衍生的通气估计，也称为CT通气成像（CTVI），正在成为设计功能回避放疗治疗计划和评估治疗反应的潜在关键工具。然而，大多数传统的CTVI方法从图像配准到跟踪体积变化都高度依赖于变形场，容易出现配准误差，导致估计精度较低。此外，现有的基于深度学习的CTVI方法通常存在需要大量标记数据的问题，不能充分利用4D CT图像的时间特征。为了解决这些问题，我们提出了一种用于肺4D CT通气估计的半监督时间注意（S2TA）网络。具体来说，半监督学习框架包括一个教师模型，用于从未标记的4D CT图像中生成伪标签，以训练一个学生模型，该模型将标记和未标记的4D CT图像作为输入。教师模型被更新为瞬时训练的学生的移动平均，以防止它被不正确的伪标签突然影响。此外，为了充分利用4D CT图像的时间信息，设计了一种时间关注架构，有效捕捉4D CT图像序列中多个阶段的时间关系。在三个公开可用的胸部4D CT数据集上进行的大量实验表明，我们提出的方法比目前最先进的方法可以达到更高的估计精度，这可能用于肺功能回避放疗和治疗反应建模。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Computerized Medical Imaging and Graphics 医学-核医学

CiteScore

10.70

自引率

3.50%

发文量

审稿时长

26 days

期刊介绍： The purpose of the journal Computerized Medical Imaging and Graphics is to act as a source for the exchange of research results concerning algorithmic advances, development, and application of digital imaging in disease detection, diagnosis, intervention, prevention, precision medicine, and population health. Included in the journal will be articles on novel computerized imaging or visualization techniques, including artificial intelligence and machine learning, augmented reality for surgical planning and guidance, big biomedical data visualization, computer-aided diagnosis, computerized-robotic surgery, image-guided therapy, imaging scanning and reconstruction, mobile and tele-imaging, radiomics, and imaging integration and modeling with other information relevant to digital health. The types of biomedical imaging include: magnetic resonance, computed tomography, ultrasound, nuclear medicine, X-ray, microwave, optical and multi-photon microscopy, video and sensory imaging, and the convergence of biomedical images with other non-imaging datasets.