Peng Xue , Jingyang Zhang , Lei Ma , Yixuan Li , Huizhong Ji , Tonglong Ren , Zhanming Hu , Meirong Ren , Zhili Zhang , Enqing Dong
{"title":"Semi-supervised temporal attention network for lung 4D CT ventilation estimation","authors":"Peng Xue , Jingyang Zhang , Lei Ma , Yixuan Li , Huizhong Ji , Tonglong Ren , Zhanming Hu , Meirong Ren , Zhili Zhang , Enqing Dong","doi":"10.1016/j.compmedimag.2025.102551","DOIUrl":null,"url":null,"abstract":"<div><div>Computed tomography (CT)-derived ventilation estimation, also known as CT ventilation imaging (CTVI), is emerging as a potentially crucial tool for designing functional avoidance radiotherapy treatment plans and evaluating therapy responses. However, most conventional CTVI methods are highly dependent on deformation fields from image registration to track volume variations, making them susceptible to registration errors and resulting in low estimation accuracy. In addition, existing deep learning-based CTVI methods typically have the issue of requiring a large amount of labeled data and cannot fully utilize temporal characteristics of 4D CT images. To address these issues, we propose a semi-supervised temporal attention (S<sup>2</sup>TA) network for lung 4D CT ventilation estimation. Specifically, the semi-supervised learning framework involves a teacher model for generating pseudo-labels from unlabeled 4D CT images, to train a student model that takes both labeled and unlabeled 4D CT images as input. The teacher model is updated as the moving average of the instantly trained student, to prevent it from being abruptly impacted by incorrect pseudo-labels. Furthermore, to fully exploit the temporal information of 4D CT images, a temporal attention architecture is designed to effectively capture the temporal relationships across multiple phases in 4D CT image sequence. Extensive experiments on three publicly available thoracic 4D CT datasets show that our proposed method can achieve higher estimation accuracy than state-of-the-art methods, which could potentially be used for lung functional avoidance radiotherapy and treatment response modeling.</div></div>","PeriodicalId":50631,"journal":{"name":"Computerized Medical Imaging and Graphics","volume":"123 ","pages":"Article 102551"},"PeriodicalIF":5.4000,"publicationDate":"2025-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computerized Medical Imaging and Graphics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0895611125000606","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Computed tomography (CT)-derived ventilation estimation, also known as CT ventilation imaging (CTVI), is emerging as a potentially crucial tool for designing functional avoidance radiotherapy treatment plans and evaluating therapy responses. However, most conventional CTVI methods are highly dependent on deformation fields from image registration to track volume variations, making them susceptible to registration errors and resulting in low estimation accuracy. In addition, existing deep learning-based CTVI methods typically have the issue of requiring a large amount of labeled data and cannot fully utilize temporal characteristics of 4D CT images. To address these issues, we propose a semi-supervised temporal attention (S2TA) network for lung 4D CT ventilation estimation. Specifically, the semi-supervised learning framework involves a teacher model for generating pseudo-labels from unlabeled 4D CT images, to train a student model that takes both labeled and unlabeled 4D CT images as input. The teacher model is updated as the moving average of the instantly trained student, to prevent it from being abruptly impacted by incorrect pseudo-labels. Furthermore, to fully exploit the temporal information of 4D CT images, a temporal attention architecture is designed to effectively capture the temporal relationships across multiple phases in 4D CT image sequence. Extensive experiments on three publicly available thoracic 4D CT datasets show that our proposed method can achieve higher estimation accuracy than state-of-the-art methods, which could potentially be used for lung functional avoidance radiotherapy and treatment response modeling.
期刊介绍:
The purpose of the journal Computerized Medical Imaging and Graphics is to act as a source for the exchange of research results concerning algorithmic advances, development, and application of digital imaging in disease detection, diagnosis, intervention, prevention, precision medicine, and population health. Included in the journal will be articles on novel computerized imaging or visualization techniques, including artificial intelligence and machine learning, augmented reality for surgical planning and guidance, big biomedical data visualization, computer-aided diagnosis, computerized-robotic surgery, image-guided therapy, imaging scanning and reconstruction, mobile and tele-imaging, radiomics, and imaging integration and modeling with other information relevant to digital health. The types of biomedical imaging include: magnetic resonance, computed tomography, ultrasound, nuclear medicine, X-ray, microwave, optical and multi-photon microscopy, video and sensory imaging, and the convergence of biomedical images with other non-imaging datasets.