Guiping Qian , Huaqiong Wang , Shan Luo , Yiming Sun , Dingguo Yu , Xiaodiao Chen , Fan Zhang
{"title":"Encoder-shared visual state space network for anterior segment reconstruction","authors":"Guiping Qian , Huaqiong Wang , Shan Luo , Yiming Sun , Dingguo Yu , Xiaodiao Chen , Fan Zhang","doi":"10.1016/j.compmedimag.2025.102631","DOIUrl":null,"url":null,"abstract":"<div><div>The 3D (three-dimensional) reconstruction of the anterior segment obtained from AS-OCT scanning devices is essential for diagnosing and monitoring cornea and iris, as well as for localizing and quantifying keratitis. However, this process faces two significant challenges: (1) The consecutive images acquired through rotational scanning are difficult to align and register; (2) The existing medical image segmentation technology cannot effectively segment the cornea, which are critical preprocessing steps for effective 3D visualization of the anterior segment. To tackle these dual challenges, an encoder-shared visual state space network for the 3D reconstruction of the anterior segment is proposed. This network integrates image alignment and segmentation into a unified framework. It employs the same encoder to handle spatial images for both alignment and segmentation tasks. A visual state space projection method is utilized to compute the homography matrix of adjacent images, thereby facilitating their alignment. Furthermore, we introduce a channel-wise visual state space fusion technique in conjunction with a decoder block that captures complex contextual interdependencies, enhances shape-preserving feature representation, and improves segmentation accuracy. Based on the resulting corneal segmentation outcomes, we accurately reconstruct 3D volume data from the aligned images. Experimental results on the AIDK-Align and CORNEA datasets demonstrate that our proposed method exhibits remarkable performance in terms of anterior segment alignment, corneal segmentation and 3D reconstruction. Furthermore, we compared encoder-shared visual state space network with state-of-the-art medical image segmentation methods and image alignment algorithms, highlighting its advantages in both alignment and segmentation precision. Our code will be made available at <span><span>https://github.com/qianguiping/Es-VSS</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50631,"journal":{"name":"Computerized Medical Imaging and Graphics","volume":"124 ","pages":"Article 102631"},"PeriodicalIF":4.9000,"publicationDate":"2025-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computerized Medical Imaging and Graphics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0895611125001405","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0
Abstract
The 3D (three-dimensional) reconstruction of the anterior segment obtained from AS-OCT scanning devices is essential for diagnosing and monitoring cornea and iris, as well as for localizing and quantifying keratitis. However, this process faces two significant challenges: (1) The consecutive images acquired through rotational scanning are difficult to align and register; (2) The existing medical image segmentation technology cannot effectively segment the cornea, which are critical preprocessing steps for effective 3D visualization of the anterior segment. To tackle these dual challenges, an encoder-shared visual state space network for the 3D reconstruction of the anterior segment is proposed. This network integrates image alignment and segmentation into a unified framework. It employs the same encoder to handle spatial images for both alignment and segmentation tasks. A visual state space projection method is utilized to compute the homography matrix of adjacent images, thereby facilitating their alignment. Furthermore, we introduce a channel-wise visual state space fusion technique in conjunction with a decoder block that captures complex contextual interdependencies, enhances shape-preserving feature representation, and improves segmentation accuracy. Based on the resulting corneal segmentation outcomes, we accurately reconstruct 3D volume data from the aligned images. Experimental results on the AIDK-Align and CORNEA datasets demonstrate that our proposed method exhibits remarkable performance in terms of anterior segment alignment, corneal segmentation and 3D reconstruction. Furthermore, we compared encoder-shared visual state space network with state-of-the-art medical image segmentation methods and image alignment algorithms, highlighting its advantages in both alignment and segmentation precision. Our code will be made available at https://github.com/qianguiping/Es-VSS.
as - oct扫描设备获得的角膜前段3D(三维)重建对于角膜和虹膜的诊断和监测以及角膜炎的定位和量化至关重要。然而,这一过程面临两个重大挑战:(1)通过旋转扫描获得的连续图像难以对齐和配准;(2)现有医学图像分割技术无法对角膜进行有效分割,而这是有效实现角膜前段三维可视化的关键预处理步骤。为了解决这两个问题,提出了一种编码器共享的视觉状态空间网络,用于前段的三维重建。该网络将图像对齐和分割集成到一个统一的框架中。它采用相同的编码器来处理空间图像的对齐和分割任务。利用视觉状态空间投影法计算相邻图像的单应性矩阵,从而便于相邻图像的对齐。此外,我们引入了一种通道视觉状态空间融合技术,该技术与解码器块相结合,可以捕获复杂的上下文相互依赖性,增强形状保持特征表示,并提高分割精度。基于所得的角膜分割结果,我们从对齐的图像中准确地重建三维体数据。在AIDK-Align和CORNEA数据集上的实验结果表明,我们提出的方法在角膜前段对齐、角膜分割和三维重建方面具有显著的性能。此外,我们将编码器共享的视觉状态空间网络与最先进的医学图像分割方法和图像对齐算法进行了比较,突出了其在对齐和分割精度方面的优势。我们的代码将在https://github.com/qianguiping/Es-VSS上提供。
期刊介绍:
The purpose of the journal Computerized Medical Imaging and Graphics is to act as a source for the exchange of research results concerning algorithmic advances, development, and application of digital imaging in disease detection, diagnosis, intervention, prevention, precision medicine, and population health. Included in the journal will be articles on novel computerized imaging or visualization techniques, including artificial intelligence and machine learning, augmented reality for surgical planning and guidance, big biomedical data visualization, computer-aided diagnosis, computerized-robotic surgery, image-guided therapy, imaging scanning and reconstruction, mobile and tele-imaging, radiomics, and imaging integration and modeling with other information relevant to digital health. The types of biomedical imaging include: magnetic resonance, computed tomography, ultrasound, nuclear medicine, X-ray, microwave, optical and multi-photon microscopy, video and sensory imaging, and the convergence of biomedical images with other non-imaging datasets.