Global region reidentification for camera relocalization in video-based surgical navigation.

IF 2.3 3区医学 Q3 ENGINEERING, BIOMEDICAL

International Journal of Computer Assisted Radiology and Surgery Pub Date : 2026-05-08 DOI:10.1007/s11548-026-03650-y

Roger D Soberanis-Mukul, Ryan Chou, Chin Hang Ryan Chan, Jan Emily Mangulabnan, Lalithkumar Seenivasan, Simon Bonaventura Ertlmaier, S Swaroop Vedula, Russell H Taylor, Masaru Ishii, Gregory Hager, Mathias Unberath

{"title":"Global region reidentification for camera relocalization in video-based surgical navigation.","authors":"Roger D Soberanis-Mukul, Ryan Chou, Chin Hang Ryan Chan, Jan Emily Mangulabnan, Lalithkumar Seenivasan, Simon Bonaventura Ertlmaier, S Swaroop Vedula, Russell H Taylor, Masaru Ishii, Gregory Hager, Mathias Unberath","doi":"10.1007/s11548-026-03650-y","DOIUrl":null,"url":null,"abstract":"Purpose: Vision-based navigation systems rely on the registered camera poses in the CT space to guide surgeons. However, while it is possible to provide an approximate initialization, this registration becomes outdated as the endoscopic camera leaves and reenters the anatomy. Endoscopic camera relocalization is the process of determining the position of an endoscope relative to an anatomical reference after reinsertion. However, accurately reidentifying the global surgical scene and estimating camera pose have proven challenging due to the varying appearance of endoscopic sequences.Methods: We present a training-free approach to accurately reidentify the region of interest (ROI) and estimate the camera position of a query image after reinsertion. This method utilizes previously observed images with known poses and a CT scan. By combining advanced foundation models with classical techniques, we globally reidentify a prior image of the ROI, which is then used for image-based feature matching and pose recovery via the Perspective-n-Point algorithm.Results: We conducted experiments on eight sequences from three cadaver studies. Our results show that our method accurately reidentifies when the endoscope reaches the ROI and identifies suitable image pairs for PnP-based pose estimation. It achieves an average translation error of 1.74 mm and a rotational error of 0.09 radians, making it suitable for reinitialization in image-based navigation without human intervention.Conclusion: Our work presents a training-free approach for detecting when the endoscope reenters the ROI and estimating the camera's pose after reinsertions. The approach demonstrates promising results contributing toward enabling pose reinitialization for vision-based surgical applications.","PeriodicalId":51251,"journal":{"name":"International Journal of Computer Assisted Radiology and Surgery","volume":" ","pages":""},"PeriodicalIF":2.3000,"publicationDate":"2026-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Computer Assisted Radiology and Surgery","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s11548-026-03650-y","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}

引用次数: 0

Abstract

Purpose: Vision-based navigation systems rely on the registered camera poses in the CT space to guide surgeons. However, while it is possible to provide an approximate initialization, this registration becomes outdated as the endoscopic camera leaves and reenters the anatomy. Endoscopic camera relocalization is the process of determining the position of an endoscope relative to an anatomical reference after reinsertion. However, accurately reidentifying the global surgical scene and estimating camera pose have proven challenging due to the varying appearance of endoscopic sequences.

Methods: We present a training-free approach to accurately reidentify the region of interest (ROI) and estimate the camera position of a query image after reinsertion. This method utilizes previously observed images with known poses and a CT scan. By combining advanced foundation models with classical techniques, we globally reidentify a prior image of the ROI, which is then used for image-based feature matching and pose recovery via the Perspective-n-Point algorithm.

Results: We conducted experiments on eight sequences from three cadaver studies. Our results show that our method accurately reidentifies when the endoscope reaches the ROI and identifies suitable image pairs for PnP-based pose estimation. It achieves an average translation error of 1.74 mm and a rotational error of 0.09 radians, making it suitable for reinitialization in image-based navigation without human intervention.

Conclusion: Our work presents a training-free approach for detecting when the endoscope reenters the ROI and estimating the camera's pose after reinsertions. The approach demonstrates promising results contributing toward enabling pose reinitialization for vision-based surgical applications.

查看原文本刊更多论文

基于视频的手术导航中摄像机定位的全局区域再识别。

目的：基于视觉的导航系统依靠在CT空间中注册的摄像机姿态来指导外科医生。然而，虽然可以提供一个近似的初始化，但随着内窥镜相机离开并重新进入解剖结构，这种注册变得过时了。内窥镜相机重新定位是确定内窥镜在重新插入后相对于解剖学参考的位置的过程。然而，由于内窥镜序列的不同外观，准确地重新识别整个手术场景和估计相机姿势已被证明具有挑战性。方法：我们提出了一种无需训练的方法来准确地重新识别感兴趣区域（ROI）并估计重新插入后查询图像的相机位置。该方法利用先前观察到的已知姿势的图像和CT扫描。通过将先进的基础模型与经典技术相结合，我们在全局上重新识别ROI的先验图像，然后通过Perspective-n-Point算法将其用于基于图像的特征匹配和姿态恢复。结果：我们对3具尸体的8个序列进行了实验。结果表明，当内窥镜到达感兴趣区域时，我们的方法可以准确地重新识别，并为基于pnp的姿态估计识别合适的图像对。平均平移误差为1.74 mm，旋转误差为0.09弧度，适用于基于图像的导航重新初始化，无需人工干预。结论：我们的工作提供了一种无需训练的方法来检测内窥镜何时重新进入ROI并估计重新插入后相机的姿态。该方法显示了有希望的结果，有助于实现基于视觉的外科应用的姿势重新初始化。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Journal of Computer Assisted Radiology and Surgery ENGINEERING, BIOMEDICAL-RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING

CiteScore

5.90

自引率

6.70%

发文量

243

审稿时长

6-12 weeks

期刊介绍： The International Journal for Computer Assisted Radiology and Surgery (IJCARS) is a peer-reviewed journal that provides a platform for closing the gap between medical and technical disciplines, and encourages interdisciplinary research and development activities in an international environment.