下一代手术导航：无标记的多视角6DoF手术器械姿态估计

IF 11.8 1区医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Medical image analysis Pub Date : 2025-05-10 DOI:10.1016/j.media.2025.103613

Jonas Hein , Nicola Cavalcanti , Daniel Suter , Lukas Zingg , Fabio Carrillo , Lilian Calvet , Mazda Farshad , Nassir Navab , Marc Pollefeys , Philipp Fürnstahl

{"title":"下一代手术导航：无标记的多视角6DoF手术器械姿态估计","authors":"Jonas Hein , Nicola Cavalcanti , Daniel Suter , Lukas Zingg , Fabio Carrillo , Lilian Calvet , Mazda Farshad , Nassir Navab , Marc Pollefeys , Philipp Fürnstahl","doi":"10.1016/j.media.2025.103613","DOIUrl":null,"url":null,"abstract":"<div><div>State-of-the-art research of traditional computer vision is increasingly leveraged in the surgical domain. A particular focus in computer-assisted surgery is to replace marker-based tracking systems for instrument localization with pure image-based 6DoF pose estimation using deep-learning methods. However, state-of-the-art single-view pose estimation methods do not yet meet the accuracy required for surgical navigation. In this context, we investigate the benefits of multi-view setups for highly accurate and occlusion-robust 6DoF pose estimation of surgical instruments and derive recommendations for an ideal camera system that addresses the challenges in the operating room. Our contributions are threefold. First, we present a multi-view RGB-D video dataset of ex-vivo spine surgeries, captured with static and head-mounted cameras and including rich annotations for surgeon, instruments, and patient anatomy. Second, we perform an extensive evaluation of three state-of-the-art single-view and multi-view pose estimation methods, analyzing the impact of camera quantities and positioning, limited real-world data, and static, hybrid, or fully mobile camera setups on the pose accuracy, occlusion robustness, and generalizability. Third, we design a multi-camera system for marker-less surgical instrument tracking, achieving an average position error of 1.01mm and orientation error of 0.89° for a surgical drill, and 2.79mm and 3.33° for a screwdriver under optimal conditions. Our results demonstrate that marker-less tracking of surgical instruments is becoming a feasible alternative to existing marker-based systems.</div></div>","PeriodicalId":18328,"journal":{"name":"Medical image analysis","volume":"103 ","pages":"Article 103613"},"PeriodicalIF":11.8000,"publicationDate":"2025-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Next-generation surgical navigation: Marker-less multi-view 6DoF pose estimation of surgical instruments\",\"authors\":\"Jonas Hein , Nicola Cavalcanti , Daniel Suter , Lukas Zingg , Fabio Carrillo , Lilian Calvet , Mazda Farshad , Nassir Navab , Marc Pollefeys , Philipp Fürnstahl\",\"doi\":\"10.1016/j.media.2025.103613\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>State-of-the-art research of traditional computer vision is increasingly leveraged in the surgical domain. A particular focus in computer-assisted surgery is to replace marker-based tracking systems for instrument localization with pure image-based 6DoF pose estimation using deep-learning methods. However, state-of-the-art single-view pose estimation methods do not yet meet the accuracy required for surgical navigation. In this context, we investigate the benefits of multi-view setups for highly accurate and occlusion-robust 6DoF pose estimation of surgical instruments and derive recommendations for an ideal camera system that addresses the challenges in the operating room. Our contributions are threefold. First, we present a multi-view RGB-D video dataset of ex-vivo spine surgeries, captured with static and head-mounted cameras and including rich annotations for surgeon, instruments, and patient anatomy. Second, we perform an extensive evaluation of three state-of-the-art single-view and multi-view pose estimation methods, analyzing the impact of camera quantities and positioning, limited real-world data, and static, hybrid, or fully mobile camera setups on the pose accuracy, occlusion robustness, and generalizability. Third, we design a multi-camera system for marker-less surgical instrument tracking, achieving an average position error of 1.01mm and orientation error of 0.89° for a surgical drill, and 2.79mm and 3.33° for a screwdriver under optimal conditions. Our results demonstrate that marker-less tracking of surgical instruments is becoming a feasible alternative to existing marker-based systems.</div></div>\",\"PeriodicalId\":18328,\"journal\":{\"name\":\"Medical image analysis\",\"volume\":\"103 \",\"pages\":\"Article 103613\"},\"PeriodicalIF\":11.8000,\"publicationDate\":\"2025-05-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Medical image analysis\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1361841525001604\",\"RegionNum\":1,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical image analysis","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1361841525001604","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

传统计算机视觉的最新研究越来越多地应用于外科领域。计算机辅助手术的一个特别重点是用使用深度学习方法的纯基于图像的6DoF姿态估计取代基于标记的仪器定位跟踪系统。然而，最先进的单视图姿态估计方法还不能满足手术导航所需的精度。在这种情况下，我们研究了多视角设置对手术器械的高精度和抗闭塞性6DoF姿态估计的好处，并得出了解决手术室挑战的理想相机系统的建议。我们的贡献是三重的。首先，我们展示了一个多视图RGB-D离体脊柱手术视频数据集，该数据集由静态和头戴式摄像机捕获，并包括对外科医生、器械和患者解剖结构的丰富注释。其次，我们对三种最先进的单视图和多视图姿态估计方法进行了广泛的评估，分析了相机数量和定位、有限的真实世界数据以及静态、混合或完全移动相机设置对姿态精度、遮挡鲁棒性和通用性的影响。第三，我们设计了一个用于无标记手术器械跟踪的多摄像头系统，在最佳条件下，手术钻头的平均位置误差为1.01 mm，方向误差为0.89°，螺丝刀的平均位置误差为2.79 mm，方向误差为3.33°。我们的研究结果表明，手术器械的无标记跟踪正在成为现有的基于标记的系统的可行替代方案。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Next-generation surgical navigation: Marker-less multi-view 6DoF pose estimation of surgical instruments

查看原文本刊更多论文

Next-generation surgical navigation: Marker-less multi-view 6DoF pose estimation of surgical instruments

State-of-the-art research of traditional computer vision is increasingly leveraged in the surgical domain. A particular focus in computer-assisted surgery is to replace marker-based tracking systems for instrument localization with pure image-based 6DoF pose estimation using deep-learning methods. However, state-of-the-art single-view pose estimation methods do not yet meet the accuracy required for surgical navigation. In this context, we investigate the benefits of multi-view setups for highly accurate and occlusion-robust 6DoF pose estimation of surgical instruments and derive recommendations for an ideal camera system that addresses the challenges in the operating room. Our contributions are threefold. First, we present a multi-view RGB-D video dataset of ex-vivo spine surgeries, captured with static and head-mounted cameras and including rich annotations for surgeon, instruments, and patient anatomy. Second, we perform an extensive evaluation of three state-of-the-art single-view and multi-view pose estimation methods, analyzing the impact of camera quantities and positioning, limited real-world data, and static, hybrid, or fully mobile camera setups on the pose accuracy, occlusion robustness, and generalizability. Third, we design a multi-camera system for marker-less surgical instrument tracking, achieving an average position error of 1.01 mm and orientation error of 0.89° for a surgical drill, and 2.79 mm and 3.33° for a screwdriver under optimal conditions. Our results demonstrate that marker-less tracking of surgical instruments is becoming a feasible alternative to existing marker-based systems.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Medical image analysis 工程技术-工程：生物医学

CiteScore

22.10

自引率

6.40%

发文量

309

审稿时长

6.6 months

期刊介绍： Medical Image Analysis serves as a platform for sharing new research findings in the realm of medical and biological image analysis, with a focus on applications of computer vision, virtual reality, and robotics to biomedical imaging challenges. The journal prioritizes the publication of high-quality, original papers contributing to the fundamental science of processing, analyzing, and utilizing medical and biological images. It welcomes approaches utilizing biomedical image datasets across all spatial scales, from molecular/cellular imaging to tissue/organ imaging.