Next-generation surgical navigation: Marker-less multi-view 6DoF pose estimation of surgical instruments

IF 10.7 1区 医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Jonas Hein , Nicola Cavalcanti , Daniel Suter , Lukas Zingg , Fabio Carrillo , Lilian Calvet , Mazda Farshad , Nassir Navab , Marc Pollefeys , Philipp Fürnstahl
{"title":"Next-generation surgical navigation: Marker-less multi-view 6DoF pose estimation of surgical instruments","authors":"Jonas Hein ,&nbsp;Nicola Cavalcanti ,&nbsp;Daniel Suter ,&nbsp;Lukas Zingg ,&nbsp;Fabio Carrillo ,&nbsp;Lilian Calvet ,&nbsp;Mazda Farshad ,&nbsp;Nassir Navab ,&nbsp;Marc Pollefeys ,&nbsp;Philipp Fürnstahl","doi":"10.1016/j.media.2025.103613","DOIUrl":null,"url":null,"abstract":"<div><div>State-of-the-art research of traditional computer vision is increasingly leveraged in the surgical domain. A particular focus in computer-assisted surgery is to replace marker-based tracking systems for instrument localization with pure image-based 6DoF pose estimation using deep-learning methods. However, state-of-the-art single-view pose estimation methods do not yet meet the accuracy required for surgical navigation. In this context, we investigate the benefits of multi-view setups for highly accurate and occlusion-robust 6DoF pose estimation of surgical instruments and derive recommendations for an ideal camera system that addresses the challenges in the operating room. Our contributions are threefold. First, we present a multi-view RGB-D video dataset of ex-vivo spine surgeries, captured with static and head-mounted cameras and including rich annotations for surgeon, instruments, and patient anatomy. Second, we perform an extensive evaluation of three state-of-the-art single-view and multi-view pose estimation methods, analyzing the impact of camera quantities and positioning, limited real-world data, and static, hybrid, or fully mobile camera setups on the pose accuracy, occlusion robustness, and generalizability. Third, we design a multi-camera system for marker-less surgical instrument tracking, achieving an average position error of 1.01<!--> <!-->mm and orientation error of 0.89° for a surgical drill, and 2.79<!--> <!-->mm and 3.33° for a screwdriver under optimal conditions. Our results demonstrate that marker-less tracking of surgical instruments is becoming a feasible alternative to existing marker-based systems.</div></div>","PeriodicalId":18328,"journal":{"name":"Medical image analysis","volume":"103 ","pages":"Article 103613"},"PeriodicalIF":10.7000,"publicationDate":"2025-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical image analysis","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1361841525001604","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

State-of-the-art research of traditional computer vision is increasingly leveraged in the surgical domain. A particular focus in computer-assisted surgery is to replace marker-based tracking systems for instrument localization with pure image-based 6DoF pose estimation using deep-learning methods. However, state-of-the-art single-view pose estimation methods do not yet meet the accuracy required for surgical navigation. In this context, we investigate the benefits of multi-view setups for highly accurate and occlusion-robust 6DoF pose estimation of surgical instruments and derive recommendations for an ideal camera system that addresses the challenges in the operating room. Our contributions are threefold. First, we present a multi-view RGB-D video dataset of ex-vivo spine surgeries, captured with static and head-mounted cameras and including rich annotations for surgeon, instruments, and patient anatomy. Second, we perform an extensive evaluation of three state-of-the-art single-view and multi-view pose estimation methods, analyzing the impact of camera quantities and positioning, limited real-world data, and static, hybrid, or fully mobile camera setups on the pose accuracy, occlusion robustness, and generalizability. Third, we design a multi-camera system for marker-less surgical instrument tracking, achieving an average position error of 1.01 mm and orientation error of 0.89° for a surgical drill, and 2.79 mm and 3.33° for a screwdriver under optimal conditions. Our results demonstrate that marker-less tracking of surgical instruments is becoming a feasible alternative to existing marker-based systems.
下一代手术导航:无标记的多视角6DoF手术器械姿态估计
传统计算机视觉的最新研究越来越多地应用于外科领域。计算机辅助手术的一个特别重点是用使用深度学习方法的纯基于图像的6DoF姿态估计取代基于标记的仪器定位跟踪系统。然而,最先进的单视图姿态估计方法还不能满足手术导航所需的精度。在这种情况下,我们研究了多视角设置对手术器械的高精度和抗闭塞性6DoF姿态估计的好处,并得出了解决手术室挑战的理想相机系统的建议。我们的贡献是三重的。首先,我们展示了一个多视图RGB-D离体脊柱手术视频数据集,该数据集由静态和头戴式摄像机捕获,并包括对外科医生、器械和患者解剖结构的丰富注释。其次,我们对三种最先进的单视图和多视图姿态估计方法进行了广泛的评估,分析了相机数量和定位、有限的真实世界数据以及静态、混合或完全移动相机设置对姿态精度、遮挡鲁棒性和通用性的影响。第三,我们设计了一个用于无标记手术器械跟踪的多摄像头系统,在最佳条件下,手术钻头的平均位置误差为1.01 mm,方向误差为0.89°,螺丝刀的平均位置误差为2.79 mm,方向误差为3.33°。我们的研究结果表明,手术器械的无标记跟踪正在成为现有的基于标记的系统的可行替代方案。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Medical image analysis
Medical image analysis 工程技术-工程:生物医学
CiteScore
22.10
自引率
6.40%
发文量
309
审稿时长
6.6 months
期刊介绍: Medical Image Analysis serves as a platform for sharing new research findings in the realm of medical and biological image analysis, with a focus on applications of computer vision, virtual reality, and robotics to biomedical imaging challenges. The journal prioritizes the publication of high-quality, original papers contributing to the fundamental science of processing, analyzing, and utilizing medical and biological images. It welcomes approaches utilizing biomedical image datasets across all spatial scales, from molecular/cellular imaging to tissue/organ imaging.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信