Jingwei Song, Ray Zhang, Wenwei Zhang, Hao Zhou, Maani Ghaffari
{"title":"SLAM assisted 3D tracking system for laparoscopic surgery","authors":"Jingwei Song, Ray Zhang, Wenwei Zhang, Hao Zhou, Maani Ghaffari","doi":"arxiv-2409.11688","DOIUrl":null,"url":null,"abstract":"A major limitation of minimally invasive surgery is the difficulty in\naccurately locating the internal anatomical structures of the target organ due\nto the lack of tactile feedback and transparency. Augmented reality (AR) offers\na promising solution to overcome this challenge. Numerous studies have shown\nthat combining learning-based and geometric methods can achieve accurate\npreoperative and intraoperative data registration. This work proposes a\nreal-time monocular 3D tracking algorithm for post-registration tasks. The\nORB-SLAM2 framework is adopted and modified for prior-based 3D tracking. The\nprimitive 3D shape is used for fast initialization of the monocular SLAM. A\npseudo-segmentation strategy is employed to separate the target organ from the\nbackground for tracking purposes, and the geometric prior of the 3D shape is\nincorporated as an additional constraint in the pose graph. Experiments from\nin-vivo and ex-vivo tests demonstrate that the proposed 3D tracking system\nprovides robust 3D tracking and effectively handles typical challenges such as\nfast motion, out-of-field-of-view scenarios, partial visibility, and\n\"organ-background\" relative motion.","PeriodicalId":501031,"journal":{"name":"arXiv - CS - Robotics","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Robotics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.11688","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
A major limitation of minimally invasive surgery is the difficulty in
accurately locating the internal anatomical structures of the target organ due
to the lack of tactile feedback and transparency. Augmented reality (AR) offers
a promising solution to overcome this challenge. Numerous studies have shown
that combining learning-based and geometric methods can achieve accurate
preoperative and intraoperative data registration. This work proposes a
real-time monocular 3D tracking algorithm for post-registration tasks. The
ORB-SLAM2 framework is adopted and modified for prior-based 3D tracking. The
primitive 3D shape is used for fast initialization of the monocular SLAM. A
pseudo-segmentation strategy is employed to separate the target organ from the
background for tracking purposes, and the geometric prior of the 3D shape is
incorporated as an additional constraint in the pose graph. Experiments from
in-vivo and ex-vivo tests demonstrate that the proposed 3D tracking system
provides robust 3D tracking and effectively handles typical challenges such as
fast motion, out-of-field-of-view scenarios, partial visibility, and
"organ-background" relative motion.