dVPose: Automated Data Collection and Dataset for 6D Pose Estimation of Robotic Surgical Instruments

Nicholas Greene, Wenkai Luo, P. Kazanzides
{"title":"dVPose: Automated Data Collection and Dataset for 6D Pose Estimation of Robotic Surgical Instruments","authors":"Nicholas Greene, Wenkai Luo, P. Kazanzides","doi":"10.1109/ISMR57123.2023.10130238","DOIUrl":null,"url":null,"abstract":"We present dVPose, a realistic multi-modality dataset intended for use in the development and evaluation of real-time single-shot deep-learning based 6D pose estimation algorithms on a head mounted display (HMD). In addition to the dataset, our contribution includes an automated (robotic) data collection platform that integrates an accurate optical tracking system to provide the ground-truth poses. We collected a comprehensive set of data for vision-based 6D pose estimation, including images and poses of the extra-corporeal portions of the instruments and endoscope of a da Vinci surgical robot. The images are collected using the multi-camera rig of the Microsoft HoloLens 2 HMD, mounted on a UR10 robot, and the corresponding poses are collected by optically tracking both the instruments/endoscope and HMD. The intended application is to enable markerless localization of the HMD with respect to the da Vinci robot, considering that the instruments and endoscope are among the few robotic components that are not covered by sterile drapes. Our dataset features synchronized images from the RGB, depth, and grayscale cameras of the HoloLens 2 device. It is unique in that it provides medically focused images, provides images from a HoloLens 2 device where object tracking is a fundamental task, and provides data from multiple visible-light cameras in addition to depth. Furthermore, the automated data collection platform can be easily adapted to collect images and ground-truth poses of other objects.","PeriodicalId":276757,"journal":{"name":"2023 International Symposium on Medical Robotics (ISMR)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Symposium on Medical Robotics (ISMR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISMR57123.2023.10130238","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

We present dVPose, a realistic multi-modality dataset intended for use in the development and evaluation of real-time single-shot deep-learning based 6D pose estimation algorithms on a head mounted display (HMD). In addition to the dataset, our contribution includes an automated (robotic) data collection platform that integrates an accurate optical tracking system to provide the ground-truth poses. We collected a comprehensive set of data for vision-based 6D pose estimation, including images and poses of the extra-corporeal portions of the instruments and endoscope of a da Vinci surgical robot. The images are collected using the multi-camera rig of the Microsoft HoloLens 2 HMD, mounted on a UR10 robot, and the corresponding poses are collected by optically tracking both the instruments/endoscope and HMD. The intended application is to enable markerless localization of the HMD with respect to the da Vinci robot, considering that the instruments and endoscope are among the few robotic components that are not covered by sterile drapes. Our dataset features synchronized images from the RGB, depth, and grayscale cameras of the HoloLens 2 device. It is unique in that it provides medically focused images, provides images from a HoloLens 2 device where object tracking is a fundamental task, and provides data from multiple visible-light cameras in addition to depth. Furthermore, the automated data collection platform can be easily adapted to collect images and ground-truth poses of other objects.
dVPose:机器人手术器械6D姿态估计的自动数据收集和数据集
我们提出了dVPose,一个现实的多模态数据集,旨在用于开发和评估基于头戴式显示器(HMD)的实时单镜头深度学习的6D姿态估计算法。除了数据集,我们的贡献还包括一个自动化(机器人)数据收集平台,该平台集成了一个精确的光学跟踪系统,以提供地面真实的姿势。我们收集了一套全面的数据,用于基于视觉的6D姿态估计,包括仪器和达芬奇手术机器人内窥镜的外部部分的图像和姿态。使用安装在UR10机器人上的Microsoft HoloLens 2 HMD的多摄像头采集图像,并通过光学跟踪仪器/内窥镜和HMD收集相应的姿势。考虑到仪器和内窥镜是为数不多的没有被无菌窗帘覆盖的机器人部件,预期的应用是使HMD相对于达芬奇机器人的无标记定位成为可能。我们的数据集具有来自HoloLens 2设备的RGB,深度和灰度相机的同步图像。它的独特之处在于,它提供医学聚焦图像,提供来自HoloLens 2设备的图像,其中物体跟踪是一项基本任务,并提供来自多个可见光相机的数据。此外,自动化数据采集平台可以很容易地适应采集其他物体的图像和地真姿态。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信