A Nested U-Structure for Instrument Segmentation in Robotic Surgery

2023 International Conference on Advanced Robotics and Mechatronics (ICARM) Pub Date : 2023-07-08 DOI:10.1109/ICARM58088.2023.10218893

Yanjie Xia, Shaochen Wang, Zheng Kan

{"title":"A Nested U-Structure for Instrument Segmentation in Robotic Surgery","authors":"Yanjie Xia, Shaochen Wang, Zheng Kan","doi":"10.1109/ICARM58088.2023.10218893","DOIUrl":null,"url":null,"abstract":"Robot-assisted surgery has made great progress with the development of medical imaging and robotics technology. Medical scene understanding can greatly improve surgical performance while the semantic segmentation of the robotic instrument is a key enabling technology for robot-assisted surgery. However, how to locate an instrument's position and estimate their pose in complex surgical environments is still a challenging fundamental problem. In this paper, pixel-wise instrument segmentation is investigated. The contributions of the paper are twofold: 1) We proposed a two-level nested U-structure model, which is an encoder-decoder architecture with skip-connections and each layer of the network structure adopts a U-structure instead of a simple superposition of convolutional layers. The model can capture more context information from multiple scales and better fuse the local and global information to achieve high-quality segmentation. 2) Experiments have been conducted to qualitatively and quantitatively show the performance of our approach on three segmentation tasks: the binary segmentation, the parts segmentation, and the type segmentation, respectively. The results show that our method significantly improves the segmentation performance and outperforms state-of-the-art approaches.","PeriodicalId":220013,"journal":{"name":"2023 International Conference on Advanced Robotics and Mechatronics (ICARM)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Advanced Robotics and Mechatronics (ICARM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICARM58088.2023.10218893","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

Robot-assisted surgery has made great progress with the development of medical imaging and robotics technology. Medical scene understanding can greatly improve surgical performance while the semantic segmentation of the robotic instrument is a key enabling technology for robot-assisted surgery. However, how to locate an instrument's position and estimate their pose in complex surgical environments is still a challenging fundamental problem. In this paper, pixel-wise instrument segmentation is investigated. The contributions of the paper are twofold: 1) We proposed a two-level nested U-structure model, which is an encoder-decoder architecture with skip-connections and each layer of the network structure adopts a U-structure instead of a simple superposition of convolutional layers. The model can capture more context information from multiple scales and better fuse the local and global information to achieve high-quality segmentation. 2) Experiments have been conducted to qualitatively and quantitatively show the performance of our approach on three segmentation tasks: the binary segmentation, the parts segmentation, and the type segmentation, respectively. The results show that our method significantly improves the segmentation performance and outperforms state-of-the-art approaches.

查看原文本刊更多论文

基于嵌套u型结构的机器人手术器械分割

随着医学影像学和机器人技术的发展，机器人辅助手术取得了很大的进步。医疗场景理解可以极大地提高手术效果，而机器人器械的语义分割是机器人辅助手术的关键使能技术。然而，如何在复杂的手术环境中定位器械的位置并估计其姿态仍然是一个具有挑战性的基本问题。本文研究了基于像素的仪器分割方法。本文的贡献有两个方面:1)提出了一种两层嵌套的u结构模型，该模型是一种具有跳过连接的编码器-解码器结构，网络结构的每一层采用u结构，而不是简单的卷积层叠加。该模型可以从多个尺度捕获更多的上下文信息，更好地融合局部和全局信息，实现高质量的分割。2)分别在二值分割、部分分割和类型分割三种分割任务上进行了定性和定量的实验。结果表明，我们的方法显著提高了分割性能，优于最先进的方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2023 International Conference on Advanced Robotics and Mechatronics (ICARM)

自引率

0.00%

发文量