Monocular suture needle pose detection using synthetic data augmented convolutional neural network
Yifan Wang, Saul Alexis Heredia Perez, Kanako Harada
International Journal of Computer Assisted Radiology and Surgery, pp. 2019-2030
DOI: 10.1007/s11548-025-03467-1
Published: 2025-10-01 (Epub 2025-06-24)
Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12518471/pdf/
Citations: 0
Abstract
Purpose: Robotic microsurgery enhances the surgeon's dexterity and stability, enabling precise and delicate surgical procedures at the microscopic level. Accurate needle pose estimation is critical for robotic micro-suturing, enabling optimized insertion trajectories and facilitating autonomous control. However, accurately estimating the pose of a needle during manipulation, particularly under monocular vision, remains a challenge. This study proposes a convolutional neural network-based method to estimate the pose of a suture needle from monocular images.
Methods: The 3D pose of the needle is estimated from keypoint information in 2D images. A convolutional neural network was trained to estimate the positions of keypoints on the needle, specifically the tip, middle, and end points. A hybrid dataset comprising images from both real-world and synthetic (simulated) environments was developed to train the model. Subsequently, an algorithm was designed to estimate the 3D positions of these keypoints. The 2D keypoint detection and 3D orientation estimation were evaluated by translation and orientation error metrics, respectively.
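Since a suture needle is a circular arc lying in a plane, its 3D orientation can be recovered from the three keypoints once their 3D positions are known. The sketch below illustrates one plausible geometric construction: the plane normal as the cross product of the two chords from the tip, and the direction vector as the tip-to-end chord. The paper does not specify its exact vector conventions, so both choices here are assumptions for illustration only.

```python
import numpy as np

def needle_orientation(tip, mid, end):
    """Estimate a needle's plane normal and direction vector from three
    3D keypoints (tip, middle, end).

    Assumptions (not from the paper): the normal is the unit cross
    product of the chords (mid - tip) and (end - tip), and the direction
    vector is the unit tip-to-end chord.
    """
    tip, mid, end = (np.asarray(p, dtype=float) for p in (tip, mid, end))
    # Normal of the plane containing the circular needle arc.
    normal = np.cross(mid - tip, end - tip)
    normal = normal / np.linalg.norm(normal)
    # Overall pointing direction of the needle, tip to end.
    direction = (end - tip) / np.linalg.norm(end - tip)
    return normal, direction
```

For three keypoints in the xy-plane, for example, the recovered normal is the z-axis (up to sign), which matches the expected plane geometry.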
Results: Experiments conducted on synthetic data showed that the average translation errors for the tip, middle, and end points were 0.107 mm, 0.118 mm, and 0.098 mm, and the average orientation angular errors were 12.75° for the normal vector and 15.55° for the direction vector. When evaluated on real data, the method demonstrated 2D translation errors averaging 0.047 mm, 0.052 mm, and 0.049 mm for the respective keypoints, with 93.85% of detected keypoints having errors below 4 pixels.
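The orientation errors reported above are angular errors between estimated and ground-truth unit vectors. A minimal sketch of this standard metric (the paper does not publish its evaluation code, so this is an assumed but conventional formulation):

```python
import numpy as np

def angular_error_deg(v_est, v_gt):
    """Angle (in degrees) between an estimated and a ground-truth vector,
    the standard metric behind errors like 12.75 deg (normal vector)."""
    v_est = np.asarray(v_est, dtype=float)
    v_gt = np.asarray(v_gt, dtype=float)
    cos_theta = np.dot(v_est, v_gt) / (np.linalg.norm(v_est) * np.linalg.norm(v_gt))
    # Clip guards against arccos domain errors from floating-point drift.
    return float(np.degrees(np.arccos(np.clip(cos_theta, -1.0, 1.0))))
```

Orthogonal vectors yield 90°, and parallel vectors yield 0° regardless of magnitude, since only direction is compared.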
Conclusions: This study presents a CNN-based method, augmented with synthetic images, to estimate the pose of a suture needle from monocular images. Experimental results indicate that the method effectively estimates the 2D positions and 3D orientations of the suture needle in synthetic images. The model also shows reasonable performance on real data, highlighting its promise for real-time application in robotic microsurgery.
Journal overview:
The International Journal of Computer Assisted Radiology and Surgery (IJCARS) is a peer-reviewed journal that provides a platform for closing the gap between medical and technical disciplines, and encourages interdisciplinary research and development activities in an international environment.