Performance evaluation in cataract surgery with an ensemble of 2D–3D convolutional neural networks

IF 2.8 Q3 ENGINEERING, BIOMEDICAL

Healthcare Technology Letters Pub Date : 2024-02-17 DOI:10.1049/htl2.12078

Ummey Tanin, Adrienne Duimering, Christine Law, Jessica Ruzicki, Gabriela Luna, Matthew Holden

{"title":"Performance evaluation in cataract surgery with an ensemble of 2D–3D convolutional neural networks","authors":"Ummey Tanin, Adrienne Duimering, Christine Law, Jessica Ruzicki, Gabriela Luna, Matthew Holden","doi":"10.1049/htl2.12078","DOIUrl":null,"url":null,"abstract":"<p>An important part of surgical training in ophthalmology is understanding how to proficiently perform cataract surgery. Operating skill in cataract surgery is typically assessed by real-time or video-based expert review using a rating scale. This is time-consuming, subjective and labour-intensive. A typical trainee graduates with over 100 complete surgeries, each of which requires review by the surgical educators. Due to the consistently repetitive nature of this task, it lends itself well to machine learning-based evaluation. Recent studies utilize deep learning models trained on tool motion trajectories obtained using additional equipment or robotic systems. However, the process of tool recognition by extracting frames from the videos to perform phase recognition followed by skill assessment is exhaustive. This project proposes a deep learning model for skill evaluation using raw surgery videos that is cost-effective and end-to-end trainable. An advanced ensemble of convolutional neural network models is leveraged to model technical skills in cataract surgeries and is evaluated using a large dataset comprising almost 200 surgical trials. The highest accuracy of 0.8494 is observed on the phacoemulsification step data. Our model yielded an average accuracy of 0.8200 and an average AUC score of 0.8800 for all four phase datasets of cataract surgery proving its robustness against different data. The proposed ensemble model with 2D and 3D convolutional neural networks demonstrated a promising result without using tool motion trajectories to evaluate surgery expertise.</p>","PeriodicalId":37474,"journal":{"name":"Healthcare Technology Letters","volume":"11 2-3","pages":"189-195"},"PeriodicalIF":2.8000,"publicationDate":"2024-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/htl2.12078","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Healthcare Technology Letters","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/htl2.12078","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}

引用次数: 0

Abstract

An important part of surgical training in ophthalmology is understanding how to proficiently perform cataract surgery. Operating skill in cataract surgery is typically assessed by real-time or video-based expert review using a rating scale. This is time-consuming, subjective and labour-intensive. A typical trainee graduates with over 100 complete surgeries, each of which requires review by the surgical educators. Due to the consistently repetitive nature of this task, it lends itself well to machine learning-based evaluation. Recent studies utilize deep learning models trained on tool motion trajectories obtained using additional equipment or robotic systems. However, the process of tool recognition by extracting frames from the videos to perform phase recognition followed by skill assessment is exhaustive. This project proposes a deep learning model for skill evaluation using raw surgery videos that is cost-effective and end-to-end trainable. An advanced ensemble of convolutional neural network models is leveraged to model technical skills in cataract surgeries and is evaluated using a large dataset comprising almost 200 surgical trials. The highest accuracy of 0.8494 is observed on the phacoemulsification step data. Our model yielded an average accuracy of 0.8200 and an average AUC score of 0.8800 for all four phase datasets of cataract surgery proving its robustness against different data. The proposed ensemble model with 2D and 3D convolutional neural networks demonstrated a promising result without using tool motion trajectories to evaluate surgery expertise.

Abstract Image

查看原文本刊更多论文

利用 2D-3D 卷积神经网络集合进行白内障手术的性能评估

眼科手术培训的一个重要部分是了解如何熟练地进行白内障手术。白内障手术的操作技能通常是通过实时或视频专家评审，使用评分表进行评估。这种方法耗时、主观且劳动密集。一名典型的受训人员在毕业时需要完成 100 多例完整的手术，而每例手术都需要由手术教育者进行审查。由于这项任务具有持续重复的性质，因此非常适合基于机器学习的评估。最近的研究利用使用附加设备或机器人系统获得的工具运动轨迹来训练深度学习模型。然而，通过从视频中提取帧来进行阶段识别，然后再进行技能评估，这样的工具识别过程非常繁琐。本项目提出了一种利用原始手术视频进行技能评估的深度学习模型，该模型具有成本效益高、端到端可训练的特点。利用先进的卷积神经网络模型集合对白内障手术中的技术技能进行建模，并使用包含近 200 个手术试验的大型数据集进行评估。在超声乳化步骤数据中观察到的最高准确率为 0.8494。在白内障手术的所有四个阶段数据集上，我们的模型得出了 0.8200 的平均准确率和 0.8800 的平均 AUC 分数，这证明了它对不同数据的鲁棒性。所提出的二维和三维卷积神经网络集合模型在不使用工具运动轨迹来评估手术专业性的情况下取得了很好的效果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Healthcare Technology Letters Health Professions-Health Information Management

CiteScore

6.10

自引率

4.80%

发文量

审稿时长

22 weeks

期刊介绍： Healthcare Technology Letters aims to bring together an audience of biomedical and electrical engineers, physical and computer scientists, and mathematicians to enable the exchange of the latest ideas and advances through rapid online publication of original healthcare technology research. Major themes of the journal include (but are not limited to): Major technological/methodological areas: Biomedical signal processing Biomedical imaging and image processing Bioinstrumentation (sensors, wearable technologies, etc) Biomedical informatics Major application areas: Cardiovascular and respiratory systems engineering Neural engineering, neuromuscular systems Rehabilitation engineering Bio-robotics, surgical planning and biomechanics Therapeutic and diagnostic systems, devices and technologies Clinical engineering Healthcare information systems, telemedicine, mHealth.