Artificial intelligence classifies surgical technical skills in simulated laparoscopy: a pilot study.

IF 2.4 2区医学 Q2 SURGERY

Surgical Endoscopy And Other Interventional Techniques Pub Date : 2025-06-01 Epub Date: 2025-04-21 DOI:10.1007/s00464-025-11715-3

Orr Erlich-Feingold, Roi Anteby, Eyal Klang, Shelly Soffer, Mordechai Cordoba, Ido Nachmany, Imri Amiel, Yiftach Barash

{"title":"Artificial intelligence classifies surgical technical skills in simulated laparoscopy: a pilot study.","authors":"Orr Erlich-Feingold, Roi Anteby, Eyal Klang, Shelly Soffer, Mordechai Cordoba, Ido Nachmany, Imri Amiel, Yiftach Barash","doi":"10.1007/s00464-025-11715-3","DOIUrl":null,"url":null,"abstract":"Objective: To develop a computer algorithm for the automatic classification of basic surgical skills in laparoscopy. The ability to objectively assess the operative skills of trainees would be invaluable for the success of competency-based medical education. Although technical advancements in computer vision have resulted in promising clinical applications, they have not yet been utilized in surgical education.Methods: A single-institution, prospective study involving faculty and trainee surgeons recruited to use a bench-top simulator in order to complete the \"precision cutting\" task from the Fundamentals of Laparoscopic Surgery. An artificial intelligence (AI) computer algorithm was developed based on a transformer neural network model to classify videos of laparoscopic tasks as either executed by an expert or a novice surgeon. Performance metrics were reported in line with the Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis guidelines. The model was trained using fivefold cross-validation. The model's performance was evaluated using sensitivity, specificity, positive predictive value, negative predictive value, accuracy, F1 score, and area under the curve (AUC). The results were averaged across the folds, and 95% confidence intervals were computed for each metric. ROC curves were plotted to visualize the model's performance.Results: The internal dataset comprised 135 videos from 46 participants recruited between 2022 and 2023. Among these, 30 participants (65.2%) were junior surgical residents or medical students, and 16 (34.8%) were board-certified surgeons with prior laparoscopic experience. Following cross-validation, the AI model achieved an accuracy of 0.867 in classifying between novice and expert groups based on video analysis, independent of task completion time. For single-image classification, the model achieved an accuracy of 0.57.Conclusion: This proof-of-concept study serves as a pilot investigation into the application of AI for classifying surgical skills, demonstrating the utility of computer vision in automatically and objectively classifying surgical expertise. While the results show promise, further validation is necessary to establish its utility in routine surgical training and certification. By providing objective evaluations, this technology could support and enhance the role of human evaluators in surgical education.","PeriodicalId":22174,"journal":{"name":"Surgical Endoscopy And Other Interventional Techniques","volume":" ","pages":"3592-3599"},"PeriodicalIF":2.4000,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Surgical Endoscopy And Other Interventional Techniques","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s00464-025-11715-3","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/4/21 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"SURGERY","Score":null,"Total":0}

引用次数: 0

Abstract

Objective: To develop a computer algorithm for the automatic classification of basic surgical skills in laparoscopy. The ability to objectively assess the operative skills of trainees would be invaluable for the success of competency-based medical education. Although technical advancements in computer vision have resulted in promising clinical applications, they have not yet been utilized in surgical education.

Methods: A single-institution, prospective study involving faculty and trainee surgeons recruited to use a bench-top simulator in order to complete the "precision cutting" task from the Fundamentals of Laparoscopic Surgery. An artificial intelligence (AI) computer algorithm was developed based on a transformer neural network model to classify videos of laparoscopic tasks as either executed by an expert or a novice surgeon. Performance metrics were reported in line with the Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis guidelines. The model was trained using fivefold cross-validation. The model's performance was evaluated using sensitivity, specificity, positive predictive value, negative predictive value, accuracy, F1 score, and area under the curve (AUC). The results were averaged across the folds, and 95% confidence intervals were computed for each metric. ROC curves were plotted to visualize the model's performance.

Results: The internal dataset comprised 135 videos from 46 participants recruited between 2022 and 2023. Among these, 30 participants (65.2%) were junior surgical residents or medical students, and 16 (34.8%) were board-certified surgeons with prior laparoscopic experience. Following cross-validation, the AI model achieved an accuracy of 0.867 in classifying between novice and expert groups based on video analysis, independent of task completion time. For single-image classification, the model achieved an accuracy of 0.57.

Conclusion: This proof-of-concept study serves as a pilot investigation into the application of AI for classifying surgical skills, demonstrating the utility of computer vision in automatically and objectively classifying surgical expertise. While the results show promise, further validation is necessary to establish its utility in routine surgical training and certification. By providing objective evaluations, this technology could support and enhance the role of human evaluators in surgical education.

查看原文本刊更多论文

人工智能对模拟腹腔镜手术技术技能的分类：一项试点研究。

目的：开发一种用于腹腔镜手术基本技能自动分类的计算机算法。客观评价受训者的操作技能的能力对于以能力为基础的医学教育的成功是非常宝贵的。尽管计算机视觉的技术进步已经导致了有希望的临床应用，但它们尚未在外科教育中得到利用。方法：一项单机构的前瞻性研究，招募教师和实习外科医生使用台式模拟器来完成腹腔镜手术基础的“精确切割”任务。基于变压器神经网络模型，开发了一种人工智能（AI）计算机算法，将腹腔镜手术的视频分为专家和新手。绩效指标的报告符合个体预后或诊断多变量预测模型透明报告指南。该模型使用五重交叉验证进行训练。通过敏感性、特异性、阳性预测值、阴性预测值、准确性、F1评分和曲线下面积（AUC）来评价模型的性能。结果在折叠间取平均值，并为每个指标计算95%置信区间。绘制ROC曲线以显示模型的性能。结果：内部数据集包括来自2022年至2023年期间招募的46名参与者的135个视频。其中，30名参与者（65.2%）是初级外科住院医师或医学生，16名参与者（34.8%）是具有腹腔镜经验的委员会认证外科医生。经过交叉验证，AI模型在不受任务完成时间影响的情况下，基于视频分析对新手和专家组进行分类的准确率为0.867。对于单幅图像分类，该模型的准确率为0.57。结论：这项概念验证研究是对人工智能在外科技能分类中的应用的试点调查，展示了计算机视觉在自动、客观地分类外科专业知识方面的应用。虽然结果显示有希望，但需要进一步验证以确定其在常规外科培训和认证中的实用性。通过提供客观的评估，该技术可以支持和加强人类评估人员在外科教育中的作用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Surgical Endoscopy And Other Interventional Techniques 医学-外科

CiteScore

6.10

自引率

12.90%

发文量

890

审稿时长

6 months

期刊介绍： Uniquely positioned at the interface between various medical and surgical disciplines, Surgical Endoscopy serves as a focal point for the international surgical community to exchange information on practice, theory, and research. Topics covered in the journal include: -Surgical aspects of: Interventional endoscopy, Ultrasound, Other techniques in the fields of gastroenterology, obstetrics, gynecology, and urology, -Gastroenterologic surgery -Thoracic surgery -Traumatic surgery -Orthopedic surgery -Pediatric surgery