Use of artificial intelligence in the analysis of digital videos of invasive surgical procedures: scoping review.

IF 4.5 3区 医学 Q1 SURGERY
BJS Open Pub Date : 2025-07-01 DOI:10.1093/bjsopen/zraf073
Anni King, George E Fowler, Rhiannon C Macefield, Hamish Walker, Charlie Thomas, Sheraz Markar, Ethan Higgins, Jane M Blazeby, Natalie S Blencowe
{"title":"Use of artificial intelligence in the analysis of digital videos of invasive surgical procedures: scoping review.","authors":"Anni King, George E Fowler, Rhiannon C Macefield, Hamish Walker, Charlie Thomas, Sheraz Markar, Ethan Higgins, Jane M Blazeby, Natalie S Blencowe","doi":"10.1093/bjsopen/zraf073","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Surgical videos are a valuable data source, offering detailed insights into surgical practice. However, video analysis requires specialist clinical knowledge and takes considerable time. Artificial intelligence (AI) has the potential to improve and streamline the interpretation of intraoperative video data. This systematic scoping review aimed to summarize the use of AI in the analysis of videos of surgical procedures and identify evidence gaps.</p><p><strong>Methods: </strong>Systematic searches of Ovid MEDLINE and Embase were performed using search terms 'artificial intelligence', 'video', and 'surgery'. Data extraction included reporting of general study characteristics; the overall objective of AI; descriptions of data sets, AI models, and training; methods of data annotation; and measures of accuracy. Data were summarized descriptively.</p><p><strong>Results: </strong>In all, 122 studies were included. More than half focused on gastrointestinal procedures (75 studies, 61.5%), predominantly cholecystectomy (47, 38.5%). The most common objectives were surgical phase recognition (40 studies, 32.8%), surgical instrument recognition (28, 23.0%), and enhanced intraoperative visualization (23, 18.9%). Of the studies, 79.5% (97) used a single data set and most (92, 75.4%) used supervised machine learning techniques. There was considerable variation across the studies in terms of the number of videos, centres, and contributing surgeons. Forty-seven studies (38.5%) did not report the number of annotators, and details about their experience were frequently omitted (102, 83.6%). Most studies used multiple outcome measures (67, 54.9%), most commonly overall or best accuracy of the AI model (67, 54.9%).</p><p><strong>Conclusion: </strong>This review found that many studies omitted essential methodological details of AI training, testing, data annotation, and validation processes, creating difficulties when interpreting and replicating these studies. Another key finding was the lack of large data sets from multiple centres and surgeons. Future research should focus on curating large, varied, open-access data sets from multiple centres, patients, and surgeons to facilitate accurate evaluation using real-world data.</p>","PeriodicalId":9028,"journal":{"name":"BJS Open","volume":"9 4","pages":""},"PeriodicalIF":4.5000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12268333/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BJS Open","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1093/bjsopen/zraf073","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"SURGERY","Score":null,"Total":0}
引用次数: 0

Abstract

Introduction: Surgical videos are a valuable data source, offering detailed insights into surgical practice. However, video analysis requires specialist clinical knowledge and takes considerable time. Artificial intelligence (AI) has the potential to improve and streamline the interpretation of intraoperative video data. This systematic scoping review aimed to summarize the use of AI in the analysis of videos of surgical procedures and identify evidence gaps.

Methods: Systematic searches of Ovid MEDLINE and Embase were performed using search terms 'artificial intelligence', 'video', and 'surgery'. Data extraction included reporting of general study characteristics; the overall objective of AI; descriptions of data sets, AI models, and training; methods of data annotation; and measures of accuracy. Data were summarized descriptively.

Results: In all, 122 studies were included. More than half focused on gastrointestinal procedures (75 studies, 61.5%), predominantly cholecystectomy (47, 38.5%). The most common objectives were surgical phase recognition (40 studies, 32.8%), surgical instrument recognition (28, 23.0%), and enhanced intraoperative visualization (23, 18.9%). Of the studies, 79.5% (97) used a single data set and most (92, 75.4%) used supervised machine learning techniques. There was considerable variation across the studies in terms of the number of videos, centres, and contributing surgeons. Forty-seven studies (38.5%) did not report the number of annotators, and details about their experience were frequently omitted (102, 83.6%). Most studies used multiple outcome measures (67, 54.9%), most commonly overall or best accuracy of the AI model (67, 54.9%).

Conclusion: This review found that many studies omitted essential methodological details of AI training, testing, data annotation, and validation processes, creating difficulties when interpreting and replicating these studies. Another key finding was the lack of large data sets from multiple centres and surgeons. Future research should focus on curating large, varied, open-access data sets from multiple centres, patients, and surgeons to facilitate accurate evaluation using real-world data.

人工智能在侵入性外科手术数字视频分析中的应用:范围审查。
手术视频是一个有价值的数据源,提供详细的见解手术实践。然而,视频分析需要专业的临床知识,并且需要相当长的时间。人工智能(AI)具有改进和简化术中视频数据解释的潜力。本系统的范围综述旨在总结人工智能在外科手术视频分析中的应用,并确定证据差距。方法:系统检索Ovid MEDLINE和Embase,检索词为“人工智能”、“视频”和“外科”。数据提取包括一般研究特征的报告;人工智能的总体目标;数据集、人工智能模型和训练的描述;数据标注方法;以及准确性的度量。对数据进行描述性总结。结果:共纳入122项研究。超过一半的研究集中在胃肠手术(75项研究,61.5%),主要是胆囊切除术(47项,38.5%)。最常见的目标是手术阶段识别(40项研究,32.8%),手术器械识别(28项,23.0%)和增强术中可视化(23项,18.9%)。在这些研究中,79.5%(97)使用了单个数据集,大多数(92,75.4%)使用了监督式机器学习技术。这些研究在视频、中心和参与手术的外科医生的数量方面存在相当大的差异。47项研究(38.5%)没有报告注释者的数量,并且经常遗漏注释者的经验细节(102,83.6%)。大多数研究使用多个结果测量(67,54.9%),最常见的是人工智能模型的总体或最佳准确性(67,54.9%)。结论:本综述发现,许多研究忽略了人工智能训练、测试、数据注释和验证过程的基本方法学细节,给解释和复制这些研究带来了困难。另一个重要发现是缺乏来自多个中心和外科医生的大型数据集。未来的研究应侧重于整理来自多个中心、患者和外科医生的大型、多样、开放获取的数据集,以促进使用真实世界数据的准确评估。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
BJS Open
BJS Open SURGERY-
CiteScore
6.00
自引率
3.20%
发文量
144
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信