Development, deployment and scaling of operating room-ready artificial intelligence for real-time surgical decision support

IF 12.4 1区医学 Q1 HEALTH CARE SCIENCES & SERVICES

NPJ Digital Medicine Pub Date : 2024-09-03 DOI:10.1038/s41746-024-01225-2

Sergey Protserov, Jaryd Hunter, Haochi Zhang, Pouria Mashouri, Caterina Masino, Michael Brudno, Amin Madani

{"title":"Development, deployment and scaling of operating room-ready artificial intelligence for real-time surgical decision support","authors":"Sergey Protserov, Jaryd Hunter, Haochi Zhang, Pouria Mashouri, Caterina Masino, Michael Brudno, Amin Madani","doi":"10.1038/s41746-024-01225-2","DOIUrl":null,"url":null,"abstract":"Deep learning for computer vision can be leveraged for interpreting surgical scenes and providing surgeons with real-time guidance to avoid complications. However, neither generalizability nor scalability of computer-vision-based surgical guidance systems have been demonstrated, especially to geographic locations that lack hardware and infrastructure necessary for real-time inference. We propose a new equipment-agnostic framework for real-time use in operating suites. Using laparoscopic cholecystectomy and semantic segmentation models for predicting safe/dangerous (“Go”/”No-Go”) zones of dissection as an example use case, this study aimed to develop and test the performance of a novel data pipeline linked to a web-platform that enables real-time deployment from any edge device. To test this infrastructure and demonstrate its scalability and generalizability, lightweight U-Net and SegFormer models were trained on annotated frames from a large and diverse multicenter dataset from 136 institutions, and then tested on a separate prospectively collected dataset. A web-platform was created to enable real-time inference on any surgical video stream, and performance was tested on and optimized for a range of network speeds. The U-Net and SegFormer models respectively achieved mean Dice scores of 57% and 60%, precision 45% and 53%, and recall 82% and 75% for predicting the Go zone, and mean Dice scores of 76% and 76%, precision 68% and 68%, and recall 92% and 92% for predicting the No-Go zone. After optimization of the client-server interaction over the network, we deliver a prediction stream of at least 60 fps and with a maximum round-trip delay of 70 ms for speeds above 8 Mbps. Clinical deployment of machine learning models for surgical guidance is feasible and cost-effective using a generalizable, scalable and equipment-agnostic framework that lacks dependency on hardware with high computing performance or ultra-fast internet connection speed.","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":null,"pages":null},"PeriodicalIF":12.4000,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.nature.com/articles/s41746-024-01225-2.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"NPJ Digital Medicine","FirstCategoryId":"3","ListUrlMain":"https://www.nature.com/articles/s41746-024-01225-2","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}

引用次数: 0

Abstract

Deep learning for computer vision can be leveraged for interpreting surgical scenes and providing surgeons with real-time guidance to avoid complications. However, neither generalizability nor scalability of computer-vision-based surgical guidance systems have been demonstrated, especially to geographic locations that lack hardware and infrastructure necessary for real-time inference. We propose a new equipment-agnostic framework for real-time use in operating suites. Using laparoscopic cholecystectomy and semantic segmentation models for predicting safe/dangerous (“Go”/”No-Go”) zones of dissection as an example use case, this study aimed to develop and test the performance of a novel data pipeline linked to a web-platform that enables real-time deployment from any edge device. To test this infrastructure and demonstrate its scalability and generalizability, lightweight U-Net and SegFormer models were trained on annotated frames from a large and diverse multicenter dataset from 136 institutions, and then tested on a separate prospectively collected dataset. A web-platform was created to enable real-time inference on any surgical video stream, and performance was tested on and optimized for a range of network speeds. The U-Net and SegFormer models respectively achieved mean Dice scores of 57% and 60%, precision 45% and 53%, and recall 82% and 75% for predicting the Go zone, and mean Dice scores of 76% and 76%, precision 68% and 68%, and recall 92% and 92% for predicting the No-Go zone. After optimization of the client-server interaction over the network, we deliver a prediction stream of at least 60 fps and with a maximum round-trip delay of 70 ms for speeds above 8 Mbps. Clinical deployment of machine learning models for surgical guidance is feasible and cost-effective using a generalizable, scalable and equipment-agnostic framework that lacks dependency on hardware with high computing performance or ultra-fast internet connection speed.

Abstract Image

查看原文本刊更多论文

开发、部署和推广可在手术室使用的人工智能，用于实时手术决策支持。

计算机视觉深度学习可用于解释手术场景，并为外科医生提供实时指导，以避免并发症。然而，基于计算机视觉的手术指导系统的通用性和可扩展性均未得到证实，尤其是在缺乏实时推理所需的硬件和基础设施的地理位置。我们为手术室的实时使用提出了一个新的设备无关框架。本研究以腹腔镜胆囊切除术和预测安全/危险（"Go"/"No-Go"）解剖区域的语义分割模型为例，旨在开发和测试与网络平台相连的新型数据管道的性能，该平台可从任何边缘设备进行实时部署。为了测试该基础设施并证明其可扩展性和通用性，我们在来自 136 个机构的大型、多样化多中心数据集的注释帧上训练了轻量级 U-Net 和 SegFormer 模型，然后在单独的前瞻性收集数据集上进行了测试。我们创建了一个网络平台，以便对任何手术视频流进行实时推理，并在各种网速下对性能进行了测试和优化。U-Net 和 SegFormer 模型在预测 "去 "区时的平均 Dice 得分分别为 57% 和 60%，精确度分别为 45% 和 53%，召回率分别为 82% 和 75%；在预测 "不去 "区时的平均 Dice 得分分别为 76% 和 76%，精确度分别为 68% 和 68%，召回率分别为 92% 和 92%。在对网络上的客户端-服务器交互进行优化后，我们提供了至少 60 fps 的预测流，在速度超过 8 Mbps 时，最大往返延迟为 70 毫秒。使用可通用、可扩展和设备无关的框架，将机器学习模型用于外科手术指导是可行的，而且具有成本效益，它不依赖于具有高计算性能或超高速互联网连接速度的硬件。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

NPJ Digital Medicine Multiple-

CiteScore

25.10

自引率

3.30%

发文量

170

审稿时长

15 weeks

期刊介绍： npj Digital Medicine is an online open-access journal that focuses on publishing peer-reviewed research in the field of digital medicine. The journal covers various aspects of digital medicine, including the application and implementation of digital and mobile technologies in clinical settings, virtual healthcare, and the use of artificial intelligence and informatics. The primary goal of the journal is to support innovation and the advancement of healthcare through the integration of new digital and mobile technologies. When determining if a manuscript is suitable for publication, the journal considers four important criteria: novelty, clinical relevance, scientific rigor, and digital innovation.