[Validation of artificial intelligence algorithms for the surgical practice].

Chirurgie (Heidelberg, Germany) Pub Date : 2025-07-11 DOI:10.1007/s00104-025-02348-2

Annika Reinke

引用次数: 0

Abstract

Background: Artificial intelligence (AI) is increasingly being used in surgery; however, the validation of such systems is often methodologically insufficient.

Objective: Which validation issues arise in surgical AI and what requirements can be derived for clinically meaningful validation strategies?

Methods: Metric-related pitfalls reported in the literature were analyzed, combined with insights from the interdisciplinary consensus process "metrics reloaded" and its ongoing extension to surgical applications.

Results: Recurring weaknesses are observed at the levels of data, metrics and reporting. The lack of consideration of temporal structures and aggregation in video data is particularly critical.

Discussion: A structured, clinically grounded validation is essential for the safe use of surgical AI. The metrics reloaded procedure is currently being adapted to address surgery-specific requirements.

查看原文本刊更多论文

[人工智能算法在外科实践中的验证]。

背景：人工智能（AI）越来越多地应用于外科手术；然而，这种系统的验证通常在方法上是不够的。目的：在外科人工智能中出现了哪些验证问题，对于临床有意义的验证策略可以提出哪些要求？方法：结合跨学科共识过程“指标重新加载”及其持续扩展到外科应用的见解，分析文献中报道的与指标相关的陷阱。结果：在数据、度量和报告的层面上观察到反复出现的弱点。在视频数据中缺乏对时间结构和聚合的考虑尤其重要。讨论：结构化的、有临床基础的验证对于外科人工智能的安全使用至关重要。目前，重新加载程序的指标正在调整，以满足特定手术的要求。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Chirurgie (Heidelberg, Germany)

自引率

0.00%

发文量