[Validation of artificial intelligence algorithms for the surgical practice].

Chirurgie (Heidelberg, Germany) Pub Date: 2025-07-11 DOI: 10.1007/s00104-025-02348-2

Annika Reinke


Citations: 0

Abstract


[Validation of artificial intelligence algorithms for the surgical practice].

Background: Artificial intelligence (AI) is increasingly being used in surgery; however, the validation of such systems is often methodologically insufficient.

Objective: Which validation issues arise in surgical AI and what requirements can be derived for clinically meaningful validation strategies?

Methods: Metric-related pitfalls reported in the literature were analyzed, combined with insights from the interdisciplinary consensus process "metrics reloaded" and its ongoing extension to surgical applications.

Results: Recurring weaknesses are observed at the levels of data, metrics and reporting. The lack of consideration of temporal structures and aggregation in video data is particularly critical.

Discussion: A structured, clinically grounded validation is essential for the safe use of surgical AI. The metrics reloaded procedure is currently being adapted to address surgery-specific requirements.
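The point about aggregation in video data can be illustrated with a toy calculation. The sketch below is not taken from the article; the function names and the data are hypothetical, and frame-wise accuracy is used only as a stand-in metric. It assumes two videos of very different length and compares pooling all frames into one score with first scoring each video and then averaging over videos.

```python
from statistics import mean


def frame_accuracy(pred, truth):
    """Fraction of frames whose predicted label matches the reference label."""
    return sum(p == t for p, t in zip(pred, truth)) / len(truth)


def pooled_accuracy(videos):
    """Pool the frames of all videos and compute a single accuracy."""
    correct = sum(
        sum(p == t for p, t in zip(pred, truth)) for pred, truth in videos
    )
    total = sum(len(truth) for _, truth in videos)
    return correct / total


def per_video_accuracy(videos):
    """Compute accuracy for each video, then average over videos."""
    return mean(frame_accuracy(pred, truth) for pred, truth in videos)


# Hypothetical toy data: (predicted labels, reference labels) per video.
long_video = ([1] * 90 + [0] * 10, [1] * 100)  # 100 frames, 90 correct
short_video = ([0] * 8 + [1] * 2, [1] * 10)    # 10 frames, 2 correct
videos = [long_video, short_video]

print(pooled_accuracy(videos))     # 92 correct frames out of 110
print(per_video_accuracy(videos))  # average of 0.90 and 0.20
```

Under these assumptions the pooled score is dominated by the long video and hides the poor result on the short one, whereas the per-video average weights both procedures equally. Which aggregation is appropriate depends on the clinical question, so a report should state which one was used.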

