Effect of Artificial Intelligence-Augmented Human Instruction on Feedback Frequency and Surgical Performance During Simulation Training.

IF 2.1

Journal of surgical education Pub Date : 2025-10-08 DOI:10.1016/j.jsurg.2025.103743

Vanja Davidovic, Bianca Giglio, Abdulmajeed Albeloushi, Ahmad Kh Alhaj, Mohamed Alhantoobi, Rothaina Saeedi, Sabrina Deraiche, Recai Yilmaz, Trisha Tee, Ali M Fazlollahi, Matthew Ha, Abicumaran Uthamacumaran, Neevya Balasubramaniam, José A Correa, Rolando F Del Maestro

{"title":"Effect of Artificial Intelligence-Augmented Human Instruction on Feedback Frequency and Surgical Performance During Simulation Training.","authors":"Vanja Davidovic, Bianca Giglio, Abdulmajeed Albeloushi, Ahmad Kh Alhaj, Mohamed Alhantoobi, Rothaina Saeedi, Sabrina Deraiche, Recai Yilmaz, Trisha Tee, Ali M Fazlollahi, Matthew Ha, Abicumaran Uthamacumaran, Neevya Balasubramaniam, José A Correa, Rolando F Del Maestro","doi":"10.1016/j.jsurg.2025.103743","DOIUrl":null,"url":null,"abstract":"Objective: To determine whether personalized feedback from a human instructor receiving artificial intelligence (AI) error data will result in reduced feedback frequency and improvement of surgical skill compared to AI instruction. As feedback was only provided following AI error detection, a reduced feedback frequency is associated with fewer errors in performance. We hypothesized that AI-augmented personalized instruction would result in reduced feedback frequency and improvement in technical skill.Design: This cross-sectional cohort study was a follow-up of a randomized controlled trial, which utilized the NeuroVR, an immersive virtual reality neurosurgical simulator. Participants were stratified by year in medical school and block randomized to receive one of 3 educational interventions as they performed simulated procedures on the NeuroVR: AI tutor instruction, scripted human instruction, and AI-augmented personalized instruction. Performance was assessed by the feedback frequency and technical skill performance metrics.Clinicaltrials: gov ID: NCT06273579.Setting: Neurosurgical Simulation and Artificial Intelligence Learning Centre, McGill University, Montreal, Canada.Participants: Volunteer sample of medical students from 4 Quebec universities in preparatory, first, or second year without prior use of the NeuroVR. Eighty-eight students participated in the study with 87 included in the final analysis; 1 was excluded due to technical issues.Results: By the third repetition, the AI-augmented personalized instruction group received significantly fewer total instructions (incidence rate ratio [IRR], 1.50 [95% CI, 1.16-1.94] instructions; p < 0.001), and high aspirator force instructions (IRR, 1.71 [95% CI, 1.15-2.55] instructions; p = 0.002), compared to the second repetition. Compared to AI tutor instruction, AI-augmented personalized instruction resulted in a significantly lower rate of healthy tissue removal (p = 0.01), instrument tip separation distance (mean ratio, 1.25 [95% CI, 1.05-1.50] mm; p = 0.008), and aspirator force (mean ratio, 1.68 [95% CI, 1.23-2.31] N; p < 0.001). AI-augmented personalized instruction showed a significant improvement from baseline in all subsequent repetitions for all performance metrics.Conclusions: This cohort study demonstrated that AI-augmented personalized instruction resulted in less frequent feedback, indicating fewer errors in trainee performance, and an improvement in simulated surgical skills.","PeriodicalId":94109,"journal":{"name":"Journal of surgical education","volume":"82 11","pages":"103743"},"PeriodicalIF":2.1000,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of surgical education","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.jsurg.2025.103743","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Objective: To determine whether personalized feedback from a human instructor receiving artificial intelligence (AI) error data will result in reduced feedback frequency and improvement of surgical skill compared to AI instruction. As feedback was only provided following AI error detection, a reduced feedback frequency is associated with fewer errors in performance. We hypothesized that AI-augmented personalized instruction would result in reduced feedback frequency and improvement in technical skill.

Design: This cross-sectional cohort study was a follow-up of a randomized controlled trial, which utilized the NeuroVR, an immersive virtual reality neurosurgical simulator. Participants were stratified by year in medical school and block randomized to receive one of 3 educational interventions as they performed simulated procedures on the NeuroVR: AI tutor instruction, scripted human instruction, and AI-augmented personalized instruction. Performance was assessed by the feedback frequency and technical skill performance metrics.

Clinicaltrials: gov ID: NCT06273579.

Setting: Neurosurgical Simulation and Artificial Intelligence Learning Centre, McGill University, Montreal, Canada.

Participants: Volunteer sample of medical students from 4 Quebec universities in preparatory, first, or second year without prior use of the NeuroVR. Eighty-eight students participated in the study with 87 included in the final analysis; 1 was excluded due to technical issues.

Results: By the third repetition, the AI-augmented personalized instruction group received significantly fewer total instructions (incidence rate ratio [IRR], 1.50 [95% CI, 1.16-1.94] instructions; p < 0.001), and high aspirator force instructions (IRR, 1.71 [95% CI, 1.15-2.55] instructions; p = 0.002), compared to the second repetition. Compared to AI tutor instruction, AI-augmented personalized instruction resulted in a significantly lower rate of healthy tissue removal (p = 0.01), instrument tip separation distance (mean ratio, 1.25 [95% CI, 1.05-1.50] mm; p = 0.008), and aspirator force (mean ratio, 1.68 [95% CI, 1.23-2.31] N; p < 0.001). AI-augmented personalized instruction showed a significant improvement from baseline in all subsequent repetitions for all performance metrics.

Conclusions: This cohort study demonstrated that AI-augmented personalized instruction resulted in less frequent feedback, indicating fewer errors in trainee performance, and an improvement in simulated surgical skills.

查看原文本刊更多论文

人工智能增强人工指导对模拟训练中反馈频率和手术性能的影响。

目的：确定与人工智能指导相比，人工智能指导接收人工智能（AI）错误数据的个性化反馈是否会减少反馈频率，提高手术技能。由于只有在AI错误检测之后才会提供反馈，因此反馈频率越低，性能中的错误就越少。我们假设，人工智能增强的个性化教学将减少反馈频率，提高技术技能。设计：本横断面队列研究是一项随机对照试验的后续研究，该试验利用了沉浸式虚拟现实神经外科模拟器NeuroVR。参与者在医学院按年级分层，并随机接受三种教育干预措施中的一种，因为他们在NeuroVR上进行模拟过程：人工智能导师指导、脚本化的人类指导和人工智能增强的个性化指导。通过反馈频率和技术技能绩效度量来评估绩效。临床试验：gov ID: NCT06273579。地点：加拿大蒙特利尔麦吉尔大学神经外科模拟与人工智能学习中心。参与者：来自魁北克四所大学预科、一年级或二年级的医科学生的志愿者样本，之前没有使用过NeuroVR。88名学生参与了这项研究，其中87人被纳入最终分析；1因技术问题被排除。结果：到第三次重复时，人工智能增强个性化指导组收到的总指令明显减少(发生率比[IRR]， 1.50 [95% CI, 1.16-1.94]指令；p 结论：本队列研究表明，人工智能增强个性化指导导致反馈频率降低，表明受训者表现错误减少，并提高了模拟手术技能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of surgical education

自引率

0.00%

发文量