{"title":"Effect of machine learning models on clinician prediction of postoperative complications: the Perioperative ORACLE randomised clinical trial","authors":"","doi":"10.1016/j.bja.2024.08.004","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Anaesthesiologists might be able to mitigate risk if they know which patients are at greatest risk for postoperative complications. This trial examined the impact of machine learning models on clinician risk assessment.</div></div><div><h3>Methods</h3><div>This single-centre, prospective, randomised clinical trial enrolled surgical patients aged ≥18 yr. Anaesthesiologists and nurse anaesthetists providing remote telemedicine support reviewed electronic health records with (assisted group) or without (unassisted group) reviewing machine learning predictions. Clinicians predicted the likelihood of postoperative 30-day all-cause mortality and postoperative acute kidney injury (AKI) within 7 days. The primary outcome was area under the receiver operating characteristic curve (AUROC) for clinician predictions of mortality and AKI, comparing AUROCs between assisted and unassisted assessments.</div></div><div><h3>Results</h3><div>We analysed 5071 patients (mean [range] age: 58 [18-100] yr; 52% female) assessed by 89 clinicians. Of these, 98 (2.2%) patients died within 30 days of surgery and 450 (11.1%) patients sustained AKI. Clinician predictions agreed with the models more strongly in the assisted <em>vs</em> unassisted group (weighted kappa 0.75 <em>vs</em> 0.62 for death, mean difference: 0.13 [95% CI 0.10–0.17]; and 0.79 <em>vs</em> 0.54 for AKI, mean difference: 0.25 [95% CI 0.21–0.29]). Clinical prediction of death was similar between the assisted (AUROC 0.793) and unassisted (AUROC 0.780) groups (mean difference: 0.013 [95% CI –0.070 to 0.097]; <em>P</em>=0.76). Prediction of AKI had an AUROC of 0.734 in the assisted group <em>vs</em> 0.688 in the unassisted group (difference 0.046 [95% CI –0.003 to 0.091]; <em>P</em>=0.06).</div></div><div><h3>Conclusions</h3><div>Clinician performance was not improved by machine learning assistance. Further work is needed to clarify the role of machine learning in real-time perioperative risk stratification.</div></div><div><h3>Clinical trial registration</h3><div>NCT05042804.</div></div>","PeriodicalId":9250,"journal":{"name":"British journal of anaesthesia","volume":null,"pages":null},"PeriodicalIF":9.1000,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"British journal of anaesthesia","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0007091224004689","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ANESTHESIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background
Anaesthesiologists might be able to mitigate risk if they know which patients are at greatest risk for postoperative complications. This trial examined the impact of machine learning models on clinician risk assessment.
Methods
This single-centre, prospective, randomised clinical trial enrolled surgical patients aged ≥18 yr. Anaesthesiologists and nurse anaesthetists providing remote telemedicine support reviewed electronic health records with (assisted group) or without (unassisted group) reviewing machine learning predictions. Clinicians predicted the likelihood of postoperative 30-day all-cause mortality and postoperative acute kidney injury (AKI) within 7 days. The primary outcome was area under the receiver operating characteristic curve (AUROC) for clinician predictions of mortality and AKI, comparing AUROCs between assisted and unassisted assessments.
Results
We analysed 5071 patients (mean [range] age: 58 [18-100] yr; 52% female) assessed by 89 clinicians. Of these, 98 (2.2%) patients died within 30 days of surgery and 450 (11.1%) patients sustained AKI. Clinician predictions agreed with the models more strongly in the assisted vs unassisted group (weighted kappa 0.75 vs 0.62 for death, mean difference: 0.13 [95% CI 0.10–0.17]; and 0.79 vs 0.54 for AKI, mean difference: 0.25 [95% CI 0.21–0.29]). Clinical prediction of death was similar between the assisted (AUROC 0.793) and unassisted (AUROC 0.780) groups (mean difference: 0.013 [95% CI –0.070 to 0.097]; P=0.76). Prediction of AKI had an AUROC of 0.734 in the assisted group vs 0.688 in the unassisted group (difference 0.046 [95% CI –0.003 to 0.091]; P=0.06).
Conclusions
Clinician performance was not improved by machine learning assistance. Further work is needed to clarify the role of machine learning in real-time perioperative risk stratification.
期刊介绍:
The British Journal of Anaesthesia (BJA) is a prestigious publication that covers a wide range of topics in anaesthesia, critical care medicine, pain medicine, and perioperative medicine. It aims to disseminate high-impact original research, spanning fundamental, translational, and clinical sciences, as well as clinical practice, technology, education, and training. Additionally, the journal features review articles, notable case reports, correspondence, and special articles that appeal to a broader audience.
The BJA is proudly associated with The Royal College of Anaesthetists, The College of Anaesthesiologists of Ireland, and The Hong Kong College of Anaesthesiologists. This partnership provides members of these esteemed institutions with access to not only the BJA but also its sister publication, BJA Education. It is essential to note that both journals maintain their editorial independence.
Overall, the BJA offers a diverse and comprehensive platform for anaesthetists, critical care physicians, pain specialists, and perioperative medicine practitioners to contribute and stay updated with the latest advancements in their respective fields.