Do Treatment Choices by Artificial Intelligence Correspond to Reality? Retrospective Comparative Research with Necrotizing Enterocolitis as a Use Case.
Rosa Verhoeven, Stella Mulia, Elisabeth M W Kooi, Jan B F Hulscher
{"title":"Do Treatment Choices by Artificial Intelligence Correspond to Reality? Retrospective Comparative Research with Necrotizing Enterocolitis as a Use Case.","authors":"Rosa Verhoeven, Stella Mulia, Elisabeth M W Kooi, Jan B F Hulscher","doi":"10.1177/0272989X251324530","DOIUrl":null,"url":null,"abstract":"<p><p>BackgroundIn cases of surgical necrotizing enterocolitis (NEC), the choice between laparotomy (LAP) or comfort care (CC) presents a complex, ethical dilemma. A behavioral artificial intelligence technology (BAIT) decision aid was trained on expert knowledge, providing an output as \"<i>x</i> percentage of experts advise laparotomy for this patient.\" This retrospective study aims to compare this output to clinical practice.DesignVariables required for the decision aid were collected of preterm patients with NEC for whom the decision of LAP or CC had been made. These data were used in 2 BAIT model versions: one center specific, built on the input of experts from the same center as the patients, and a nationwide version, incorporating the input of additional experts. The Mann-Whitney <i>U</i> test compared the model output for the 2 groups (LAP/CC). In addition, model output was classified as advice for LAP or CC, after which the chi-square test assessed correspondence with observed decisions.ResultsForty patients were included in the study (20 LAP). Model output (<i>x</i> percentage of experts advising LAP) was higher in the LAP group than in the CC group (median 95.1% v. 46.1% in the center-specific version and 97.3% v. 67.5% in the nationwide version, both <i>P</i> < 0.001). With an accuracy of 85.0% by the center-specific and 80.0% by the nationwide version, both showed significant correspondence with observed decisions (<i>P</i> < 0.001).LimitationsWe are merely examining a proof of concept of the decision aid using a small number of participants from 1 center.ConclusionsThis retrospective study demonstrates that treatment choices by artificial intelligence align with clinical practice in at least 80% of cases.ImplicationsFollowing prospective validation and ongoing refinements, the decision aid may offer valuable support to practitioners in future NEC cases.HighlightsThis study assesses the output of behavioral artificial intelligence technology in deciding between laparotomy and comfort care in surgical necrotizing enterocolitis.The model output aligns with clinical practice in at least 80% of patient cases.Following prospective validation, the decision aid may offer valuable support to physicians working at the neonatal intensive care unit.</p>","PeriodicalId":49839,"journal":{"name":"Medical Decision Making","volume":" ","pages":"272989X251324530"},"PeriodicalIF":3.1000,"publicationDate":"2025-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical Decision Making","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1177/0272989X251324530","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0
Abstract
BackgroundIn cases of surgical necrotizing enterocolitis (NEC), the choice between laparotomy (LAP) or comfort care (CC) presents a complex, ethical dilemma. A behavioral artificial intelligence technology (BAIT) decision aid was trained on expert knowledge, providing an output as "x percentage of experts advise laparotomy for this patient." This retrospective study aims to compare this output to clinical practice.DesignVariables required for the decision aid were collected of preterm patients with NEC for whom the decision of LAP or CC had been made. These data were used in 2 BAIT model versions: one center specific, built on the input of experts from the same center as the patients, and a nationwide version, incorporating the input of additional experts. The Mann-Whitney U test compared the model output for the 2 groups (LAP/CC). In addition, model output was classified as advice for LAP or CC, after which the chi-square test assessed correspondence with observed decisions.ResultsForty patients were included in the study (20 LAP). Model output (x percentage of experts advising LAP) was higher in the LAP group than in the CC group (median 95.1% v. 46.1% in the center-specific version and 97.3% v. 67.5% in the nationwide version, both P < 0.001). With an accuracy of 85.0% by the center-specific and 80.0% by the nationwide version, both showed significant correspondence with observed decisions (P < 0.001).LimitationsWe are merely examining a proof of concept of the decision aid using a small number of participants from 1 center.ConclusionsThis retrospective study demonstrates that treatment choices by artificial intelligence align with clinical practice in at least 80% of cases.ImplicationsFollowing prospective validation and ongoing refinements, the decision aid may offer valuable support to practitioners in future NEC cases.HighlightsThis study assesses the output of behavioral artificial intelligence technology in deciding between laparotomy and comfort care in surgical necrotizing enterocolitis.The model output aligns with clinical practice in at least 80% of patient cases.Following prospective validation, the decision aid may offer valuable support to physicians working at the neonatal intensive care unit.
期刊介绍:
Medical Decision Making offers rigorous and systematic approaches to decision making that are designed to improve the health and clinical care of individuals and to assist with health care policy development. Using the fundamentals of decision analysis and theory, economic evaluation, and evidence based quality assessment, Medical Decision Making presents both theoretical and practical statistical and modeling techniques and methods from a variety of disciplines.