Timothy Trewren, Nicholas Fitzgerald, Sarah Jaensch, Olivia Nguyen, Alexander Tsymbal, Christina Gao, Brandon Stretton, Stewart Anderson, D-Yin Lin, Dario Winterton, Galina Gheihman, Guy Ludbrook, Kelly Bratkovic, Stephen Bacchi
{"title":"人工智能在围手术期医学教育中的应用:基于案例学习的可行性检验。","authors":"Timothy Trewren, Nicholas Fitzgerald, Sarah Jaensch, Olivia Nguyen, Alexander Tsymbal, Christina Gao, Brandon Stretton, Stewart Anderson, D-Yin Lin, Dario Winterton, Galina Gheihman, Guy Ludbrook, Kelly Bratkovic, Stephen Bacchi","doi":"10.1177/17504589251346634","DOIUrl":null,"url":null,"abstract":"<p><p>The use of artificial intelligence in medicine is rapidly expanding. Large language models, such as ChatGPT, have the potential to enhance perioperative medicine through education and clinical practice. However, concerns remain regarding the accuracy of these models, particularly the risk of hallucinations, generating factually incorrect outputs. This feasibility test explores the use of a large language model-enabled platform to assist in case-based education in perioperative clinical cases.</p><p><strong>Methods: </strong>Five perioperative cases addressing core topics were developed and uploaded to a custom large language model platform. The large language model platform allows free-text questions to be asked to the artificial intelligence, which then uses the derived cases to provide answers. Anaesthetic trainees engaged with the artificial intelligence, asking questions to obtain information regarding history, examination, and investigations. Artificial intelligence question-and-answer pairs were then evaluated independently in duplicate for the presence of inappropriate responses, including hallucinations.</p><p><strong>Results: </strong>The large language model responded appropriately to nearly all questions, with no hallucinations observed. The proportion of questions that were answered appropriately was 99.3% (543/547). In the four instances of inappropriate responses, the large language model declined to provide information in the case description rather than hallucinate.</p><p><strong>Conclusion: </strong>The large language model appears capable of supporting the delivery of case-based perioperative medicine content with a high degree of accuracy.</p>","PeriodicalId":35481,"journal":{"name":"Journal of perioperative practice","volume":" ","pages":"17504589251346634"},"PeriodicalIF":1.0000,"publicationDate":"2025-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Artificial intelligence in perioperative medicine education: A feasibility test of case-based learning.\",\"authors\":\"Timothy Trewren, Nicholas Fitzgerald, Sarah Jaensch, Olivia Nguyen, Alexander Tsymbal, Christina Gao, Brandon Stretton, Stewart Anderson, D-Yin Lin, Dario Winterton, Galina Gheihman, Guy Ludbrook, Kelly Bratkovic, Stephen Bacchi\",\"doi\":\"10.1177/17504589251346634\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>The use of artificial intelligence in medicine is rapidly expanding. Large language models, such as ChatGPT, have the potential to enhance perioperative medicine through education and clinical practice. However, concerns remain regarding the accuracy of these models, particularly the risk of hallucinations, generating factually incorrect outputs. This feasibility test explores the use of a large language model-enabled platform to assist in case-based education in perioperative clinical cases.</p><p><strong>Methods: </strong>Five perioperative cases addressing core topics were developed and uploaded to a custom large language model platform. The large language model platform allows free-text questions to be asked to the artificial intelligence, which then uses the derived cases to provide answers. Anaesthetic trainees engaged with the artificial intelligence, asking questions to obtain information regarding history, examination, and investigations. Artificial intelligence question-and-answer pairs were then evaluated independently in duplicate for the presence of inappropriate responses, including hallucinations.</p><p><strong>Results: </strong>The large language model responded appropriately to nearly all questions, with no hallucinations observed. The proportion of questions that were answered appropriately was 99.3% (543/547). In the four instances of inappropriate responses, the large language model declined to provide information in the case description rather than hallucinate.</p><p><strong>Conclusion: </strong>The large language model appears capable of supporting the delivery of case-based perioperative medicine content with a high degree of accuracy.</p>\",\"PeriodicalId\":35481,\"journal\":{\"name\":\"Journal of perioperative practice\",\"volume\":\" \",\"pages\":\"17504589251346634\"},\"PeriodicalIF\":1.0000,\"publicationDate\":\"2025-06-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of perioperative practice\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1177/17504589251346634\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"SURGERY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of perioperative practice","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/17504589251346634","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"SURGERY","Score":null,"Total":0}
Artificial intelligence in perioperative medicine education: A feasibility test of case-based learning.
The use of artificial intelligence in medicine is rapidly expanding. Large language models, such as ChatGPT, have the potential to enhance perioperative medicine through education and clinical practice. However, concerns remain regarding the accuracy of these models, particularly the risk of hallucinations, generating factually incorrect outputs. This feasibility test explores the use of a large language model-enabled platform to assist in case-based education in perioperative clinical cases.
Methods: Five perioperative cases addressing core topics were developed and uploaded to a custom large language model platform. The large language model platform allows free-text questions to be asked to the artificial intelligence, which then uses the derived cases to provide answers. Anaesthetic trainees engaged with the artificial intelligence, asking questions to obtain information regarding history, examination, and investigations. Artificial intelligence question-and-answer pairs were then evaluated independently in duplicate for the presence of inappropriate responses, including hallucinations.
Results: The large language model responded appropriately to nearly all questions, with no hallucinations observed. The proportion of questions that were answered appropriately was 99.3% (543/547). In the four instances of inappropriate responses, the large language model declined to provide information in the case description rather than hallucinate.
Conclusion: The large language model appears capable of supporting the delivery of case-based perioperative medicine content with a high degree of accuracy.
期刊介绍:
The Journal of Perioperative Practice (JPP) is the official journal of the Association for Perioperative Practice (AfPP). It is an international, peer reviewed journal with a multidisciplinary ethos across all aspects of perioperative care. The overall aim of the journal is to improve patient safety through informing and developing practice. It is an informative professional journal which provides current evidence-based practice, clinical, management and educational developments for practitioners working in the perioperative environment. The journal promotes perioperative practice by publishing clinical research-based articles, literature reviews, topical discussions, advice on clinical issues, current news items and product information.