Can medical students use artificial intelligence to learn transfusion? Evaluating ChatGPT responses to the American Society of Hematology medical student transfusion learning objectives.
{"title":"Can medical students use artificial intelligence to learn transfusion? Evaluating ChatGPT responses to the American Society of Hematology medical student transfusion learning objectives.","authors":"Eileen McBride, Elaine Leung, Jason Ford","doi":"10.1111/vox.70070","DOIUrl":null,"url":null,"abstract":"<p><strong>Background and objectives: </strong>Chat generative pretrained transformer (ChatGPT) is a large language model that is already in wide use among medical students as a means of learning. Many papers have evaluated ChatGPT as a presenter of medical knowledge for the general public and as a test-taking engine. For students who rely on ChatGPT to learn transfusion medicine, it is important to understand the limitations of the application.</p><p><strong>Materials and methods: </strong>Transfusion content from the American Society of Hematology 'medical student learning objectives' was edited into questions for the ChatGPT interface. The answers generated by ChatGPT were then marked by three experienced transfusion medicine physicians.</p><p><strong>Results: </strong>ChatGPT scored on average 2.27 ± 0.21 on a 4-point scale. Two-thirds of its answers scored A, B or C, representing excellent, good or satisfactory achievement, respectively. One-third of ChatGPT's answers were assigned a failing grade. Simple questions of basic transfusion science performed the best; more complex questions as well as questions where clinical practice has evolved substantially over the last several years performed the worst. Some answers were assessed to be unsafe in clinical practice.</p><p><strong>Conclusion: </strong>As a resource for medical students learning transfusion medicine, ChatGPT has significant limitations. A considerable proportion of its answers to transfusion questions are unreliable, inaccurate and even unsafe. These incorrect answers are presented with the same authoritative tone as its correct answers, and an inexperienced learner would be challenged to differentiate between true and untrue responses. At the present time, it is not recommended for medical students to use ChatGPT to learn transfusion medicine.</p>","PeriodicalId":23631,"journal":{"name":"Vox Sanguinis","volume":" ","pages":""},"PeriodicalIF":1.8000,"publicationDate":"2025-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Vox Sanguinis","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1111/vox.70070","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"HEMATOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background and objectives: Chat generative pretrained transformer (ChatGPT) is a large language model that is already in wide use among medical students as a means of learning. Many papers have evaluated ChatGPT as a presenter of medical knowledge for the general public and as a test-taking engine. For students who rely on ChatGPT to learn transfusion medicine, it is important to understand the limitations of the application.
Materials and methods: Transfusion content from the American Society of Hematology 'medical student learning objectives' was edited into questions for the ChatGPT interface. The answers generated by ChatGPT were then marked by three experienced transfusion medicine physicians.
Results: ChatGPT scored on average 2.27 ± 0.21 on a 4-point scale. Two-thirds of its answers scored A, B or C, representing excellent, good or satisfactory achievement, respectively. One-third of ChatGPT's answers were assigned a failing grade. Simple questions of basic transfusion science performed the best; more complex questions as well as questions where clinical practice has evolved substantially over the last several years performed the worst. Some answers were assessed to be unsafe in clinical practice.
Conclusion: As a resource for medical students learning transfusion medicine, ChatGPT has significant limitations. A considerable proportion of its answers to transfusion questions are unreliable, inaccurate and even unsafe. These incorrect answers are presented with the same authoritative tone as its correct answers, and an inexperienced learner would be challenged to differentiate between true and untrue responses. At the present time, it is not recommended for medical students to use ChatGPT to learn transfusion medicine.
期刊介绍:
Vox Sanguinis reports on important, novel developments in transfusion medicine. Original papers, reviews and international fora are published on all aspects of blood transfusion and tissue transplantation, comprising five main sections:
1) Transfusion - Transmitted Disease and its Prevention:
Identification and epidemiology of infectious agents transmissible by blood;
Bacterial contamination of blood components;
Donor recruitment and selection methods;
Pathogen inactivation.
2) Blood Component Collection and Production:
Blood collection methods and devices (including apheresis);
Plasma fractionation techniques and plasma derivatives;
Preparation of labile blood components;
Inventory management;
Hematopoietic progenitor cell collection and storage;
Collection and storage of tissues;
Quality management and good manufacturing practice;
Automation and information technology.
3) Transfusion Medicine and New Therapies:
Transfusion thresholds and audits;
Haemovigilance;
Clinical trials regarding appropriate haemotherapy;
Non-infectious adverse affects of transfusion;
Therapeutic apheresis;
Support of transplant patients;
Gene therapy and immunotherapy.
4) Immunohaematology and Immunogenetics:
Autoimmunity in haematology;
Alloimmunity of blood;
Pre-transfusion testing;
Immunodiagnostics;
Immunobiology;
Complement in immunohaematology;
Blood typing reagents;
Genetic markers of blood cells and serum proteins: polymorphisms and function;
Genetic markers and disease;
Parentage testing and forensic immunohaematology.
5) Cellular Therapy:
Cell-based therapies;
Stem cell sources;
Stem cell processing and storage;
Stem cell products;
Stem cell plasticity;
Regenerative medicine with cells;
Cellular immunotherapy;
Molecular therapy;
Gene therapy.