{"title":"Transforming education: tackling the two sigma problem with AI in journal clubs - a proof of concept.","authors":"Fahad Umer, Nighat Naved, Azra Naseem, Ayesha Mansoor, Syed Murtaza Raza Kazmi","doi":"10.1038/s41405-025-00338-4","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Journal clubs are integral to continuing medical education, promoting critical thinking and evidence-based learning. However, inconsistent engagement, reliance on faculty expertise, and the complexity of research articles can limit their effectiveness. Generative Artificial Intelligence (Gen AI), particularly Large Language Models (LLMs) offers a potential solution, but general-purpose LLMs may generate inaccurate responses (\"hallucinations\"). Retrieval-Augmented Generation (RAG) mitigates this by integrating AI-generated content with curated knowledge sources, ensuring more accurate and contextually relevant responses. This study explores the development and preliminary evaluation of a RAG-enhanced LLM to support journal club discussions.</p><p><strong>Materials and methods: </strong>A specialized LLM was deployed using Microsoft Azure's GPT-4o. A vector database was created by embedding journal club articles using text-embedding-ada-002 (Version 2) for efficient information retrieval. A dedicated website provided user-friendly access. The study followed a design-based research (DBR) approach, engaging residents and faculty who interacted with the LLM before and during journal club sessions. Data collection included focus group discussions (FGDs) and questionnaires assessing engagement, usability, and impact.</p><p><strong>Results: </strong>The study involved a total of 13 residents and three faculty members as participants. 50% of residents reported a positive experience, while the rest had a neutral response, citing both advantages and limitations. The LLM improved article summarization, query responses, and engagement as reported by residents. Moreover, the faculty observed enhanced discussion quality and preparation whereas overall challenges included the need for precise prompts and occasional misleading responses.</p><p><strong>Conclusion: </strong>The study highlights the potential of a RAG-enhanced LLM to improve journal club engagement and learning. Future advancements in AI and open-source models may enhance accessibility, warranting further research.</p>","PeriodicalId":36997,"journal":{"name":"BDJ Open","volume":"11 1","pages":"46"},"PeriodicalIF":2.5000,"publicationDate":"2025-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12062218/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BDJ Open","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1038/s41405-025-00338-4","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"DENTISTRY, ORAL SURGERY & MEDICINE","Score":null,"Total":0}
Abstract
Introduction: Journal clubs are integral to continuing medical education, promoting critical thinking and evidence-based learning. However, inconsistent engagement, reliance on faculty expertise, and the complexity of research articles can limit their effectiveness. Generative Artificial Intelligence (Gen AI), particularly Large Language Models (LLMs), offers a potential solution, but general-purpose LLMs may generate inaccurate responses ("hallucinations"). Retrieval-Augmented Generation (RAG) mitigates this by grounding AI-generated content in curated knowledge sources, producing more accurate and contextually relevant responses. This study explores the development and preliminary evaluation of a RAG-enhanced LLM to support journal club discussions.
Materials and methods: A specialized LLM was deployed using Microsoft Azure's GPT-4o. A vector database was created by embedding journal club articles using text-embedding-ada-002 (Version 2) for efficient information retrieval. A dedicated website provided user-friendly access. The study followed a design-based research (DBR) approach, engaging residents and faculty who interacted with the LLM before and during journal club sessions. Data collection included focus group discussions (FGDs) and questionnaires assessing engagement, usability, and impact.
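The following is a minimal sketch of the kind of RAG pipeline the methods describe: journal club articles embedded with text-embedding-ada-002, stored as vectors, and retrieved to ground GPT-4o's answers. The Azure deployment names, chunking strategy, folder layout, and prompt are illustrative assumptions, not the authors' actual implementation.

```python
# Illustrative RAG sketch (not the authors' code). Assumes Azure OpenAI
# deployments named "gpt-4o" and "text-embedding-ada-002" and a local folder
# of plain-text journal club articles.
import os
import numpy as np
from openai import AzureOpenAI

client = AzureOpenAI(
    api_key=os.environ["AZURE_OPENAI_API_KEY"],
    api_version="2024-02-01",
    azure_endpoint=os.environ["AZURE_OPENAI_ENDPOINT"],
)

def embed(texts: list[str]) -> np.ndarray:
    """Embed a batch of text chunks with the ada-002 deployment."""
    resp = client.embeddings.create(model="text-embedding-ada-002", input=texts)
    return np.array([d.embedding for d in resp.data])

# Build a small in-memory "vector database" from article chunks.
chunks: list[str] = []
for fname in os.listdir("journal_club_articles"):
    with open(os.path.join("journal_club_articles", fname), encoding="utf-8") as f:
        text = f.read()
    # Naive fixed-size chunking; a production system would split more carefully.
    chunks += [text[i:i + 1500] for i in range(0, len(text), 1500)]

chunk_vectors = embed(chunks)

def answer(question: str, k: int = 4) -> str:
    """Retrieve the k most similar chunks and ground GPT-4o's answer in them."""
    q_vec = embed([question])[0]
    sims = chunk_vectors @ q_vec / (
        np.linalg.norm(chunk_vectors, axis=1) * np.linalg.norm(q_vec)
    )
    context = "\n\n".join(chunks[i] for i in np.argsort(sims)[-k:])
    resp = client.chat.completions.create(
        model="gpt-4o",
        messages=[
            {"role": "system",
             "content": "Answer only from the provided journal club excerpts; "
                        "say so if the excerpts do not contain the answer."},
            {"role": "user",
             "content": f"Excerpts:\n{context}\n\nQuestion: {question}"},
        ],
    )
    return resp.choices[0].message.content

print(answer("What was the primary outcome reported in this article?"))
```

Restricting the model to retrieved excerpts is what distinguishes this setup from a general-purpose chatbot and is the mechanism by which RAG reduces hallucinations.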
Results: The study involved 13 residents and three faculty members as participants. Half of the residents reported a positive experience, while the rest gave a neutral response, citing both advantages and limitations. Residents reported that the LLM improved article summarization, responses to queries, and engagement. Faculty observed enhanced discussion quality and preparation, while overall challenges included the need for precise prompts and occasional misleading responses.
Conclusion: The study highlights the potential of a RAG-enhanced LLM to improve journal club engagement and learning. Future advancements in AI and open-source models may enhance accessibility, warranting further research.