A guide to evade hallucinations and maintain reliability when using large language models for medical research: a narrative review.

IF 3.3 Q3 ENDOCRINOLOGY & METABOLISM
Sangzin Ahn
{"title":"A guide to evade hallucinations and maintain reliability when using large language models for medical research: a narrative review.","authors":"Sangzin Ahn","doi":"10.6065/apem.2448278.139","DOIUrl":null,"url":null,"abstract":"<p><p>Large language models (LLMs) are increasingly prevalent in medical research; however, fundamental limitations in their architecture create inherent reliability challenges, particularly in specialized medical contexts. These limitations stem from autoregressive prediction mechanisms and computational constraints related to undecidability, hindering perfect accuracy. Current mitigation strategies include advanced prompting techniques such as Chain-of-Thought reasoning and Retrieval-Augmented Generation (RAG) frameworks, although these approaches are insufficient to eliminate the core reliability issues. Meta-analyses of human-artificial intelligence collaboration experiments revealed that, although LLMs can augment individual human capabilities, they are most effective in specific contexts allowing human verification. Successful integration of LLMs in medical research requires careful tool selection aligned with task requirements and appropriate verification mechanisms. Evolution of the field indicates a balanced approach combining technological innovation with established expertise, emphasizing human oversight particularly in complex biological systems. This review highlights the importance of understanding the technical limitations of LLMs while maximizing their potential through thoughtful application and rigorous verification processes, ensuring high standards of scientific integrity in medical research.</p>","PeriodicalId":44915,"journal":{"name":"Annals of Pediatric Endocrinology & Metabolism","volume":"30 3","pages":"115-118"},"PeriodicalIF":3.3000,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12235426/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annals of Pediatric Endocrinology & Metabolism","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.6065/apem.2448278.139","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/6/30 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"ENDOCRINOLOGY & METABOLISM","Score":null,"Total":0}
引用次数: 0

Abstract

Large language models (LLMs) are increasingly prevalent in medical research; however, fundamental limitations in their architecture create inherent reliability challenges, particularly in specialized medical contexts. These limitations stem from autoregressive prediction mechanisms and computational constraints related to undecidability, hindering perfect accuracy. Current mitigation strategies include advanced prompting techniques such as Chain-of-Thought reasoning and Retrieval-Augmented Generation (RAG) frameworks, although these approaches are insufficient to eliminate the core reliability issues. Meta-analyses of human-artificial intelligence collaboration experiments revealed that, although LLMs can augment individual human capabilities, they are most effective in specific contexts allowing human verification. Successful integration of LLMs in medical research requires careful tool selection aligned with task requirements and appropriate verification mechanisms. Evolution of the field indicates a balanced approach combining technological innovation with established expertise, emphasizing human oversight particularly in complex biological systems. This review highlights the importance of understanding the technical limitations of LLMs while maximizing their potential through thoughtful application and rigorous verification processes, ensuring high standards of scientific integrity in medical research.

Abstract Image

Abstract Image

在医学研究中使用大型语言模型时避免幻觉和保持可靠性的指南:叙述性回顾。
大型语言模型(LLMs)在医学研究中越来越普遍;然而,其架构的基本限制带来了固有的可靠性挑战,特别是在专业医疗环境中。这些限制源于自回归预测机制和与不可预测性相关的计算约束,阻碍了完美的准确性。目前的缓解策略包括先进的提示技术,如思维链推理和检索增强生成(RAG)框架,尽管这些方法不足以消除核心可靠性问题。人类-人工智能协作实验的荟萃分析显示,尽管法学硕士可以增强个人的能力,但它们在允许人类验证的特定环境中最有效。法学硕士在医学研究中的成功整合需要仔细选择符合任务要求的工具和适当的验证机制。该领域的发展表明了一种平衡的方法,将技术创新与已有的专业知识结合起来,强调人类的监督,特别是在复杂的生物系统中。这篇综述强调了理解法学硕士的技术限制的重要性,同时通过周到的应用和严格的验证过程最大限度地发挥其潜力,确保医学研究中高标准的科学完整性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
4.00
自引率
18.20%
发文量
59
审稿时长
24 weeks
期刊介绍: The Annals of Pediatric Endocrinology & Metabolism Journal is the official publication of the Korean Society of Pediatric Endocrinology. Its formal abbreviated title is “Ann Pediatr Endocrinol Metab”. It is a peer-reviewed open access journal of medicine published in English. The journal was launched in 1996 under the title of ‘Journal of Korean Society of Pediatric Endocrinology’ until 2011 (pISSN 1226-2242). Since 2012, the title is now changed to ‘Annals of Pediatric Endocrinology & Metabolism’. The Journal is published four times per year on the last day of March, June, September, and December. It is widely distributed for free to members of the Korean Society of Pediatric Endocrinology, medical schools, libraries, and academic institutions. The journal is indexed/tracked/covered by web sites of PubMed Central, PubMed, Emerging Sources Citation Index (ESCI), Scopus, EBSCO, EMBASE, KoreaMed, KoMCI, KCI, Science Central, DOI/CrossRef, Directory of Open Access Journals(DOAJ), and Google Scholar. The aims of Annals of Pediatric Endocrinology & Metabolism are to contribute to the advancements in the fields of pediatric endocrinology & metabolism through the scientific reviews and interchange of all of pediatric endocrinology and metabolism. It aims to reflect the latest clinical, translational, and basic research trends from worldwide valuable achievements. In addition, genome research, epidemiology, public education and clinical practice guidelines in each country are welcomed for publication. The Journal particularly focuses on research conducted with Asian-Pacific children whose genetic and environmental backgrounds are different from those of the Western. Area of specific interest include the following : Growth, puberty, glucose metabolism including diabetes mellitus, obesity, nutrition, disorders of sexual development, pituitary, thyroid, parathyroid, adrenal cortex, bone or other endocrine and metabolic disorders from infancy through adolescence.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信