{"title":"A guide to evade hallucinations and maintain reliability when using large language models for medical research: a narrative review.","authors":"Sangzin Ahn","doi":"10.6065/apem.2448278.139","DOIUrl":null,"url":null,"abstract":"<p><p>Large language models (LLMs) are increasingly prevalent in medical research; however, fundamental limitations in their architecture create inherent reliability challenges, particularly in specialized medical contexts. These limitations stem from autoregressive prediction mechanisms and computational constraints related to undecidability, hindering perfect accuracy. Current mitigation strategies include advanced prompting techniques such as Chain-of-Thought reasoning and Retrieval-Augmented Generation (RAG) frameworks, although these approaches are insufficient to eliminate the core reliability issues. Meta-analyses of human-artificial intelligence collaboration experiments revealed that, although LLMs can augment individual human capabilities, they are most effective in specific contexts allowing human verification. Successful integration of LLMs in medical research requires careful tool selection aligned with task requirements and appropriate verification mechanisms. Evolution of the field indicates a balanced approach combining technological innovation with established expertise, emphasizing human oversight particularly in complex biological systems. This review highlights the importance of understanding the technical limitations of LLMs while maximizing their potential through thoughtful application and rigorous verification processes, ensuring high standards of scientific integrity in medical research.</p>","PeriodicalId":44915,"journal":{"name":"Annals of Pediatric Endocrinology & Metabolism","volume":"30 3","pages":"115-118"},"PeriodicalIF":3.3000,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12235426/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annals of Pediatric Endocrinology & Metabolism","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.6065/apem.2448278.139","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/6/30 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"ENDOCRINOLOGY & METABOLISM","Score":null,"Total":0}
引用次数: 0
Abstract
Large language models (LLMs) are increasingly prevalent in medical research; however, fundamental limitations in their architecture create inherent reliability challenges, particularly in specialized medical contexts. These limitations stem from autoregressive prediction mechanisms and computational constraints related to undecidability, hindering perfect accuracy. Current mitigation strategies include advanced prompting techniques such as Chain-of-Thought reasoning and Retrieval-Augmented Generation (RAG) frameworks, although these approaches are insufficient to eliminate the core reliability issues. Meta-analyses of human-artificial intelligence collaboration experiments revealed that, although LLMs can augment individual human capabilities, they are most effective in specific contexts allowing human verification. Successful integration of LLMs in medical research requires careful tool selection aligned with task requirements and appropriate verification mechanisms. Evolution of the field indicates a balanced approach combining technological innovation with established expertise, emphasizing human oversight particularly in complex biological systems. This review highlights the importance of understanding the technical limitations of LLMs while maximizing their potential through thoughtful application and rigorous verification processes, ensuring high standards of scientific integrity in medical research.
期刊介绍:
The Annals of Pediatric Endocrinology & Metabolism Journal is the official publication of the Korean Society of Pediatric Endocrinology. Its formal abbreviated title is “Ann Pediatr Endocrinol Metab”. It is a peer-reviewed open access journal of medicine published in English. The journal was launched in 1996 under the title of ‘Journal of Korean Society of Pediatric Endocrinology’ until 2011 (pISSN 1226-2242). Since 2012, the title is now changed to ‘Annals of Pediatric Endocrinology & Metabolism’. The Journal is published four times per year on the last day of March, June, September, and December. It is widely distributed for free to members of the Korean Society of Pediatric Endocrinology, medical schools, libraries, and academic institutions. The journal is indexed/tracked/covered by web sites of PubMed Central, PubMed, Emerging Sources Citation Index (ESCI), Scopus, EBSCO, EMBASE, KoreaMed, KoMCI, KCI, Science Central, DOI/CrossRef, Directory of Open Access Journals(DOAJ), and Google Scholar. The aims of Annals of Pediatric Endocrinology & Metabolism are to contribute to the advancements in the fields of pediatric endocrinology & metabolism through the scientific reviews and interchange of all of pediatric endocrinology and metabolism. It aims to reflect the latest clinical, translational, and basic research trends from worldwide valuable achievements. In addition, genome research, epidemiology, public education and clinical practice guidelines in each country are welcomed for publication. The Journal particularly focuses on research conducted with Asian-Pacific children whose genetic and environmental backgrounds are different from those of the Western. Area of specific interest include the following : Growth, puberty, glucose metabolism including diabetes mellitus, obesity, nutrition, disorders of sexual development, pituitary, thyroid, parathyroid, adrenal cortex, bone or other endocrine and metabolic disorders from infancy through adolescence.