Large Language Models in MASLD: The New Era of Generative Artificial Intelligence-Augmented Clinical Practice

IF 6 2区 医学 Q1 GASTROENTEROLOGY & HEPATOLOGY
Nicola Pugliese, Mauro Giuffrè, Jörn M. Schattenberg
{"title":"Large Language Models in MASLD: The New Era of Generative Artificial Intelligence-Augmented Clinical Practice","authors":"Nicola Pugliese,&nbsp;Mauro Giuffrè,&nbsp;Jörn M. Schattenberg","doi":"10.1111/liv.16162","DOIUrl":null,"url":null,"abstract":"<p>In a rapidly transforming healthcare landscape, generative artificial intelligence (GenAI) is rapidly emerging as a powerful ally, especially in the management of complex diseases such as metabolic dysfunction-associated steatotic liver disease (MASLD) [<span>1</span>]. At the core of this revolution are large language models (LLMs), such as OpenAI's GPT-4 or Meta's Llama-3, which are showing the potential to reshape how healthcare professionals interact with patients and their data [<span>1, 2</span>] (Figure 1). These tools have unprecedented potential, not only to improve the accuracy of diagnosis, but also to reach patients in underserved areas and provide accessible, timely, and patient-centred support [<span>1, 2</span>]. In particular, the field of digestive diseases has been very prolific in terms of LLM-oriented research, with a recent systematic review defining baseline ChatGPT accuracy ranging from 6.4% to 91.4% when applied to gastroenterology and hepatology queries in the form of simple text-comprehension and response-generation tasks without the use of real-world patient data [<span>3</span>]. However, the recent study by Wu et al. represents a step forward in LLM-oriented clinical research, demonstrating the potential role of OpenAI's GPT (i.e., GPT-3.5, GPT-4, and GPT-4 Vision) in diagnosing MASLD using real-world patient data, including both electronic health records and ultrasound images, and showing comparable diagnostic accuracy to established tools such as the fatty liver index (FLI) and the United States FLI (USFLI) [<span>4</span>]. Therefore, with further refinement, LLMs could quickly and accurately assess patients, ultimately accelerating diagnosis and enabling healthcare providers to make faster, more informed decisions. This potential is further supported by previous validations in machine learning, where algorithms have already demonstrated their capacity to enhance diagnostic processes, streamlining patient assessments and decision-making with impressive accuracy [<span>5</span>]. In addition, we can envision LLMs acting as virtual companions for MASLD patients, guiding them through lifestyle adjustments and answering questions in plain language [<span>6, 7</span>].</p><p>In addition to textual interactions, LLMs have demonstrated the ability to process and analyse sophisticated medical data, including images of liver biopsy specimens. A recent exploratory study demonstrated the potential of LLMs for histological diagnosis and fibrosis staging in metabolic dysfunction-associated steatohepatitis (MASH) [<span>8</span>]. In particular, ChatGPT-4 has demonstrated superior performance in the interpretation of histological images, achieving an accuracy of 87.5% compared to 38.3% for Google Bard [<span>8</span>]. The results of this preliminary study suggest that ChatGPT-4, which is accessible even in low- and middle-income countries, could significantly improve cost-effectiveness and resource allocation by aiding in the preliminary staging of MASH, particularly in contexts with limited pathology capabilities.</p><p>However, while these advancements bring optimism, the path forward is not without obstacles. It is crucial to remain aware that LLMs are prone to generating plausible but false information (i.e., hallucinations) and that there is a significant risk of patient data leakage of personal health information [<span>2</span>]. The problem of misinformation is of utmost importance in healthcare due to the risk of potential patient’ harm. To address the first issue, the medical community is exploring strategies to link LLM responses to evidence-based medicine through the ingestion of guidelines via retrieval-augmented generation (RAG) or supervised fine-tuning (SFT), both of which have significantly improved the accuracy of LLM responses in the medical field [<span>2, 9</span>]. For the privacy concern, one potential solution is the use of open-source models that can be hosted locally or, alternatively, strengthening and supporting joint ventures between hospital systems and medical companies (such as the collaboration between Epic and OpenAI) [<span>10</span>]. Accessibility remains another significant issue, particularly in rural or underdeveloped regions that may lack the necessary technology or internet connectivity. To overcome these barriers, healthcare systems must invest in education and infrastructure to ensure that these ground-breaking tools are available to all who need them.</p><p>These challenges are particularly important to address for conditions such as MASLD, which is anticipated to become increasingly prevalent and for which therapeutic options are limited [<span>1</span>]. Effective management of MASLD relies heavily on lifestyle changes, an area where LLMs could play a crucial role by providing accessible, personalised advice on diet, exercise, and other health behaviours. However, to fully realise this potential, issues of accuracy, privacy, and accessibility need to be addressed. With continued progress in these areas, LLMs could become transformative tools in the management of high-burden, lifestyle-dependent diseases. By providing accurate, secure, and easily accessible support, LLMs offer a path toward a more preventive, patient-centred model of care that empowers both healthcare providers and patients to achieve better long-term outcomes.</p><p>As we envision the future, the growing capabilities of GenAI hold the promise to revolutionise patient care, paving the way for unprecedented improvements in outcomes and creating a healthcare system that is more effective, impactful, and deeply patient-centred. While we are only at the beginning of this journey, continued collaboration and innovation will enable AI to play a defining role in the future of healthcare, transforming the way we support and empower both patients and providers.</p><p>N.P. declares speaker honorarium from Gilead, AlfaSigma, Advanz Pharma, Gore. J.M.S. declares consultant honorary from Akero, Alentis, Alexion, Altimmune, Astra Zeneca, 89Bio, Bionorica, Boehringer Ingelheim, Gilead Sciences, GSK, HistoIndex, Ipsen, Inventiva Pharma, Madrigal Pharmaceuticals, Kríya Therapeutics, Lilly, MSD Sharp &amp; Dohme GmbH, Nordic Bioscience, Northsea Therapeutics, Novartis, Novo Nordisk, Pfizer, Roche, Sanofi, Siemens Healthineers, Research; speaker honorarium from AbbVie, Boehringer Ingelheim, Gilead Sciences, Ipsen Novo Nordisk, Madrigal Pharmaceuticals, Worldwide Clinical Trials, Stockholder options: Hepta Bio. M.G. declares no COIs.</p>","PeriodicalId":18101,"journal":{"name":"Liver International","volume":"45 4","pages":""},"PeriodicalIF":6.0000,"publicationDate":"2025-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/liv.16162","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Liver International","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/liv.16162","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GASTROENTEROLOGY & HEPATOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

In a rapidly transforming healthcare landscape, generative artificial intelligence (GenAI) is rapidly emerging as a powerful ally, especially in the management of complex diseases such as metabolic dysfunction-associated steatotic liver disease (MASLD) [1]. At the core of this revolution are large language models (LLMs), such as OpenAI's GPT-4 or Meta's Llama-3, which are showing the potential to reshape how healthcare professionals interact with patients and their data [1, 2] (Figure 1). These tools have unprecedented potential, not only to improve the accuracy of diagnosis, but also to reach patients in underserved areas and provide accessible, timely, and patient-centred support [1, 2]. In particular, the field of digestive diseases has been very prolific in terms of LLM-oriented research, with a recent systematic review defining baseline ChatGPT accuracy ranging from 6.4% to 91.4% when applied to gastroenterology and hepatology queries in the form of simple text-comprehension and response-generation tasks without the use of real-world patient data [3]. However, the recent study by Wu et al. represents a step forward in LLM-oriented clinical research, demonstrating the potential role of OpenAI's GPT (i.e., GPT-3.5, GPT-4, and GPT-4 Vision) in diagnosing MASLD using real-world patient data, including both electronic health records and ultrasound images, and showing comparable diagnostic accuracy to established tools such as the fatty liver index (FLI) and the United States FLI (USFLI) [4]. Therefore, with further refinement, LLMs could quickly and accurately assess patients, ultimately accelerating diagnosis and enabling healthcare providers to make faster, more informed decisions. This potential is further supported by previous validations in machine learning, where algorithms have already demonstrated their capacity to enhance diagnostic processes, streamlining patient assessments and decision-making with impressive accuracy [5]. In addition, we can envision LLMs acting as virtual companions for MASLD patients, guiding them through lifestyle adjustments and answering questions in plain language [6, 7].

In addition to textual interactions, LLMs have demonstrated the ability to process and analyse sophisticated medical data, including images of liver biopsy specimens. A recent exploratory study demonstrated the potential of LLMs for histological diagnosis and fibrosis staging in metabolic dysfunction-associated steatohepatitis (MASH) [8]. In particular, ChatGPT-4 has demonstrated superior performance in the interpretation of histological images, achieving an accuracy of 87.5% compared to 38.3% for Google Bard [8]. The results of this preliminary study suggest that ChatGPT-4, which is accessible even in low- and middle-income countries, could significantly improve cost-effectiveness and resource allocation by aiding in the preliminary staging of MASH, particularly in contexts with limited pathology capabilities.

However, while these advancements bring optimism, the path forward is not without obstacles. It is crucial to remain aware that LLMs are prone to generating plausible but false information (i.e., hallucinations) and that there is a significant risk of patient data leakage of personal health information [2]. The problem of misinformation is of utmost importance in healthcare due to the risk of potential patient’ harm. To address the first issue, the medical community is exploring strategies to link LLM responses to evidence-based medicine through the ingestion of guidelines via retrieval-augmented generation (RAG) or supervised fine-tuning (SFT), both of which have significantly improved the accuracy of LLM responses in the medical field [2, 9]. For the privacy concern, one potential solution is the use of open-source models that can be hosted locally or, alternatively, strengthening and supporting joint ventures between hospital systems and medical companies (such as the collaboration between Epic and OpenAI) [10]. Accessibility remains another significant issue, particularly in rural or underdeveloped regions that may lack the necessary technology or internet connectivity. To overcome these barriers, healthcare systems must invest in education and infrastructure to ensure that these ground-breaking tools are available to all who need them.

These challenges are particularly important to address for conditions such as MASLD, which is anticipated to become increasingly prevalent and for which therapeutic options are limited [1]. Effective management of MASLD relies heavily on lifestyle changes, an area where LLMs could play a crucial role by providing accessible, personalised advice on diet, exercise, and other health behaviours. However, to fully realise this potential, issues of accuracy, privacy, and accessibility need to be addressed. With continued progress in these areas, LLMs could become transformative tools in the management of high-burden, lifestyle-dependent diseases. By providing accurate, secure, and easily accessible support, LLMs offer a path toward a more preventive, patient-centred model of care that empowers both healthcare providers and patients to achieve better long-term outcomes.

As we envision the future, the growing capabilities of GenAI hold the promise to revolutionise patient care, paving the way for unprecedented improvements in outcomes and creating a healthcare system that is more effective, impactful, and deeply patient-centred. While we are only at the beginning of this journey, continued collaboration and innovation will enable AI to play a defining role in the future of healthcare, transforming the way we support and empower both patients and providers.

N.P. declares speaker honorarium from Gilead, AlfaSigma, Advanz Pharma, Gore. J.M.S. declares consultant honorary from Akero, Alentis, Alexion, Altimmune, Astra Zeneca, 89Bio, Bionorica, Boehringer Ingelheim, Gilead Sciences, GSK, HistoIndex, Ipsen, Inventiva Pharma, Madrigal Pharmaceuticals, Kríya Therapeutics, Lilly, MSD Sharp & Dohme GmbH, Nordic Bioscience, Northsea Therapeutics, Novartis, Novo Nordisk, Pfizer, Roche, Sanofi, Siemens Healthineers, Research; speaker honorarium from AbbVie, Boehringer Ingelheim, Gilead Sciences, Ipsen Novo Nordisk, Madrigal Pharmaceuticals, Worldwide Clinical Trials, Stockholder options: Hepta Bio. M.G. declares no COIs.

Abstract Image

MASLD中的大型语言模型:生成式人工智能增强临床实践的新时代。
在快速转变的医疗保健领域,生殖人工智能(GenAI)正迅速成为一个强大的盟友,特别是在代谢功能障碍相关脂肪变性肝病(MASLD) bbb等复杂疾病的管理中。这场革命的核心是大型语言模型(llm),如OpenAI的GPT-4或Meta的lama-3,它们显示出重塑医疗保健专业人员与患者及其数据交互方式的潜力[1,2](图1)。这些工具具有前所未有的潜力,不仅可以提高诊断的准确性,还可以覆盖服务不足地区的患者,并提供可访问的,及时的,以患者为中心的支持[1,2]。特别是,消化疾病领域在面向法学硕士的研究方面非常丰富,最近的一项系统综述定义了ChatGPT基线准确度,当以简单的文本理解和响应生成任务的形式应用于胃肠病学和肝病学查询时,准确度从6.4%到91.4%不等,而不使用实际患者数据[3]。然而,Wu等人最近的研究代表了法学硕士导向的临床研究向前迈进了一步,证明了OpenAI的GPT(即GPT-3.5、GPT-4和GPT-4 Vision)在使用真实患者数据(包括电子健康记录和超声图像)诊断MASLD方面的潜在作用,并显示出与现有工具(如脂肪肝指数(FLI)和美国FLI (USFLI)[4])相当的诊断准确性。因此,通过进一步改进,llm可以快速准确地评估患者,最终加快诊断速度,并使医疗保健提供者能够做出更快、更明智的决策。这种潜力得到了机器学习领域先前验证的进一步支持,在机器学习领域,算法已经证明了它们能够增强诊断过程,简化患者评估和决策,其准确性令人印象深刻。此外,我们可以设想llm作为MASLD患者的虚拟伴侣,指导他们调整生活方式,用简单的语言回答问题[6,7]。除了文本交互外,法学硕士还展示了处理和分析复杂医学数据的能力,包括肝活检标本的图像。最近的一项探索性研究表明,llm在代谢功能障碍相关脂肪性肝炎(MASH)[8]的组织学诊断和纤维化分期方面具有潜力。特别是,ChatGPT-4在解释组织学图像方面表现出了卓越的性能,达到了87.5%的准确率,而谷歌Bard[8]的准确率为38.3%。这项初步研究的结果表明,即使在低收入和中等收入国家也可以获得ChatGPT-4,它可以通过帮助MASH的初步分期来显着提高成本效益和资源分配,特别是在病理能力有限的情况下。然而,尽管这些进步带来了乐观,但前进的道路并非没有障碍。至关重要的是,要始终意识到法学硕士容易产生看似合理但虚假的信息(即幻觉),并且存在患者数据泄露或个人健康信息bbb的重大风险。由于潜在的患者伤害风险,错误信息的问题在医疗保健中至关重要。为了解决第一个问题,医学界正在探索将LLM响应与循证医学联系起来的策略,通过检索增强生成(retrieval-augmented generation, RAG)或监督微调(supervised fine-tuning, SFT)摄入指南,这两种方法都显著提高了医学领域LLM响应的准确性[2,9]。出于隐私考虑,一个潜在的解决方案是使用可以在本地托管的开源模型,或者,加强和支持医院系统和医疗公司之间的合资企业(例如Epic和OpenAI之间的合作)[10]。可访问性仍然是另一个重要问题,特别是在可能缺乏必要技术或互联网连接的农村或不发达地区。为了克服这些障碍,卫生保健系统必须投资于教育和基础设施,以确保所有需要的人都能获得这些突破性的工具。对于MASLD等疾病而言,这些挑战尤其重要,因为预计MASLD将变得越来越普遍,而治疗选择有限。MASLD的有效管理在很大程度上依赖于生活方式的改变,法学硕士可以在这一领域发挥关键作用,提供有关饮食、运动和其他健康行为的可获得的个性化建议。然而,为了充分实现这一潜力,需要解决准确性、隐私性和可访问性等问题。随着这些领域的持续进展,法学硕士可能成为管理高负担、依赖生活方式的疾病的变革性工具。 通过提供准确、安全且易于获取的支持,法学硕士提供了一条通往更具预防性、以患者为中心的护理模式的道路,使医疗保健提供者和患者都能获得更好的长期结果。展望未来,GenAI不断增长的能力有望彻底改变患者护理,为前所未有的改善结果铺平道路,并创建一个更有效、更有影响力、更以患者为中心的医疗保健系统。虽然我们才刚刚开始这段旅程,但持续的合作和创新将使人工智能在未来的医疗保健中发挥决定性作用,改变我们支持和授权患者和提供者的方式。来自吉利德、AlfaSigma、Advanz Pharma、Gore的演讲者酬金宣布。J.M.S.宣布Akero、Alentis、Alexion、Altimmune、Astra Zeneca、89Bio、Bionorica、Boehringer Ingelheim、Gilead Sciences、GSK、HistoIndex、Ipsen、Inventiva Pharma、Madrigal Pharmaceuticals、Kríya Therapeutics、Lilly、MSD Sharp &amp;Dohme GmbH、Nordic Bioscience、Northsea Therapeutics、Novartis、Novo Nordisk、Pfizer、Roche、Sanofi、Siemens Healthineers、Research;来自艾伯维,勃林格殷格翰,吉利德科学,易普森诺和诺德,马德里制药,全球临床试验,股东期权:Hepta Bio。M.G.宣布没有coi。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Liver International
Liver International 医学-胃肠肝病学
CiteScore
13.90
自引率
4.50%
发文量
348
审稿时长
2 months
期刊介绍: Liver International promotes all aspects of the science of hepatology from basic research to applied clinical studies. Providing an international forum for the publication of high-quality original research in hepatology, it is an essential resource for everyone working on normal and abnormal structure and function in the liver and its constituent cells, including clinicians and basic scientists involved in the multi-disciplinary field of hepatology. The journal welcomes articles from all fields of hepatology, which may be published as original articles, brief definitive reports, reviews, mini-reviews, images in hepatology and letters to the Editor.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信