Hanfei Zhu, Ruojiang Wang, Jiajie Qian, Yuhao Wu, Zhuqing Jin, Xishen Shan, Fuhai Ji, Zixuan Yuan, Tingrui Pan
{"title":"Leveraging Large Language Models for Predicting Postoperative Acute Kidney Injury in Elderly Patients.","authors":"Hanfei Zhu, Ruojiang Wang, Jiajie Qian, Yuhao Wu, Zhuqing Jin, Xishen Shan, Fuhai Ji, Zixuan Yuan, Tingrui Pan","doi":"10.34133/bmef.0111","DOIUrl":null,"url":null,"abstract":"<p><p><b>Objective:</b> The objective of this work is to develop a framework based on large language models (LLMs) to predict postoperative acute kidney injury (AKI) outcomes in elderly patients. <b>Impact Statement:</b> Our study demonstrates that LLMs have the potential to address the issues of poor generalization and weak interpretability commonly encountered in disease prediction using traditional machine learning (ML) models. <b>Introduction:</b> AKI is a severe postoperative complication, especially in elderly patients with declining renal function. Current AKI prediction models rely on ML, but their lack of interpretability and generalizability limits clinical use. LLMs, with extensive pretraining and text generation capabilities, offer a new solution. <b>Methods:</b> We applied prompt engineering and knowledge distillation based on instruction fine-tuning to optimize LLMs for AKI prediction. The framework was tested on 2,649 samples from 2 private Chinese hospitals and one public South Korean dataset, which were divided into internal and external datasets. <b>Results:</b> The LLM framework showed robust external performance, with accuracy rates: commercial LLMs (internal: 63.73%, external: 68.73%), open-source LLMs (internal: 63.70%, external: 64.24%), and ML models (internal: 63.93%, external: 58.27%). LLMs also provided human-readable explanations for better clinical understanding. <b>Conclusion:</b> The proposed framework showcases the potential of LLMs to enhance generalization and interpretability in postoperative AKI prediction, paving the way for more robust and transparent predictive solutions in clinical settings.</p>","PeriodicalId":72430,"journal":{"name":"BME frontiers","volume":"6 ","pages":"0111"},"PeriodicalIF":5.0000,"publicationDate":"2025-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11896637/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BME frontiers","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.34133/bmef.0111","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Objective: The objective of this work is to develop a framework based on large language models (LLMs) to predict postoperative acute kidney injury (AKI) outcomes in elderly patients. Impact Statement: Our study demonstrates that LLMs have the potential to address the issues of poor generalization and weak interpretability commonly encountered in disease prediction using traditional machine learning (ML) models. Introduction: AKI is a severe postoperative complication, especially in elderly patients with declining renal function. Current AKI prediction models rely on ML, but their lack of interpretability and generalizability limits clinical use. LLMs, with extensive pretraining and text generation capabilities, offer a new solution. Methods: We applied prompt engineering and knowledge distillation based on instruction fine-tuning to optimize LLMs for AKI prediction. The framework was tested on 2,649 samples from 2 private Chinese hospitals and one public South Korean dataset, which were divided into internal and external datasets. Results: The LLM framework showed robust external performance, with accuracy rates: commercial LLMs (internal: 63.73%, external: 68.73%), open-source LLMs (internal: 63.70%, external: 64.24%), and ML models (internal: 63.93%, external: 58.27%). LLMs also provided human-readable explanations for better clinical understanding. Conclusion: The proposed framework showcases the potential of LLMs to enhance generalization and interpretability in postoperative AKI prediction, paving the way for more robust and transparent predictive solutions in clinical settings.