Aqeeb Ur Rehman, Javier A Neyra, Jin Chen, Lama Ghazi
{"title":"Machine learning models for acute kidney injury prediction and management: a scoping review of externally validated studies.","authors":"Aqeeb Ur Rehman, Javier A Neyra, Jin Chen, Lama Ghazi","doi":"10.1080/10408363.2025.2497843","DOIUrl":null,"url":null,"abstract":"<p><p>Despite advancements in medical care, acute kidney injury (AKI) remains a major contributor to adverse patient outcomes and presents a significant challenge due to its associated morbidity, mortality, and financial cost. Machine learning (ML) is increasingly being recognized for its potential to transform AKI care by enabling early prediction, detection, and facilitating an individualized approach to patient management. This scoping review aims to provide a comprehensive analysis of externally validated ML models for the prediction, detection, and management of AKI. We systematically searched for relevant literature from inception to 15 February 2024, using four databases-MEDLINE, EMBASE, Web of Science, and Scopus. We focused solely on models that had undergone external validation, employed Kidney Disease Improving Global Outcomes (KDIGO) definitions for AKI, and utilized ML models (excluding logistic regression models). A total of 44 studies encompassing 161 ML models for AKI prediction, severity assessment, and outcomes in both adult and pediatric populations were included in the review. These studies encompassed 4,153,424 patient admissions, with 1,209,659 in the development and internal validation cohorts and 2,943,765 in the external validation cohorts. The ML models demonstrated significant variability in performance owing to differing clinical settings, populations, and predictors used. Most of the included models were developed in specialized patient populations, such as those in intensive care units, post-surgical settings, and specific disease states (e.g. congestive heart failure, traumatic brain injury, etc.). Moreover, only a few models incorporated dynamic predictors of AKI which are crucial for improving clinical utility in rapidly evolving clinical conditions like AKI. The variable performance of these models when applied to external validation cohorts highlights the challenges of reproducibility and generalizability in implementing ML models in AKI care. Despite acceptable performance metrics, none of the models assessed in this review underwent validation or implementation in real-world clinical workflows. These findings underscore the need for standardized performance metrics and validation protocols to enhance the generalizability and clinical applicability of these models. Future efforts should focus on enhancing model adaptability by incorporating dynamic predictors and unstructured data and by ensuring that models are developed in diverse patient populations. Moreover, collaboration between clinicians and data scientists is critical to ensure the development of models that are clinically relevant, fair, and tailored to real-world healthcare environments.</p>","PeriodicalId":10760,"journal":{"name":"Critical reviews in clinical laboratory sciences","volume":" ","pages":"1-23"},"PeriodicalIF":6.6000,"publicationDate":"2025-05-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Critical reviews in clinical laboratory sciences","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1080/10408363.2025.2497843","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICAL LABORATORY TECHNOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Despite advancements in medical care, acute kidney injury (AKI) remains a major contributor to adverse patient outcomes and presents a significant challenge due to its associated morbidity, mortality, and financial cost. Machine learning (ML) is increasingly being recognized for its potential to transform AKI care by enabling early prediction, detection, and facilitating an individualized approach to patient management. This scoping review aims to provide a comprehensive analysis of externally validated ML models for the prediction, detection, and management of AKI. We systematically searched for relevant literature from inception to 15 February 2024, using four databases-MEDLINE, EMBASE, Web of Science, and Scopus. We focused solely on models that had undergone external validation, employed Kidney Disease Improving Global Outcomes (KDIGO) definitions for AKI, and utilized ML models (excluding logistic regression models). A total of 44 studies encompassing 161 ML models for AKI prediction, severity assessment, and outcomes in both adult and pediatric populations were included in the review. These studies encompassed 4,153,424 patient admissions, with 1,209,659 in the development and internal validation cohorts and 2,943,765 in the external validation cohorts. The ML models demonstrated significant variability in performance owing to differing clinical settings, populations, and predictors used. Most of the included models were developed in specialized patient populations, such as those in intensive care units, post-surgical settings, and specific disease states (e.g. congestive heart failure, traumatic brain injury, etc.). Moreover, only a few models incorporated dynamic predictors of AKI which are crucial for improving clinical utility in rapidly evolving clinical conditions like AKI. The variable performance of these models when applied to external validation cohorts highlights the challenges of reproducibility and generalizability in implementing ML models in AKI care. Despite acceptable performance metrics, none of the models assessed in this review underwent validation or implementation in real-world clinical workflows. These findings underscore the need for standardized performance metrics and validation protocols to enhance the generalizability and clinical applicability of these models. Future efforts should focus on enhancing model adaptability by incorporating dynamic predictors and unstructured data and by ensuring that models are developed in diverse patient populations. Moreover, collaboration between clinicians and data scientists is critical to ensure the development of models that are clinically relevant, fair, and tailored to real-world healthcare environments.
尽管医疗保健取得了进步,但急性肾损伤(AKI)仍然是患者不良预后的主要原因,并且由于其相关的发病率、死亡率和经济成本而提出了重大挑战。机器学习(ML)通过实现早期预测、检测和促进患者管理的个性化方法,越来越多地认识到其改变AKI护理的潜力。本综述旨在对AKI的预测、检测和管理的外部验证ML模型进行全面分析。我们使用medline、EMBASE、Web of Science和Scopus四个数据库系统地检索了从成立到2024年2月15日的相关文献。我们只关注那些经过外部验证的模型,使用肾脏疾病改善总体结果(KDIGO)定义AKI,并使用ML模型(不包括逻辑回归模型)。该综述共纳入了44项研究,包括161个ML模型,用于AKI预测、严重程度评估和成人和儿童人群的结局。这些研究包括4,153,424例入院患者,其中1,209,659例为开发和内部验证队列,2,943,765例为外部验证队列。由于不同的临床环境、人群和使用的预测因子,ML模型表现出显著的性能差异。大多数纳入的模型是针对专门的患者群体开发的,例如重症监护病房、手术后环境和特定疾病状态(例如充血性心力衰竭、创伤性脑损伤等)。此外,只有少数模型纳入了AKI的动态预测因子,这对于提高AKI等快速发展的临床疾病的临床实用性至关重要。当这些模型应用于外部验证队列时,这些模型的可变性能突出了在AKI护理中实施ML模型的可重复性和泛化性的挑战。尽管可以接受的性能指标,在本综述中评估的模型都没有在现实世界的临床工作流程中进行验证或实施。这些发现强调需要标准化的性能指标和验证方案,以提高这些模型的普遍性和临床适用性。未来的工作应侧重于通过纳入动态预测因子和非结构化数据,并确保在不同的患者群体中开发模型,从而提高模型的适应性。此外,临床医生和数据科学家之间的协作对于确保模型的开发与临床相关、公平且适合现实世界的医疗保健环境至关重要。
期刊介绍:
Critical Reviews in Clinical Laboratory Sciences publishes comprehensive and high quality review articles in all areas of clinical laboratory science, including clinical biochemistry, hematology, microbiology, pathology, transfusion medicine, genetics, immunology and molecular diagnostics. The reviews critically evaluate the status of current issues in the selected areas, with a focus on clinical laboratory diagnostics and latest advances. The adjective “critical” implies a balanced synthesis of results and conclusions that are frequently contradictory and controversial.