{"title":"Predictive modeling of pediatric drug-induced liver injury: Dynamic classifier selection with clustering analysis.","authors":"Zixin Shi, Linjun Huang, Haolin Wang","doi":"10.1177/20552076251330078","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Pediatric populations are more vulnerable to drug-induced liver injury (DILI) due to distinct pharmacokinetic profiles and ongoing physiological maturation processes. However, early identification and assessment of DILI in pediatric patients present significant clinical challenges, primarily due to the inherent complexity of pediatric cases and substantial limitations in available clinical data.</p><p><strong>Objective: </strong>This study introduces a framework that integrates clustering analysis with dynamic classifier selection (DCS) techniques to enhance pediatric DILI prediction. The proposed method addresses challenges such as patient heterogeneity and class imbalance, while optimizing predictive performance to support clinical decision-making.</p><p><strong>Methods: </strong>We investigated a retrospective cohort of 12,555 pediatric inpatients across six hospitals in Chongqing, China. The dataset encompassed a wide range of biomedical parameters, including laboratory results and liver function profiles, along with clinical documentation spanning demographic characteristics, medical histories, and medication regimens. Patients were stratified into four distinct clinical subgroups based on silhouette coefficient. A diverse pool of base classifiers was generated with varied initialization strategies and hyperparameter optimizations tailored to each patient cluster. The classification process was further refined through the implementation of Dynamic Classifier Selection with Multiple Classifier Behavior (DCS-MCB) methodology, which adaptively customizes model selection based on the distinctive clinical profiles of each subgroup.</p><p><strong>Results: </strong>The Clustering-enhanced DCS-MCB framework demonstrated superior performance compared to conventional machine learning models across evaluation metrics. The ensemble learning models consistently outperformed individual classifier models, with the presented study achieving the highest F1-score (0.926), MCC (0.917), G-mean (0.959), demonstrating the strength of this hybrid approach in addressing the complexities of pediatric DILI prediction.</p><p><strong>Conclusion: </strong>The integration of clustering analysis with dynamic classifier selection has demonstrated efficacy in complex real-world clinical settings. This methodology provides a more robust, precise, and clinically adaptable framework for patient stratification and drug safety surveillance.</p>","PeriodicalId":51333,"journal":{"name":"DIGITAL HEALTH","volume":"11 ","pages":"20552076251330078"},"PeriodicalIF":2.9000,"publicationDate":"2025-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11926833/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"DIGITAL HEALTH","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1177/20552076251330078","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Pediatric populations are more vulnerable to drug-induced liver injury (DILI) due to distinct pharmacokinetic profiles and ongoing physiological maturation processes. However, early identification and assessment of DILI in pediatric patients present significant clinical challenges, primarily due to the inherent complexity of pediatric cases and substantial limitations in available clinical data.
Objective: This study introduces a framework that integrates clustering analysis with dynamic classifier selection (DCS) techniques to enhance pediatric DILI prediction. The proposed method addresses challenges such as patient heterogeneity and class imbalance, while optimizing predictive performance to support clinical decision-making.
Methods: We investigated a retrospective cohort of 12,555 pediatric inpatients across six hospitals in Chongqing, China. The dataset encompassed a wide range of biomedical parameters, including laboratory results and liver function profiles, along with clinical documentation spanning demographic characteristics, medical histories, and medication regimens. Patients were stratified into four distinct clinical subgroups based on silhouette coefficient. A diverse pool of base classifiers was generated with varied initialization strategies and hyperparameter optimizations tailored to each patient cluster. The classification process was further refined through the implementation of Dynamic Classifier Selection with Multiple Classifier Behavior (DCS-MCB) methodology, which adaptively customizes model selection based on the distinctive clinical profiles of each subgroup.
Results: The Clustering-enhanced DCS-MCB framework demonstrated superior performance compared to conventional machine learning models across evaluation metrics. The ensemble learning models consistently outperformed individual classifier models, with the presented study achieving the highest F1-score (0.926), MCC (0.917), G-mean (0.959), demonstrating the strength of this hybrid approach in addressing the complexities of pediatric DILI prediction.
Conclusion: The integration of clustering analysis with dynamic classifier selection has demonstrated efficacy in complex real-world clinical settings. This methodology provides a more robust, precise, and clinically adaptable framework for patient stratification and drug safety surveillance.