Machine Learning Modeling for Predicting Infliximab Pharmacokinetics in Pediatric and Young Adult Patients With Crohn Disease: Leveraging Ensemble Modeling With Synthetic and Real-World Data.
Kei Irie, Phillip Minar, Jack Reifenberg, Brendan M Boyle, Joshua D Noe, Jeffrey S Hyams, Tomoyuki Mizuno
{"title":"Machine Learning Modeling for Predicting Infliximab Pharmacokinetics in Pediatric and Young Adult Patients With Crohn Disease: Leveraging Ensemble Modeling With Synthetic and Real-World Data.","authors":"Kei Irie, Phillip Minar, Jack Reifenberg, Brendan M Boyle, Joshua D Noe, Jeffrey S Hyams, Tomoyuki Mizuno","doi":"10.1097/FTD.0000000000001348","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Predicting infliximab pharmacokinetics (PK) is essential for optimizing individualized dosing in pediatric patients with Crohn disease (CD). Machine learning (ML) has emerged as a tool for predicting drug exposure; however, its development typically requires large datasets. This study aimed to develop an ML model for infliximab PK prediction by leveraging population PK model-based synthetic and real-world data.</p><p><strong>Methods: </strong>An initial ML model was trained using the XGBoost algorithm with synthetic infliximab concentration data (n = 560,000) generated from an established pediatric PK model. The prediction errors were assessed using real-world data, including 292 plasma concentrations from 93 pediatric and young adult patients with CD. A second XGBoost model, incorporating clinical features, was used to correct these errors. The performance of the model was evaluated using the root mean square error (RMSE) and mean prediction error (MPE).</p><p><strong>Results: </strong>The first ML model yielded RMSE and MPE values of 6.44 and 1.84 mcg/mL, respectively. The features of the second XGBoost model included the predicted infliximab concentrations, cumulative dose, and dosing interval duration. A 5-fold cross-validation demonstrated improved performance of the ensemble model (RMSE = 4.30 ± 1.09 mcg/mL, MPE = 0.21 ± 0.39 mcg/mL) compared with the initial model and was comparable with the Bayesian approach (RMSE = 4.81 mcg/mL, MPE = -0.67 mcg/mL).</p><p><strong>Conclusions: </strong>This study demonstrated the feasibility of combining synthetic and real-world data to develop an ML-based approach for infliximab PK prediction, potentially enhancing precision dosing in pediatric CD.</p>","PeriodicalId":23052,"journal":{"name":"Therapeutic Drug Monitoring","volume":" ","pages":""},"PeriodicalIF":2.8000,"publicationDate":"2025-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Therapeutic Drug Monitoring","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1097/FTD.0000000000001348","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL LABORATORY TECHNOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Predicting infliximab pharmacokinetics (PK) is essential for optimizing individualized dosing in pediatric patients with Crohn disease (CD). Machine learning (ML) has emerged as a tool for predicting drug exposure; however, its development typically requires large datasets. This study aimed to develop an ML model for infliximab PK prediction by leveraging population PK model-based synthetic and real-world data.
Methods: An initial ML model was trained using the XGBoost algorithm with synthetic infliximab concentration data (n = 560,000) generated from an established pediatric PK model. The prediction errors were assessed using real-world data, including 292 plasma concentrations from 93 pediatric and young adult patients with CD. A second XGBoost model, incorporating clinical features, was used to correct these errors. The performance of the model was evaluated using the root mean square error (RMSE) and mean prediction error (MPE).
Results: The first ML model yielded RMSE and MPE values of 6.44 and 1.84 mcg/mL, respectively. The features of the second XGBoost model included the predicted infliximab concentrations, cumulative dose, and dosing interval duration. A 5-fold cross-validation demonstrated improved performance of the ensemble model (RMSE = 4.30 ± 1.09 mcg/mL, MPE = 0.21 ± 0.39 mcg/mL) compared with the initial model and was comparable with the Bayesian approach (RMSE = 4.81 mcg/mL, MPE = -0.67 mcg/mL).
Conclusions: This study demonstrated the feasibility of combining synthetic and real-world data to develop an ML-based approach for infliximab PK prediction, potentially enhancing precision dosing in pediatric CD.
期刊介绍:
Therapeutic Drug Monitoring is a peer-reviewed, multidisciplinary journal directed to an audience of pharmacologists, clinical chemists, laboratorians, pharmacists, drug researchers and toxicologists. It fosters the exchange of knowledge among the various disciplines–clinical pharmacology, pathology, toxicology, analytical chemistry–that share a common interest in Therapeutic Drug Monitoring. The journal presents studies detailing the various factors that affect the rate and extent drugs are absorbed, metabolized, and excreted. Regular features include review articles on specific classes of drugs, original articles, case reports, technical notes, and continuing education articles.