Precision Dosing in Presence of Multiobjective Therapies by Integrating Reinforcement Learning and PK-PD Models: Application to Givinostat Treatment of Polycythemia Vera

IF 3 3区医学 Q2 PHARMACOLOGY & PHARMACY

CPT: Pharmacometrics & Systems Pharmacology Pub Date : 2025-05-05 DOI:10.1002/psp4.70012

Alessandro De Carlo, Elena Maria Tosca, Paolo Magni

{"title":"Precision Dosing in Presence of Multiobjective Therapies by Integrating Reinforcement Learning and PK-PD Models: Application to Givinostat Treatment of Polycythemia Vera","authors":"Alessandro De Carlo, Elena Maria Tosca, Paolo Magni","doi":"10.1002/psp4.70012","DOIUrl":null,"url":null,"abstract":"Precision dosing aims to optimize and customize pharmacological treatment at the individual level. The integration of pharmacometric models with Reinforcement Learning (RL) algorithms is currently under investigation to support the personalization of adaptive dosing therapies. In this study, this hybrid technique is applied to the real multiobjective precision dosing problem of givinostat treatment in polycythemia vera (PV) patients. PV is a chronic myeloproliferative disease with an overproduction of platelets (PLT), white blood cells (WBC), and hematocrit (HCT). The therapeutic goal is to simultaneously normalize the levels of these efficacy/safety biomarkers, thus inducing a complete hematological response (CHR). An RL algorithm, Q-Learning (QL), was integrated with a PK-PD model describing the givinostat effect on PLT, WBC, and HCT to derive both an adaptive dosing protocol (QLpop-agent) for the whole population and personalized dosing strategies by coupling a specific QL-agent to each patient (QLind-agents). QLpop-agent learned a general adaptive dosing protocol that achieved a similar CHR rate (77% vs. 83%) when compared to the actual givinostat clinical protocol on 10 simulated populations. Treatment efficacy and safety increased with a deeper dosing personalization by QLind-agents. These QL-based patient-specific adaptive dosing rules outperformed both the clinical protocol and QLpop-agent by reaching the CHR in 93% of the test patients and completely avoided severe toxicities during the whole treatment period. These results confirm that RL and PK-PD models can be valid tools for supporting adaptive dosing strategies as interesting performances were achieved in both learning a general set of rules and in customizing treatment for each patient.","PeriodicalId":10774,"journal":{"name":"CPT: Pharmacometrics & Systems Pharmacology","volume":"14 6","pages":"1018-1031"},"PeriodicalIF":3.0000,"publicationDate":"2025-05-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/psp4.70012","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"CPT: Pharmacometrics & Systems Pharmacology","FirstCategoryId":"3","ListUrlMain":"https://ascpt.onlinelibrary.wiley.com/doi/10.1002/psp4.70012","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PHARMACOLOGY & PHARMACY","Score":null,"Total":0}

引用次数: 0

Abstract

Precision dosing aims to optimize and customize pharmacological treatment at the individual level. The integration of pharmacometric models with Reinforcement Learning (RL) algorithms is currently under investigation to support the personalization of adaptive dosing therapies. In this study, this hybrid technique is applied to the real multiobjective precision dosing problem of givinostat treatment in polycythemia vera (PV) patients. PV is a chronic myeloproliferative disease with an overproduction of platelets (PLT), white blood cells (WBC), and hematocrit (HCT). The therapeutic goal is to simultaneously normalize the levels of these efficacy/safety biomarkers, thus inducing a complete hematological response (CHR). An RL algorithm, Q-Learning (QL), was integrated with a PK-PD model describing the givinostat effect on PLT, WBC, and HCT to derive both an adaptive dosing protocol (QL_pop-agent) for the whole population and personalized dosing strategies by coupling a specific QL-agent to each patient (QL_ind-agents). QL_pop-agent learned a general adaptive dosing protocol that achieved a similar CHR rate (77% vs. 83%) when compared to the actual givinostat clinical protocol on 10 simulated populations. Treatment efficacy and safety increased with a deeper dosing personalization by QL_ind-agents. These QL-based patient-specific adaptive dosing rules outperformed both the clinical protocol and QL_pop-agent by reaching the CHR in 93% of the test patients and completely avoided severe toxicities during the whole treatment period. These results confirm that RL and PK-PD models can be valid tools for supporting adaptive dosing strategies as interesting performances were achieved in both learning a general set of rules and in customizing treatment for each patient.

Abstract Image

查看原文本刊更多论文

整合强化学习和PK-PD模型的多目标治疗精准给药：在给予维诺他治疗真性红细胞增多症中的应用。

精确给药的目的是在个体水平上优化和定制药物治疗。目前正在研究将药物计量模型与强化学习（RL）算法相结合，以支持适应性给药治疗的个性化。在这项研究中，这种混合技术被应用于真性红细胞增多症（PV）患者给予他汀治疗的多目标精确给药问题。PV是一种慢性骨髓增生性疾病，伴有血小板（PLT）、白细胞（WBC）和红细胞压积（HCT）的过度产生。治疗目标是同时使这些疗效/安全性生物标志物的水平正常化，从而诱导完全血液学反应（CHR）。将RL算法Q-Learning （QL）与描述给维他汀对PLT、WBC和HCT影响的PK-PD模型相结合，得出适用于整个人群的自适应给药方案（QLpop-agent）和通过将特定的QL-agent耦合到每个患者的个性化给药策略（QLind-agents）。qpop -agent学习了一种通用的自适应给药方案，与10个模拟人群的实际给予维司他临床方案相比，该方案实现了相似的CHR率（77%对83%）。随着QLind-agents给药个性化程度的加深，治疗效果和安全性也随之提高。这些基于ql的患者特异性适应性给药规则优于临床方案和QLpop-agent，在93%的测试患者中达到了CHR，并且在整个治疗期间完全避免了严重的毒性。这些结果证实，RL和PK-PD模型可以作为支持自适应给药策略的有效工具，因为在学习一般规则集和为每个患者定制治疗方面都取得了有趣的表现。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊