ML Framework for Aggregating Individual-Level and averaged clinical data.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference Pub Date : 2025-07-01 DOI:10.1109/EMBC58623.2025.11254899

A Vaquero Castro, M Simeoni, E Grisan

{"title":"ML Framework for Aggregating Individual-Level and averaged clinical data.","authors":"A Vaquero Castro, M Simeoni, E Grisan","doi":"10.1109/EMBC58623.2025.11254899","DOIUrl":null,"url":null,"abstract":"<p><p>Pharmacokinetics-Pharmacodynamics (PK/PD) data analysis is a cornerstone of both drug development and efficacy and safety studies. However, individual-level PK/PD data are difficult to obtain, expensive, and scattered throughout different clinical trials, for which usually only aggregated statistics are publicly reported. Meta-Analysis (MA) approaches from simple MA to the more advanced multi-variate meta-regression, and Model-Based MA (MBMA) are among the available tools to interpret average-level data. Ideally, the availability of individual patient data (IPD) would allow methods based on parametric pharmacological models, such as MBMA, to provide a better characterization of the relationships between covariates and PK/PD parameters. We propose to leverage a generative-AI approach to regenerate the IPD data of cohorts with only population-level statistics, by exploiting the availability of a small set of IPD.To test the methodology, we simulate a scenario with different datasets related to different clinical studies. The generative model is trained using IPD from a single study and can then generate IPD data from the population statistics of all others. We show that our algorithm can successfully learn and apply the original relationships of the IPD study to regenerate information lost by averaging data for external reporting purposes. In order to validate and test the analysis, we carried out performance tests showing a good agreement between model-simulated ground truth data and ML-generated data.</p>","PeriodicalId":72237,"journal":{"name":"Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference","volume":"2025 ","pages":"1-4"},"PeriodicalIF":0.0000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EMBC58623.2025.11254899","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Pharmacokinetics-Pharmacodynamics (PK/PD) data analysis is a cornerstone of both drug development and efficacy and safety studies. However, individual-level PK/PD data are difficult to obtain, expensive, and scattered throughout different clinical trials, for which usually only aggregated statistics are publicly reported. Meta-Analysis (MA) approaches from simple MA to the more advanced multi-variate meta-regression, and Model-Based MA (MBMA) are among the available tools to interpret average-level data. Ideally, the availability of individual patient data (IPD) would allow methods based on parametric pharmacological models, such as MBMA, to provide a better characterization of the relationships between covariates and PK/PD parameters. We propose to leverage a generative-AI approach to regenerate the IPD data of cohorts with only population-level statistics, by exploiting the availability of a small set of IPD.To test the methodology, we simulate a scenario with different datasets related to different clinical studies. The generative model is trained using IPD from a single study and can then generate IPD data from the population statistics of all others. We show that our algorithm can successfully learn and apply the original relationships of the IPD study to regenerate information lost by averaging data for external reporting purposes. In order to validate and test the analysis, we carried out performance tests showing a good agreement between model-simulated ground truth data and ML-generated data.

查看原文本刊更多论文

汇总个人水平和平均临床数据的ML框架。

药代动力学-药效学（PK/PD）数据分析是药物开发和疗效和安全性研究的基石。然而，个体水平的PK/PD数据很难获得，价格昂贵，并且分散在不同的临床试验中，通常只有汇总的统计数据被公开报道。从简单的综合分析到更高级的多元元回归，以及基于模型的综合分析（MBMA）都是解释平均水平数据的可用工具。理想情况下，个体患者数据（IPD）的可用性将允许基于参数药理学模型（如MBMA）的方法更好地表征协变量与PK/PD参数之间的关系。我们建议利用一种生成人工智能方法，通过利用一小部分IPD的可用性，仅用人口水平的统计数据来重新生成队列的IPD数据。为了验证该方法，我们用与不同临床研究相关的不同数据集模拟了一个场景。生成模型使用来自单个研究的IPD进行训练，然后可以从所有其他研究的人口统计数据中生成IPD数据。我们表明，我们的算法可以成功地学习并应用IPD研究的原始关系，以重新生成因外部报告目的而平均数据而丢失的信息。为了验证和测试分析，我们进行了性能测试，显示模型模拟的地面真实数据与ml生成的数据之间有很好的一致性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference

CiteScore

0.80

自引率

0.00%

发文量