使用高斯 Copula 进行生存分析而无需共享单个患者数据

IF 1.3 4区 医学 Q4 PHARMACOLOGY & PHARMACY
Federico Bonofiglio
{"title":"使用高斯 Copula 进行生存分析而无需共享单个患者数据","authors":"Federico Bonofiglio","doi":"10.1002/pst.2415","DOIUrl":null,"url":null,"abstract":"<p><p>Cox regression and Kaplan-Meier estimations are often needed in clinical research and this requires access to individual patient data (IPD). However, IPD cannot always be shared because of privacy or proprietary restrictions, which complicates the making of such estimations. We propose a method that generates pseudodata replacing the IPD by only sharing non-disclosive aggregates such as IPD marginal moments and a correlation matrix. Such aggregates are collected by a central computer and input as parameters to a Gaussian copula (GC) that generates the pseudodata. Survival inferences are computed on the pseudodata as if it were the IPD. Using practical examples we demonstrate the utility of the method, via the amount of IPD inferential content recoverable by the GC. We compare GC to a summary-based meta-analysis and an IPD bootstrap distributed across several centers. Other pseudodata approaches are also considered. In the empirical results, GC approximates the utility of the IPD bootstrap although it might yield more conservative inferences and it might have limitations in subgroup analyses. Overall, GC avoids many legal problems related to IPD privacy or property while enabling approximation of common IPD survival analyses otherwise difficult to conduct. Sharing more IPD aggregates than is currently practiced could facilitate \"second purpose\"-research and relax concerns regarding IPD access.</p>","PeriodicalId":19934,"journal":{"name":"Pharmaceutical Statistics","volume":" ","pages":""},"PeriodicalIF":1.3000,"publicationDate":"2024-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Survival Analysis Without Sharing of Individual Patient Data by Using a Gaussian Copula.\",\"authors\":\"Federico Bonofiglio\",\"doi\":\"10.1002/pst.2415\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Cox regression and Kaplan-Meier estimations are often needed in clinical research and this requires access to individual patient data (IPD). However, IPD cannot always be shared because of privacy or proprietary restrictions, which complicates the making of such estimations. We propose a method that generates pseudodata replacing the IPD by only sharing non-disclosive aggregates such as IPD marginal moments and a correlation matrix. Such aggregates are collected by a central computer and input as parameters to a Gaussian copula (GC) that generates the pseudodata. Survival inferences are computed on the pseudodata as if it were the IPD. Using practical examples we demonstrate the utility of the method, via the amount of IPD inferential content recoverable by the GC. We compare GC to a summary-based meta-analysis and an IPD bootstrap distributed across several centers. Other pseudodata approaches are also considered. In the empirical results, GC approximates the utility of the IPD bootstrap although it might yield more conservative inferences and it might have limitations in subgroup analyses. Overall, GC avoids many legal problems related to IPD privacy or property while enabling approximation of common IPD survival analyses otherwise difficult to conduct. Sharing more IPD aggregates than is currently practiced could facilitate \\\"second purpose\\\"-research and relax concerns regarding IPD access.</p>\",\"PeriodicalId\":19934,\"journal\":{\"name\":\"Pharmaceutical Statistics\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":1.3000,\"publicationDate\":\"2024-07-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Pharmaceutical Statistics\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1002/pst.2415\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"PHARMACOLOGY & PHARMACY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pharmaceutical Statistics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1002/pst.2415","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"PHARMACOLOGY & PHARMACY","Score":null,"Total":0}
引用次数: 0

摘要

临床研究中经常需要进行 Cox 回归和 Kaplan-Meier 估计,这就需要获取患者的个人数据(IPD)。然而,由于隐私或专有权的限制,IPD 并不总能共享,这就使此类估算变得更加复杂。我们提出了一种方法,通过只共享 IPD 边际矩和相关矩阵等非披露性总体数据来生成替代 IPD 的伪数据。这些总体数据由中央计算机收集,并作为参数输入到生成伪数据的高斯共线公式(GC)中。对伪数据进行生存推断计算时,就像计算 IPD 一样。通过实际案例,我们展示了该方法的实用性,即 GC 可恢复 IPD 推断内容的数量。我们将 GC 与基于摘要的荟萃分析和分布在多个中心的 IPD 引导分析进行了比较。我们还考虑了其他伪数据方法。在实证结果中,GC 近似于 IPD 自举法的效用,尽管它可能产生更保守的推论,而且在亚组分析中可能有局限性。总的来说,GC 可以避免许多与 IPD 隐私或财产相关的法律问题,同时还能近似地进行普通 IPD 生存分析,否则很难进行。与目前的做法相比,共享更多的 IPD 总量可促进 "第二目的 "研究,并放松对 IPD 访问的担忧。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Survival Analysis Without Sharing of Individual Patient Data by Using a Gaussian Copula.

Cox regression and Kaplan-Meier estimations are often needed in clinical research and this requires access to individual patient data (IPD). However, IPD cannot always be shared because of privacy or proprietary restrictions, which complicates the making of such estimations. We propose a method that generates pseudodata replacing the IPD by only sharing non-disclosive aggregates such as IPD marginal moments and a correlation matrix. Such aggregates are collected by a central computer and input as parameters to a Gaussian copula (GC) that generates the pseudodata. Survival inferences are computed on the pseudodata as if it were the IPD. Using practical examples we demonstrate the utility of the method, via the amount of IPD inferential content recoverable by the GC. We compare GC to a summary-based meta-analysis and an IPD bootstrap distributed across several centers. Other pseudodata approaches are also considered. In the empirical results, GC approximates the utility of the IPD bootstrap although it might yield more conservative inferences and it might have limitations in subgroup analyses. Overall, GC avoids many legal problems related to IPD privacy or property while enabling approximation of common IPD survival analyses otherwise difficult to conduct. Sharing more IPD aggregates than is currently practiced could facilitate "second purpose"-research and relax concerns regarding IPD access.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Pharmaceutical Statistics
Pharmaceutical Statistics 医学-统计学与概率论
CiteScore
2.70
自引率
6.70%
发文量
90
审稿时长
6-12 weeks
期刊介绍: Pharmaceutical Statistics is an industry-led initiative, tackling real problems in statistical applications. The Journal publishes papers that share experiences in the practical application of statistics within the pharmaceutical industry. It covers all aspects of pharmaceutical statistical applications from discovery, through pre-clinical development, clinical development, post-marketing surveillance, consumer health, production, epidemiology, and health economics. The Journal is both international and multidisciplinary. It includes high quality practical papers, case studies and review papers.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信