针对分布式数据的考克斯比例危害模型中隐私增强型协作推理

Mengtong Hu, Xu Shi, Peter X. -K. Song
{"title":"针对分布式数据的考克斯比例危害模型中隐私增强型协作推理","authors":"Mengtong Hu, Xu Shi, Peter X. -K. Song","doi":"arxiv-2409.04716","DOIUrl":null,"url":null,"abstract":"Data sharing barriers are paramount challenges arising from multicenter\nclinical studies where multiple data sources are stored in a distributed\nfashion at different local study sites. Particularly in the case of\ntime-to-event analysis when global risk sets are needed for the Cox\nproportional hazards model, access to a centralized database is typically\nnecessary. Merging such data sources into a common data storage for a\ncentralized statistical analysis requires a data use agreement, which is often\ntime-consuming. Furthermore, the construction and distribution of risk sets to\nparticipating clinical centers for subsequent calculations may pose a risk of\nrevealing individual-level information. We propose a new collaborative Cox\nmodel that eliminates the need for accessing the centralized database and\nconstructing global risk sets but needs only the sharing of summary statistics\nwith significantly smaller dimensions than risk sets. Thus, the proposed\ncollaborative inference enjoys maximal protection of data privacy. We show\ntheoretically and numerically that the new distributed proportional hazards\nmodel approach has little loss of statistical power when compared to the\ncentralized method that requires merging the entire data. We present a\nrenewable sieve method to establish large-sample properties for the proposed\nmethod. We illustrate its performance through simulation experiments and a\nreal-world data example from patients with kidney transplantation in the Organ\nProcurement and Transplantation Network (OPTN) to understand the factors\nassociated with the 5-year death-censored graft failure (DCGF) for patients who\nunderwent kidney transplants in the US.","PeriodicalId":501379,"journal":{"name":"arXiv - STAT - Statistics Theory","volume":"33 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Privacy enhanced collaborative inference in the Cox proportional hazards model for distributed data\",\"authors\":\"Mengtong Hu, Xu Shi, Peter X. -K. Song\",\"doi\":\"arxiv-2409.04716\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Data sharing barriers are paramount challenges arising from multicenter\\nclinical studies where multiple data sources are stored in a distributed\\nfashion at different local study sites. Particularly in the case of\\ntime-to-event analysis when global risk sets are needed for the Cox\\nproportional hazards model, access to a centralized database is typically\\nnecessary. Merging such data sources into a common data storage for a\\ncentralized statistical analysis requires a data use agreement, which is often\\ntime-consuming. Furthermore, the construction and distribution of risk sets to\\nparticipating clinical centers for subsequent calculations may pose a risk of\\nrevealing individual-level information. We propose a new collaborative Cox\\nmodel that eliminates the need for accessing the centralized database and\\nconstructing global risk sets but needs only the sharing of summary statistics\\nwith significantly smaller dimensions than risk sets. Thus, the proposed\\ncollaborative inference enjoys maximal protection of data privacy. We show\\ntheoretically and numerically that the new distributed proportional hazards\\nmodel approach has little loss of statistical power when compared to the\\ncentralized method that requires merging the entire data. We present a\\nrenewable sieve method to establish large-sample properties for the proposed\\nmethod. We illustrate its performance through simulation experiments and a\\nreal-world data example from patients with kidney transplantation in the Organ\\nProcurement and Transplantation Network (OPTN) to understand the factors\\nassociated with the 5-year death-censored graft failure (DCGF) for patients who\\nunderwent kidney transplants in the US.\",\"PeriodicalId\":501379,\"journal\":{\"name\":\"arXiv - STAT - Statistics Theory\",\"volume\":\"33 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - STAT - Statistics Theory\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.04716\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - STAT - Statistics Theory","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.04716","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

在多中心临床研究中,多个数据源以分布式方式存储在不同的研究地点,数据共享障碍是这些研究面临的最大挑战。特别是在时间到事件分析中,当需要为共比例危险模型建立全局风险集时,通常需要访问集中式数据库。将这些数据源合并到一个共同的数据存储器中以进行集中统计分析,需要签订数据使用协议,这通常很耗费时间。此外,在参与计算的临床中心之间构建和分发风险集可能会带来暴露个体水平信息的风险。我们提出了一种新的协作 Cox 模型,该模型无需访问中央数据库和构建全局风险集,而只需共享维度明显小于风险集的汇总统计数据。因此,所提出的协作推理能最大程度地保护数据隐私。我们从理论和数值上证明,与需要合并整个数据的集中式方法相比,新的分布式比例危险模型方法几乎不会损失统计能力。我们提出了一种可筛分方法,以建立所提议方法的大样本属性。我们通过模拟实验和器官采购与移植网络(OPTN)中肾移植患者的真实世界数据实例来说明该方法的性能,从而了解与美国肾移植患者 5 年死亡删失移植物失败(DCGF)相关的因素。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Privacy enhanced collaborative inference in the Cox proportional hazards model for distributed data
Data sharing barriers are paramount challenges arising from multicenter clinical studies where multiple data sources are stored in a distributed fashion at different local study sites. Particularly in the case of time-to-event analysis when global risk sets are needed for the Cox proportional hazards model, access to a centralized database is typically necessary. Merging such data sources into a common data storage for a centralized statistical analysis requires a data use agreement, which is often time-consuming. Furthermore, the construction and distribution of risk sets to participating clinical centers for subsequent calculations may pose a risk of revealing individual-level information. We propose a new collaborative Cox model that eliminates the need for accessing the centralized database and constructing global risk sets but needs only the sharing of summary statistics with significantly smaller dimensions than risk sets. Thus, the proposed collaborative inference enjoys maximal protection of data privacy. We show theoretically and numerically that the new distributed proportional hazards model approach has little loss of statistical power when compared to the centralized method that requires merging the entire data. We present a renewable sieve method to establish large-sample properties for the proposed method. We illustrate its performance through simulation experiments and a real-world data example from patients with kidney transplantation in the Organ Procurement and Transplantation Network (OPTN) to understand the factors associated with the 5-year death-censored graft failure (DCGF) for patients who underwent kidney transplants in the US.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信