Geographically weighted accelerated failure time model for spatial survival data: application to ovarian cancer survival data in New Jersey.

IF 3.9 3区 医学 Q1 HEALTH CARE SCIENCES & SERVICES
Jiaxin Cai, Yemian Li, Weiwei Hu, Hui Jing, Baibing Mi, Leilei Pei, Yaling Zhao, Hong Yan, Fangyao Chen
{"title":"Geographically weighted accelerated failure time model for spatial survival data: application to ovarian cancer survival data in New Jersey.","authors":"Jiaxin Cai, Yemian Li, Weiwei Hu, Hui Jing, Baibing Mi, Leilei Pei, Yaling Zhao, Hong Yan, Fangyao Chen","doi":"10.1186/s12874-024-02346-8","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>In large multiregional cohort studies, survival data is often collected at small geographical levels (such as counties) and aggregated at larger levels, leading to correlated patterns that are associated with location. Traditional studies typically analyze such data globally or locally by region, often neglecting the spatial information inherent in the data, which can introduce bias in effect estimates and potentially reduce statistical power.</p><p><strong>Method: </strong>We propose a Geographically Weighted Accelerated Failure Time Model for spatial survival data to investigate spatial heterogeneity. We establish a weighting scheme and bandwidth selection based on quasi-likelihood information criteria. Theoretical properties of the proposed estimators are thoroughly examined. To demonstrate the efficacy of the model in various scenarios, we conduct a simulation study with different sample sizes and adherence to the proportional hazards assumption or not. Additionally, we apply the proposed method to analyze ovarian cancer survival data from the Surveillance, Epidemiology, and End Results cancer registry in the state of New Jersey.</p><p><strong>Results: </strong>Our simulation results indicate that the proposed model exhibits superior performance in terms of four measurements compared to existing methods, including the geographically weighted Cox model, when the proportional hazards assumption is violated. Furthermore, in scenarios where the sample size per location is 20-25, the simulation data failed to fit the local model, while our proposed model still demonstrates satisfactory performance. In the empirical study, we identify clear spatial variations in the effects of all three covariates.</p><p><strong>Conclusion: </strong>Our proposed model offers a novel approach to exploring spatial heterogeneity of survival data compared to global and local models, providing an alternative to geographically weighted Cox regression when the proportional hazards assumption is not met. It addresses the issue of certain counties' survival data being unable to fit the model due to limited samples, particularly in the context of rare diseases.</p>","PeriodicalId":9114,"journal":{"name":"BMC Medical Research Methodology","volume":"24 1","pages":"239"},"PeriodicalIF":3.9000,"publicationDate":"2024-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11476607/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Research Methodology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12874-024-02346-8","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0

Abstract

Background: In large multiregional cohort studies, survival data is often collected at small geographical levels (such as counties) and aggregated at larger levels, leading to correlated patterns that are associated with location. Traditional studies typically analyze such data globally or locally by region, often neglecting the spatial information inherent in the data, which can introduce bias in effect estimates and potentially reduce statistical power.

Method: We propose a Geographically Weighted Accelerated Failure Time Model for spatial survival data to investigate spatial heterogeneity. We establish a weighting scheme and bandwidth selection based on quasi-likelihood information criteria. Theoretical properties of the proposed estimators are thoroughly examined. To demonstrate the efficacy of the model in various scenarios, we conduct a simulation study with different sample sizes and adherence to the proportional hazards assumption or not. Additionally, we apply the proposed method to analyze ovarian cancer survival data from the Surveillance, Epidemiology, and End Results cancer registry in the state of New Jersey.

Results: Our simulation results indicate that the proposed model exhibits superior performance in terms of four measurements compared to existing methods, including the geographically weighted Cox model, when the proportional hazards assumption is violated. Furthermore, in scenarios where the sample size per location is 20-25, the simulation data failed to fit the local model, while our proposed model still demonstrates satisfactory performance. In the empirical study, we identify clear spatial variations in the effects of all three covariates.

Conclusion: Our proposed model offers a novel approach to exploring spatial heterogeneity of survival data compared to global and local models, providing an alternative to geographically weighted Cox regression when the proportional hazards assumption is not met. It addresses the issue of certain counties' survival data being unable to fit the model due to limited samples, particularly in the context of rare diseases.

空间生存数据的地理加权加速失败时间模型:应用于新泽西州的卵巢癌生存数据。
背景:在大型多区域队列研究中,生存数据通常在较小的地理层面(如县)收集,然后在较大的层面汇总,从而产生与地点相关的关联模式。传统的研究通常按地区对这些数据进行全球或局部分析,往往忽略了数据中固有的空间信息,这会给效应估计带来偏差,并可能降低统计能力:方法:我们针对空间生存数据提出了地理加权加速失效时间模型,以研究空间异质性。我们建立了基于准似然法信息标准的加权方案和带宽选择。我们对所提出的估计器的理论特性进行了深入研究。为了证明该模型在各种情况下的有效性,我们进行了一项模拟研究,研究了不同的样本大小和是否遵守比例危险假设。此外,我们还应用所提出的方法分析了来自新泽西州癌症监测、流行病学和最终结果登记处的卵巢癌生存数据:我们的模拟结果表明,当违反比例危险假设时,与包括地理加权 Cox 模型在内的现有方法相比,所提出的模型在四项测量指标上表现出更优越的性能。此外,在每个地点的样本量为 20-25 个的情况下,模拟数据与本地模型不匹配,而我们提出的模型仍然表现出令人满意的性能。在实证研究中,我们发现所有三个协变量的影响都存在明显的空间变化:与全局模型和局部模型相比,我们提出的模型为探索生存数据的空间异质性提供了一种新方法,在不满足比例危险假设的情况下,提供了地域加权 Cox 回归的替代方案。它解决了某些县的生存数据因样本有限而无法适合模型的问题,特别是在罕见病的情况下。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
BMC Medical Research Methodology
BMC Medical Research Methodology 医学-卫生保健
CiteScore
6.50
自引率
2.50%
发文量
298
审稿时长
3-8 weeks
期刊介绍: BMC Medical Research Methodology is an open access journal publishing original peer-reviewed research articles in methodological approaches to healthcare research. Articles on the methodology of epidemiological research, clinical trials and meta-analysis/systematic review are particularly encouraged, as are empirical studies of the associations between choice of methodology and study outcomes. BMC Medical Research Methodology does not aim to publish articles describing scientific methods or techniques: these should be directed to the BMC journal covering the relevant biomedical subject area.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信