An extension of estimating equations to model longitudinal medical cost trajectory with Medicare claims data linked to SEER cancer registry.

IF 1.4 4区 数学 Q2 STATISTICS & PROBABILITY
Annals of Applied Statistics Pub Date : 2023-03-01 Epub Date: 2023-01-24
Shikun Wang, Jing Ning, Ying Xu, Ya-Chen Tina Shih, Y U Shen, Liang Li
{"title":"An extension of estimating equations to model longitudinal medical cost trajectory with Medicare claims data linked to SEER cancer registry.","authors":"Shikun Wang, Jing Ning, Ying Xu, Ya-Chen Tina Shih, Y U Shen, Liang Li","doi":"","DOIUrl":null,"url":null,"abstract":"<p><p>Insurance claims data is an increasingly important health policy research resource, given its longitudinal assessment of cancer care clinical outcomes. Population-level information on medical cost trajectory from disease diagnosis to terminal events, such as death, specifically interests policy makers. Estimating the mean cost trajectory has statistical challenges. The shape of the trajectory is usually highly nonlinear with varying durations, depending on the diagnosis-to-death population time distribution. The terminal event may be right censored, resulting in missing subsequent costs. Medical costs often have skewed distributions with zero-inflation and heteroscedasticity, which may not fit well with the commonly used parametric family of distributions. In this paper, we propose a flexible semi-parametric model to address challenges without imposing a cost data distributional assumption. The estimation procedure is based on generalized estimating equations with censored covariates. The proposed model adopts a bivariate surface that quantifies the interrelationship between longitudinal medical costs and survival, and results in the nonlinear population mean cost trajectory conditional on the death time. We develop a novel generalized estimating equations algorithm to accommodate covariates subject to right-censoring, without fully specifying the joint distribution of the cost and survival data. We provide theoretical and simulation-based justification for the proposed approach, and apply the methods to estimate prostate cancer patient cost trajectories from the Surveillance, Epidemiology, and End Results (SEER)-Medicare linked database.</p>","PeriodicalId":50772,"journal":{"name":"Annals of Applied Statistics","volume":"17 1","pages":"881-899"},"PeriodicalIF":1.4000,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12061044/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annals of Applied Statistics","FirstCategoryId":"100","ListUrlMain":"","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/1/24 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 0

Abstract

Insurance claims data is an increasingly important health policy research resource, given its longitudinal assessment of cancer care clinical outcomes. Population-level information on medical cost trajectory from disease diagnosis to terminal events, such as death, specifically interests policy makers. Estimating the mean cost trajectory has statistical challenges. The shape of the trajectory is usually highly nonlinear with varying durations, depending on the diagnosis-to-death population time distribution. The terminal event may be right censored, resulting in missing subsequent costs. Medical costs often have skewed distributions with zero-inflation and heteroscedasticity, which may not fit well with the commonly used parametric family of distributions. In this paper, we propose a flexible semi-parametric model to address challenges without imposing a cost data distributional assumption. The estimation procedure is based on generalized estimating equations with censored covariates. The proposed model adopts a bivariate surface that quantifies the interrelationship between longitudinal medical costs and survival, and results in the nonlinear population mean cost trajectory conditional on the death time. We develop a novel generalized estimating equations algorithm to accommodate covariates subject to right-censoring, without fully specifying the joint distribution of the cost and survival data. We provide theoretical and simulation-based justification for the proposed approach, and apply the methods to estimate prostate cancer patient cost trajectories from the Surveillance, Epidemiology, and End Results (SEER)-Medicare linked database.

估计方程的扩展,以模拟纵向医疗成本轨迹与医疗保险索赔数据与SEER癌症登记。
保险索赔数据是越来越重要的卫生政策研究资源,因为它对癌症治疗临床结果进行了纵向评估。从疾病诊断到死亡等终末事件的医疗费用轨迹的人口层面信息,特别使决策者感兴趣。估算平均成本轨迹具有统计学上的挑战。轨迹的形状通常是高度非线性的,其持续时间随诊断到死亡的总体时间分布而变化。最终事件可能被正确审查,导致后续成本的缺失。医疗费用通常具有零通货膨胀和异方差的偏态分布,这可能不适合常用的参数分布族。在本文中,我们提出了一个灵活的半参数模型来解决挑战,而不强加成本数据分布假设。估计过程基于带有截尾协变量的广义估计方程。该模型采用双变量曲面来量化纵向医疗费用与生存之间的相互关系,并得到以死亡时间为条件的非线性总体平均费用轨迹。我们开发了一种新的广义估计方程算法,以适应受右审查的协变量,而不完全指定成本和生存数据的联合分布。我们为所提出的方法提供了理论和基于模拟的理由,并应用这些方法从监测、流行病学和最终结果(SEER)-医疗保险关联数据库中估计前列腺癌患者的成本轨迹。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Annals of Applied Statistics
Annals of Applied Statistics 社会科学-统计学与概率论
CiteScore
3.10
自引率
5.60%
发文量
131
审稿时长
6-12 weeks
期刊介绍: Statistical research spans an enormous range from direct subject-matter collaborations to pure mathematical theory. The Annals of Applied Statistics, the newest journal from the IMS, is aimed at papers in the applied half of this range. Published quarterly in both print and electronic form, our goal is to provide a timely and unified forum for all areas of applied statistics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信