Better tools for better estimates: improving approaches to handling missing data in Swiss cancer registries.

IF 2.1 4区 医学 Q3 ONCOLOGY
European Journal of Cancer Prevention Pub Date : 2024-09-01 Epub Date: 2024-03-06 DOI:10.1097/CEJ.0000000000000881
Cornelia Richter, Lea Wildisen, Sabine Rohrmann, Sarah R Haile
{"title":"Better tools for better estimates: improving approaches to handling missing data in Swiss cancer registries.","authors":"Cornelia Richter, Lea Wildisen, Sabine Rohrmann, Sarah R Haile","doi":"10.1097/CEJ.0000000000000881","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>Various approaches have been used in the literature to handle missing vital status data in cancer registries. We aimed to compare these approaches to determine which led to the least biased estimates in typical analytic tasks of cancer registries.</p><p><strong>Methods: </strong>A simulation study was performed using data from the Swiss National Agency for Cancer Registration for six tumor types. First, 5%, 10% and 15% missingness in the vital status were introduced artificially in the complete data. Second, missing vital status data were handled by applying no, single or multiple imputations. Five-year overall survival estimates, relative survival or standardized incidence ratio were computed. Estimates were compared with the true value.</p><p><strong>Results: </strong>Standardized incidence ratio estimates for colorectal cancer obtained with multiple imputation yielded least biased results (-0.06 to -0.04), but the widest confidence intervals. Single imputation was more biased (-0.32) than using no imputation at all (-0.21). A similar pattern was observed for overall survival and relative survival.</p><p><strong>Conclusion: </strong>This simulation study indicated that often used single imputation (sometimes referred to as simulating follow-up times) techniques to fill in missing vital status data are likely too biased to be useful in practice. Multiple imputation approaches yielded standardized incidence ratio, overall and relative survival estimates with the least bias, indicating reasonable performance that is likely to generalize to other settings.</p>","PeriodicalId":11830,"journal":{"name":"European Journal of Cancer Prevention","volume":" ","pages":"400-406"},"PeriodicalIF":2.1000,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Cancer Prevention","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1097/CEJ.0000000000000881","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/3/6 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Objective: Various approaches have been used in the literature to handle missing vital status data in cancer registries. We aimed to compare these approaches to determine which led to the least biased estimates in typical analytic tasks of cancer registries.

Methods: A simulation study was performed using data from the Swiss National Agency for Cancer Registration for six tumor types. First, 5%, 10% and 15% missingness in the vital status were introduced artificially in the complete data. Second, missing vital status data were handled by applying no, single or multiple imputations. Five-year overall survival estimates, relative survival or standardized incidence ratio were computed. Estimates were compared with the true value.

Results: Standardized incidence ratio estimates for colorectal cancer obtained with multiple imputation yielded least biased results (-0.06 to -0.04), but the widest confidence intervals. Single imputation was more biased (-0.32) than using no imputation at all (-0.21). A similar pattern was observed for overall survival and relative survival.

Conclusion: This simulation study indicated that often used single imputation (sometimes referred to as simulating follow-up times) techniques to fill in missing vital status data are likely too biased to be useful in practice. Multiple imputation approaches yielded standardized incidence ratio, overall and relative survival estimates with the least bias, indicating reasonable performance that is likely to generalize to other settings.

更好的工具促进更好的估算:改进瑞士癌症登记中缺失数据的处理方法。
目的:文献中使用了多种方法来处理癌症登记中缺失的生命状态数据。我们旨在对这些方法进行比较,以确定在癌症登记的典型分析任务中,哪种方法导致的估计值偏差最小:我们使用瑞士国家癌症登记局提供的六种肿瘤类型的数据进行了模拟研究。首先,在完整数据中人为引入 5%、10% 和 15%的生命体征缺失率。其次,对生命体征数据的缺失采用无缺失、单一缺失或多重缺失处理。计算五年总生存率估计值、相对生存率或标准化发病率比。将估计值与真实值进行比较:结果:采用多重估算得出的结直肠癌标准化发病率估计值偏差最小(-0.06 至 -0.04),但置信区间最宽。与完全不使用归因法(-0.21)相比,单次归因法的偏差更大(-0.32)。总生存率和相对生存率也观察到类似的模式:这项模拟研究表明,常用的单项归因(有时也称为模拟随访时间)技术在填补缺失的生命体征数据时很可能偏差过大,无法在实践中发挥作用。多重估算方法得出的标准化发病率、总生存率和相对生存率估计值偏差最小,表明其性能合理,有可能推广到其他环境中。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
4.10
自引率
4.20%
发文量
96
审稿时长
1 months
期刊介绍: European Journal of Cancer Prevention aims to promote an increased awareness of all aspects of cancer prevention and to stimulate new ideas and innovations. The Journal has a wide-ranging scope, covering such aspects as descriptive and metabolic epidemiology, histopathology, genetics, biochemistry, molecular biology, microbiology, clinical medicine, intervention trials and public education, basic laboratory studies and special group studies. Although affiliated to a European organization, the journal addresses issues of international importance.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信