Bias correction via outcome reassignment for cross-sectional data with binary disease outcome.

IF 1.2 3区数学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS

Lifetime Data Analysis Pub Date : 2022-10-01 Epub Date: 2022-06-24 DOI:10.1007/s10985-022-09559-3

Mei-Cheng Wang, Yuxin Zhu

{"title":"Bias correction via outcome reassignment for cross-sectional data with binary disease outcome.","authors":"Mei-Cheng Wang, Yuxin Zhu","doi":"10.1007/s10985-022-09559-3","DOIUrl":null,"url":null,"abstract":"<p><p>Cross-sectionally sampled data with binary disease outcome are commonly analyzed in observational studies to identify the relationship between covariates and disease outcome. A cross-sectional population is defined as a population of living individuals at the sampling or observational time. It is generally understood that binary disease outcome from cross-sectional data contains less information than longitudinally collected time-to-event data, but there is insufficient understanding as to whether bias can possibly exist in cross-sectional data and how the bias is related to the population risk of interest. Wang and Yang (2021) presented the complexity and bias in cross-sectional data with binary disease outcome with detailed analytical explorations into the data structure. As the distribution of the cross-sectional binary outcome is quite different from the population risk distribution, bias can arise when using cross-sectional data analysis to draw inference for population risk. In this paper we argue that the commonly adopted age-specific risk probability is biased for the estimation of population risk and propose an outcome reassignment approach which reassigns a portion of the observed binary outcome, 0 or 1, to the other disease category. A sign test and a semiparametric pseudo-likelihood method are developed for analyzing cross-sectional data using the OR approach. Simulations and an analysis based on Alzheimer's Disease data are presented to illustrate the proposed methods.</p>","PeriodicalId":49908,"journal":{"name":"Lifetime Data Analysis","volume":" ","pages":"659-674"},"PeriodicalIF":1.2000,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Lifetime Data Analysis","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1007/s10985-022-09559-3","RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2022/6/24 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"MATHEMATICS, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 0

Abstract

Cross-sectionally sampled data with binary disease outcome are commonly analyzed in observational studies to identify the relationship between covariates and disease outcome. A cross-sectional population is defined as a population of living individuals at the sampling or observational time. It is generally understood that binary disease outcome from cross-sectional data contains less information than longitudinally collected time-to-event data, but there is insufficient understanding as to whether bias can possibly exist in cross-sectional data and how the bias is related to the population risk of interest. Wang and Yang (2021) presented the complexity and bias in cross-sectional data with binary disease outcome with detailed analytical explorations into the data structure. As the distribution of the cross-sectional binary outcome is quite different from the population risk distribution, bias can arise when using cross-sectional data analysis to draw inference for population risk. In this paper we argue that the commonly adopted age-specific risk probability is biased for the estimation of population risk and propose an outcome reassignment approach which reassigns a portion of the observed binary outcome, 0 or 1, to the other disease category. A sign test and a semiparametric pseudo-likelihood method are developed for analyzing cross-sectional data using the OR approach. Simulations and an analysis based on Alzheimer's Disease data are presented to illustrate the proposed methods.

Abstract Image

查看原文本刊更多论文

通过结果重新分配对具有二元疾病结果的横断面数据进行偏倚校正。

在观察性研究中，通常对具有二元疾病结局的横断面抽样数据进行分析，以确定协变量与疾病结局之间的关系。横断面人口被定义为在抽样或观察时间活的个体的人口。人们普遍认为，来自横断面数据的二元疾病结局所包含的信息少于纵向收集的事件时间数据，但对于横断面数据中是否可能存在偏倚以及偏倚如何与感兴趣的人群风险相关，人们的理解不足。Wang和Yang(2021)通过对数据结构的详细分析探索，提出了具有二元疾病结局的横断面数据的复杂性和偏倚性。由于横断面二值结果的分布与总体风险分布有很大的不同，在使用横断面数据分析进行总体风险推断时可能会产生偏差。在本文中，我们认为通常采用的年龄特异性风险概率对于估计人群风险是有偏差的，并提出了一种结果重新分配方法，该方法将观察到的二进制结果的一部分(0或1)重新分配给其他疾病类别。提出了一种符号检验和半参数伪似然方法，用于使用OR方法分析截面数据。基于阿尔茨海默病数据的仿真和分析说明了所提出的方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Lifetime Data Analysis 数学-数学跨学科应用

CiteScore

2.30

自引率

7.70%

发文量

审稿时长

3 months

期刊介绍： The objective of Lifetime Data Analysis is to advance and promote statistical science in the various applied fields that deal with lifetime data, including: Actuarial Science – Economics – Engineering Sciences – Environmental Sciences – Management Science – Medicine – Operations Research – Public Health – Social and Behavioral Sciences.