Satoshi Uno, Toshiro Tango. Annals of clinical epidemiology, vol. 6, no. 4, pp. 77-86. Published 2024-07-18 (eCollection 2024-10-01). DOI: 10.37737/ace.24012. Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11668686/pdf/
Bayesian Latent Class Models for Evaluating the Validity of Claim-based Definitions of Disease Outcomes.
Background: Large electronic databases have been widely used in recent years; however, they can be susceptible to bias due to incomplete information. To address this, validation studies have been conducted to assess the accuracy of disease diagnoses defined in databases. However, such studies may be limited by potential misclassification in the reference standard and by interdependence between diagnoses drawn from the same data source.
Methods: This study employs latent class modeling with Bayesian inference to estimate the sensitivity, specificity, and positive/negative predictive values of different diagnostic definitions. Four models are defined with or without the assumptions of a gold-standard reference and of conditional independence, and are then compared using breast cancer study data as a motivating example. Additionally, simulations generating data under various true values are used to compare the performance of each model in terms of bias, Pearson-type goodness-of-fit statistics, and the widely applicable information criterion (WAIC).
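To make the estimation idea concrete, the following is a minimal sketch (not the authors' implementation) of a Bayesian latent class model for three claims-based definitions treated as imperfect binary tests, with no gold standard. It assumes conditional independence given true disease status and flat Beta(1,1) priors, and estimates prevalence, sensitivities, and specificities with a simple Gibbs sampler; all names and "true" values are hypothetical, chosen only for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

# --- Simulate data: 3 claims-based definitions (binary tests), no gold standard.
# True values below are hypothetical, for illustration only.
N, pi_true = 2000, 0.30
se_true = np.array([0.85, 0.80, 0.90])
sp_true = np.array([0.90, 0.95, 0.85])

d = rng.random(N) < pi_true                      # latent true disease status
p = np.where(d[:, None], se_true, 1 - sp_true)   # P(test positive | status)
t = (rng.random((N, 3)) < p).astype(int)         # observed test results (N x 3)

# --- Gibbs sampler under conditional independence, Beta(1,1) priors.
pi, se, sp = 0.5, np.full(3, 0.8), np.full(3, 0.8)
draws = []
for it in range(3000):
    # Posterior probability of disease for each subject given its test pattern.
    l1 = pi * np.prod(se**t * (1 - se)**(1 - t), axis=1)
    l0 = (1 - pi) * np.prod((1 - sp)**t * sp**(1 - t), axis=1)
    z = rng.random(N) < l1 / (l1 + l0)           # sample latent classes

    n1 = z.sum()
    pi = rng.beta(1 + n1, 1 + N - n1)            # update prevalence
    pos1 = t[z].sum(axis=0)                      # positives among diseased
    pos0 = t[~z].sum(axis=0)                     # positives among non-diseased
    se = rng.beta(1 + pos1, 1 + n1 - pos1)       # update sensitivities
    sp = rng.beta(1 + (N - n1) - pos0, 1 + pos0) # update specificities
    if it >= 1000:                               # discard burn-in
        draws.append(np.concatenate(([pi], se, sp)))

post = np.mean(draws, axis=0)                    # posterior means
# Predictive values at the posterior means, via Bayes' rule.
ppv = post[0] * post[1:4] / (post[0] * post[1:4] + (1 - post[0]) * (1 - post[4:7]))
npv = (1 - post[0]) * post[4:7] / ((1 - post[0]) * post[4:7] + post[0] * (1 - post[1:4]))
print("pi=%.3f se=%s sp=%s" % (post[0], np.round(post[1:4], 3), np.round(post[4:7], 3)))
```

With three tests on one population this model is just identified under conditional independence (seven parameters, seven degrees of freedom); the paper's point is precisely that relaxing that independence assumption, as its conditional-dependence models do, requires richer structure than this sketch.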
Results: The model assuming conditional dependence and non-gold-standard references exhibited the best predictive performance among the four models in the motivating-example analysis. Under this model, the estimated disease prevalence was slightly higher than in previous reports, and the estimated sensitivities were considerably lower than those from the other models. In the bias evaluation, the Bayesian models with more assumptions and the frequentist model performed better when the data were generated under their assumed true-value conditions, whereas the Bayesian model with fewer assumptions performed well in terms of goodness of fit and WAIC.
Conclusions: Current approaches to outcome validation can introduce bias. The proposed approach can be broadly adopted as a valuable method for validation studies.