Jun Tamura, Yusuke Saigusa, Junichi Fujita, Kouji Yamamoto
{"title":"Bayesian method for comparing F1 scores in the absence of a gold standard.","authors":"Jun Tamura, Yusuke Saigusa, Junichi Fujita, Kouji Yamamoto","doi":"10.1080/10543406.2025.2450319","DOIUrl":null,"url":null,"abstract":"<p><p>In the field of medicine, evaluating the diagnostic performance of new diagnostic methods can be challenging, especially in the absence of a gold standard. This study proposes a methodology for assessing the performance of diagnostic tests by estimating the posterior distribution of the <math><mrow><msub><mi>F</mi><mn>1</mn></msub></mrow></math> score using latent class analysis, without relying on a gold standard. The proposed method utilizes Markov Chain Monte Carlo sampling to estimate the posterior distribution of the <math><mrow><msub><mi>F</mi><mn>1</mn></msub></mrow></math> score, enabling a comprehensive evaluation of diagnostic test methods. By applying this method to internet addiction, we demonstrate how latent class analysis can be effectively used to assess diagnostic performance, offering a practical solution for situations where no gold standard is available. The effectiveness of the proposed approach was evaluated through simulation studies by examining the coverage probability of the 95% highest density interval of the estimated posterior distributions.</p>","PeriodicalId":54870,"journal":{"name":"Journal of Biopharmaceutical Statistics","volume":" ","pages":"1-11"},"PeriodicalIF":1.2000,"publicationDate":"2025-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Biopharmaceutical Statistics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1080/10543406.2025.2450319","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"PHARMACOLOGY & PHARMACY","Score":null,"Total":0}
引用次数: 0
Abstract
In the field of medicine, evaluating the diagnostic performance of new diagnostic methods can be challenging, especially in the absence of a gold standard. This study proposes a methodology for assessing the performance of diagnostic tests by estimating the posterior distribution of the score using latent class analysis, without relying on a gold standard. The proposed method utilizes Markov Chain Monte Carlo sampling to estimate the posterior distribution of the score, enabling a comprehensive evaluation of diagnostic test methods. By applying this method to internet addiction, we demonstrate how latent class analysis can be effectively used to assess diagnostic performance, offering a practical solution for situations where no gold standard is available. The effectiveness of the proposed approach was evaluated through simulation studies by examining the coverage probability of the 95% highest density interval of the estimated posterior distributions.
期刊介绍:
The Journal of Biopharmaceutical Statistics, a rapid publication journal, discusses quality applications of statistics in biopharmaceutical research and development. Now publishing six times per year, it includes expositions of statistical methodology with immediate applicability to biopharmaceutical research in the form of full-length and short manuscripts, review articles, selected/invited conference papers, short articles, and letters to the editor. Addressing timely and provocative topics important to the biostatistical profession, the journal covers:
Drug, device, and biological research and development;
Drug screening and drug design;
Assessment of pharmacological activity;
Pharmaceutical formulation and scale-up;
Preclinical safety assessment;
Bioavailability, bioequivalence, and pharmacokinetics;
Phase, I, II, and III clinical development including complex innovative designs;
Premarket approval assessment of clinical safety;
Postmarketing surveillance;
Big data and artificial intelligence and applications.