Viktoria Gastens, Arnaud Chiolero, Martin Feller, Douglas C Bauer, Nicolas Rodondi, Cinzia Del Giovane
{"title":"Development and internal validation of a new life expectancy estimator for multimorbid older adults.","authors":"Viktoria Gastens, Arnaud Chiolero, Martin Feller, Douglas C Bauer, Nicolas Rodondi, Cinzia Del Giovane","doi":"10.1186/s41512-025-00185-9","DOIUrl":"10.1186/s41512-025-00185-9","url":null,"abstract":"<p><strong>Background: </strong>As populations are aging, the number of older patients with multiple chronic diseases demanding complex care increases. Although clinical guidelines recommend care to be personalized accounting for life expectancy, there are no tools to estimate life expectancy among multimorbid patients. Our objective was therefore to develop and internally validate a life expectancy estimator specifically for older multimorbid adults.</p><p><strong>Methods: </strong>We analyzed data from the OPERAM (OPtimising thERapy to prevent avoidable hospital admissions in multimorbid older people) study in Bern, Switzerland. Participants aged 70 years old or more with multimorbidity (3 or more chronic medical conditions) and polypharmacy (use of 5 drugs or more for > 30 days) were included. All-cause mortality was assessed during 3 years of follow-up. We built a 3-year mortality prognostic index and transformed this index into a life expectancy estimator. Mortality risk candidate predictors included demographic variables (age, sex), clinical characteristics (metastatic cancer, number of drugs, body mass index, weight loss), smoking, functional status variables (Barthel-Index, falls, nursing home residence), and hospitalization. We internally validated and optimism corrected the model using bootstrapping techniques. We transformed the mortality prognostic index into a life expectancy estimator using the Gompertz survival function.</p><p><strong>Results: </strong>Eight hundred five participants were included in the analysis. During 3 years of follow-up, 292 participants (36%) died. Age, metastatic cancer, number of drugs, lower body mass index, weight loss, number of hospitalizations, and lower Barthel-Index (functional impairment) were selected as predictors in the final multivariable model. Our model showed moderate discrimination with an optimism-corrected C statistic of 0.70. The optimism-corrected calibration slope was 0.96. The Gompertz-predicted mean life expectancy in our sample was 5.4 years (standard deviation 3.5 years). Categorization into three life expectancy groups led to visually good separation in Kaplan-Meier curves. We also developed a web application that calculates an individual's life expectancy estimation.</p><p><strong>Conclusion: </strong>A life expectancy estimator for multimorbid older adults based on an internally validated 3-year mortality risk index was developed. Further validation of the score among various populations of multimorbid patients is needed before its implementation into practice.</p><p><strong>Trial registration: </strong>ClinicalTrials.gov NCT02986425. First submitted 21/10/2016. First posted 08/12/2016.</p>","PeriodicalId":72800,"journal":{"name":"Diagnostic and prognostic research","volume":"9 1","pages":"5"},"PeriodicalIF":0.0,"publicationDate":"2025-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11877760/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143544800","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Akshay Swaminathan, Ujwal Srivastava, Lucia Tu, Ivan Lopez, Nigam H Shah, Andrew J Vickers
{"title":"Against reflexive recalibration: towards a causal framework for addressing miscalibration.","authors":"Akshay Swaminathan, Ujwal Srivastava, Lucia Tu, Ivan Lopez, Nigam H Shah, Andrew J Vickers","doi":"10.1186/s41512-024-00184-2","DOIUrl":"10.1186/s41512-024-00184-2","url":null,"abstract":"","PeriodicalId":72800,"journal":{"name":"Diagnostic and prognostic research","volume":"9 1","pages":"4"},"PeriodicalIF":0.0,"publicationDate":"2025-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11812191/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143392662","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Models for predicting risk of endometrial cancer: a systematic review.","authors":"Bea Harris Forder, Anastasia Ardasheva, Karyna Atha, Hannah Nentwich, Roxanna Abhari, Christiana Kartsonaki","doi":"10.1186/s41512-024-00178-0","DOIUrl":"10.1186/s41512-024-00178-0","url":null,"abstract":"<p><strong>Background: </strong>Endometrial cancer (EC) is the most prevalent gynaecological cancer in the UK with a rising incidence. Various models exist to predict the risk of developing EC, for different settings and prediction timeframes. This systematic review aims to provide a summary of models and assess their characteristics and performance.</p><p><strong>Methods: </strong>A systematic search of the MEDLINE and Embase (OVID) databases was used to identify risk prediction models related to EC and studies validating these models. Papers relating to predicting the risk of a future diagnosis of EC were selected for inclusion. Study characteristics, variables included in the model, methods used, and model performance, were extracted. The Prediction model Risk-of-Bias Assessment Tool was used to assess model quality.</p><p><strong>Results: </strong>Twenty studies describing 19 models were included. Ten were designed for the general population and nine for high-risk populations. Three models were developed for premenopausal women and two for postmenopausal women. Logistic regression was the most used development method. Three models, all in the general population, had a low risk of bias and all models had high applicability. Most models had moderate (area under the receiver operating characteristic curve (AUC) 0.60-0.80) or high predictive ability (AUC > 0.80) with AUCs ranging from 0.56 to 0.92. Calibration was assessed for five models. Two of these, the Hippisley-Cox and Coupland QCancer models, had high predictive ability and were well calibrated; these models also received a low risk of bias rating.</p><p><strong>Conclusions: </strong>Several models of moderate-high predictive ability exist for predicting the risk of EC, but study quality varies, with most models at high risk of bias. External validation of well-performing models in large, diverse cohorts is needed to assess their utility.</p><p><strong>Registration: </strong>The protocol for this review is available on PROSPERO (CRD42022303085).</p>","PeriodicalId":72800,"journal":{"name":"Diagnostic and prognostic research","volume":"9 1","pages":"3"},"PeriodicalIF":0.0,"publicationDate":"2025-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11792366/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143124016","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Bethany Hillier, Katie Scandrett, April Coombe, Tina Hernandez-Boussard, Ewout Steyerberg, Yemisi Takwoingi, Vladica Velickovic, Jacqueline Dinnes
{"title":"Risk prediction tools for pressure injury occurrence: an umbrella review of systematic reviews reporting model development and validation methods.","authors":"Bethany Hillier, Katie Scandrett, April Coombe, Tina Hernandez-Boussard, Ewout Steyerberg, Yemisi Takwoingi, Vladica Velickovic, Jacqueline Dinnes","doi":"10.1186/s41512-024-00182-4","DOIUrl":"10.1186/s41512-024-00182-4","url":null,"abstract":"<p><strong>Background: </strong>Pressure injuries (PIs) place a substantial burden on healthcare systems worldwide. Risk stratification of those who are at risk of developing PIs allows preventive interventions to be focused on patients who are at the highest risk. The considerable number of risk assessment scales and prediction models available underscores the need for a thorough evaluation of their development, validation, and clinical utility. Our objectives were to identify and describe available risk prediction tools for PI occurrence, their content and the development and validation methods used.</p><p><strong>Methods: </strong>The umbrella review was conducted according to Cochrane guidance. MEDLINE, Embase, CINAHL, EPISTEMONIKOS, Google Scholar, and reference lists were searched to identify relevant systematic reviews. The risk of bias was assessed using adapted AMSTAR-2 criteria. Results were described narratively. All included reviews contributed to building a comprehensive list of risk prediction tools.</p><p><strong>Results: </strong>We identified 32 eligible systematic reviews only seven of which described the development and validation of risk prediction tools for PI. Nineteen reviews assessed the prognostic accuracy of the tools and 11 assessed clinical effectiveness. Of the seven reviews reporting model development and validation, six included only machine learning models. Two reviews included external validations of models, although only one review reported any details on external validation methods or results. This was also the only review to report measures of both discrimination and calibration. Five reviews presented measures of discrimination, such as the area under the curve (AUC), sensitivities, specificities, F1 scores, and G-means. For the four reviews that assessed the risk of bias assessment using the PROBAST tool, all models but one were found to be at high or unclear risk of bias.</p><p><strong>Conclusions: </strong>Available tools do not meet current standards for the development or reporting of risk prediction models. The majority of tools have not been externally validated. Standardised and rigorous approaches to risk prediction model development and validation are needed.</p><p><strong>Trial registration: </strong>The protocol was registered on the Open Science Framework ( https://osf.io/tepyk ).</p>","PeriodicalId":72800,"journal":{"name":"Diagnostic and prognostic research","volume":"9 1","pages":"2"},"PeriodicalIF":0.0,"publicationDate":"2025-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11730812/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142980868","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Uwe M Pommerich, Peter W Stubbs, Jørgen Feldbæk Nielsen
{"title":"Rehabilitation outcomes after comprehensive post-acute inpatient rehabilitation following moderate to severe acquired brain injury-study protocol for an overall prognosis study based on routinely collected health data.","authors":"Uwe M Pommerich, Peter W Stubbs, Jørgen Feldbæk Nielsen","doi":"10.1186/s41512-024-00183-3","DOIUrl":"https://doi.org/10.1186/s41512-024-00183-3","url":null,"abstract":"<p><strong>Background: </strong>The initial theme of the PROGRESS framework for prognosis research is termed overall prognosis research. Its aim is to describe the most likely course of health conditions in the context of current care. These average group-level prognoses may be used to inform patients, health policies, trial designs, or further prognosis research. Acquired brain injury, such as stroke, traumatic brain injury or encephalopathy, is a major cause of disability and functional limitations, worldwide. Rehabilitation aims to maximize independent functioning and meaningful participation in society post-injury. While some observational studies can allow for an inference of the overall prognosis of the level of independent functioning, the context for the provision of rehabilitation is rarely described. The aim of this protocol is to provide a detailed account of the clinical context to aid the interpretation of our upcoming overall prognosis study.</p><p><strong>Methods: </strong>The study will occur at a Danish post-acute inpatient rehabilitation facility providing specialised inpatient rehabilitation for individuals with moderate to severe acquired brain injury. Routinely collected electronic health data will be extracted from the healthcare provider's database and deterministically linked on an individual level to construct the study cohort. The study period spans from March 2011 to December 2022. Four outcomes will measure the level of functioning. Rehabilitation needs will also be described. Outcomes and rehabilitation needs will be described for the entire cohort, across rehabilitation complexity levels and stratified for relevant demographic and clinical parameters. Descriptive statistics will be used to estimate average prognoses for the level of functioning at discharge from post-acute rehabilitation. The patterns of missing data will be investigated.</p><p><strong>Discussion: </strong>This protocol is intended to provide transparency in our upcoming study based on routinely collected clinical data. It will aid in the interpretation of the overall prognosis estimates within the context of our current clinical practice and the assessment of potential sources of bias independently.</p>","PeriodicalId":72800,"journal":{"name":"Diagnostic and prognostic research","volume":"9 1","pages":"1"},"PeriodicalIF":0.0,"publicationDate":"2025-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11706155/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142959579","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Johanna A A Damen, Banafsheh Arshi, Maarten van Smeden, Silvia Bertagnolio, Janet V Diaz, Ronaldo Silva, Soe Soe Thwin, Laure Wynants, Karel G M Moons
{"title":"Validation of prognostic models predicting mortality or ICU admission in patients with COVID-19 in low- and middle-income countries: a global individual participant data meta-analysis.","authors":"Johanna A A Damen, Banafsheh Arshi, Maarten van Smeden, Silvia Bertagnolio, Janet V Diaz, Ronaldo Silva, Soe Soe Thwin, Laure Wynants, Karel G M Moons","doi":"10.1186/s41512-024-00181-5","DOIUrl":"10.1186/s41512-024-00181-5","url":null,"abstract":"<p><strong>Background: </strong>We evaluated the performance of prognostic models for predicting mortality or ICU admission in hospitalized patients with COVID-19 in the World Health Organization (WHO) Global Clinical Platform, a repository of individual-level clinical data of patients hospitalized with COVID-19, including in low- and middle-income countries (LMICs).</p><p><strong>Methods: </strong>We identified eligible multivariable prognostic models for predicting overall mortality and ICU admission during hospital stay in patients with confirmed or suspected COVID-19 from a living review of COVID-19 prediction models. These models were evaluated using data contributed to the WHO Global Clinical Platform for COVID-19 from nine LMICs (Burkina Faso, Cameroon, Democratic Republic of Congo, Guinea, India, Niger, Nigeria, Zambia, and Zimbabwe). Model performance was assessed in terms of discrimination and calibration.</p><p><strong>Results: </strong>Out of 144 eligible models, 140 were excluded due to a high risk of bias, predictors unavailable in LIMCs, or insufficient model description. Among 11,338 participants, the remaining models showed good discrimination for predicting in-hospital mortality (3 models), with areas under the curve (AUCs) ranging between 0.76 (95% CI 0.71-0.81) and 0.84 (95% CI 0.77-0.89). An AUC of 0.74 (95% CI 0.70-0.78) was found for predicting ICU admission risk (one model). All models showed signs of miscalibration and overfitting, with extensive heterogeneity between countries.</p><p><strong>Conclusions: </strong>Among the available COVID-19 prognostic models, only a few could be validated on data collected from LMICs, mainly due to limited predictor availability. Despite their discriminative ability, selected models for mortality prediction or ICU admission showed varying and suboptimal calibration.</p>","PeriodicalId":72800,"journal":{"name":"Diagnostic and prognostic research","volume":"8 1","pages":"17"},"PeriodicalIF":0.0,"publicationDate":"2024-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11656577/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142856909","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Bwambale Jonani, Emmanuel Charles Kasule, Herman Roman Bwire, Gerald Mboowa
{"title":"Reported prevalence and comparison of diagnostic approaches for Candida africana: a systematic review with meta-analysis.","authors":"Bwambale Jonani, Emmanuel Charles Kasule, Herman Roman Bwire, Gerald Mboowa","doi":"10.1186/s41512-024-00180-6","DOIUrl":"10.1186/s41512-024-00180-6","url":null,"abstract":"<p><p>This systematic review and meta-analysis evaluated reported prevalence and diagnostic methods for identifying Candida africana, an opportunistic yeast associated with vaginal and oral candidiasis. A comprehensive literature search yielded 53 studies meeting the inclusion criteria, 2 of which were case studies. The pooled prevalence of C. africana among 20,571 participants was 0.9% (95% CI: 0.7-1.3%), with significant heterogeneity observed (I<sup>2</sup> = 79%, p < 0.01). Subgroup analyses revealed regional variations, with North America showing the highest prevalence (4.6%, 95% CI: 1.8-11.2%). The majority 84.52% of the C. africana have been isolated from vaginal samples, 8.37% from oral samples, 3.77% from urine, 2.09% from glans penis swabs, and 0.42% from rectal swabs, nasal swabs, and respiratory tract expectorations respectively. No C. africana has been isolated from nail samples. Hyphal wall protein 1 gene PCR was the most used diagnostic method for identifying C. africana. It has been used to identify 70% of the isolates. A comparison of methods revealed that the Vitek-2 system consistently failed to differentiate C. africana from Candida albicans, whereas MALDI-TOF misidentified several isolates compared with HWP1 PCR. Factors beyond diagnostic methodology may influence C. africana detection rates. We highlight the importance of adapting molecular methods for resource-limited settings or developing equally accurate but more accessible alternatives for the identification and differentiation of highly similar and cryptic Candida species such as C. africana.</p>","PeriodicalId":72800,"journal":{"name":"Diagnostic and prognostic research","volume":"8 1","pages":"16"},"PeriodicalIF":0.0,"publicationDate":"2024-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11619109/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142787989","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The relative data hungriness of unpenalized and penalized logistic regression and ensemble-based machine learning methods: the case of calibration.","authors":"Peter C Austin, Douglas S Lee, Bo Wang","doi":"10.1186/s41512-024-00179-z","DOIUrl":"10.1186/s41512-024-00179-z","url":null,"abstract":"<p><strong>Background: </strong>Machine learning methods are increasingly being used to predict clinical outcomes. Optimism is the difference in model performance between derivation and validation samples. The term \"data hungriness\" refers to the sample size needed for a modelling technique to generate a prediction model with minimal optimism. Our objective was to compare the relative data hungriness of different statistical and machine learning methods when assessed using calibration.</p><p><strong>Methods: </strong>We used Monte Carlo simulations to assess the effect of number of events per variable (EPV) on the optimism of six learning methods when assessing model calibration: unpenalized logistic regression, ridge regression, lasso regression, bagged classification trees, random forests, and stochastic gradient boosting machines using trees as the base learners. We performed simulations in two large cardiovascular datasets each of which comprised an independent derivation and validation sample: patients hospitalized with acute myocardial infarction and patients hospitalized with heart failure. We used six data-generating processes, each based on one of the six learning methods. We allowed the sample sizes to be such that the number of EPV ranged from 10 to 200 in increments of 10. We applied six prediction methods in each of the simulated derivation samples and evaluated calibration in the simulated validation samples using the integrated calibration index, the calibration intercept, and the calibration slope. We also examined Nagelkerke's R<sup>2</sup>, the scaled Brier score, and the c-statistic.</p><p><strong>Results: </strong>Across all 12 scenarios (2 diseases × 6 data-generating processes), penalized logistic regression displayed very low optimism even when the number of EPV was very low. Random forests and bagged trees tended to be the most data hungry and displayed the greatest optimism.</p><p><strong>Conclusions: </strong>When assessed using calibration, penalized logistic regression was substantially less data hungry than methods from the machine learning literature.</p>","PeriodicalId":72800,"journal":{"name":"Diagnostic and prognostic research","volume":"8 1","pages":"15"},"PeriodicalIF":0.0,"publicationDate":"2024-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11539735/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142585094","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Lasai Barreñada, Paula Dhiman, Dirk Timmerman, Anne-Laure Boulesteix, Ben Van Calster
{"title":"Understanding overfitting in random forest for probability estimation: a visualization and simulation study.","authors":"Lasai Barreñada, Paula Dhiman, Dirk Timmerman, Anne-Laure Boulesteix, Ben Van Calster","doi":"10.1186/s41512-024-00177-1","DOIUrl":"https://doi.org/10.1186/s41512-024-00177-1","url":null,"abstract":"<p><strong>Background: </strong>Random forests have become popular for clinical risk prediction modeling. In a case study on predicting ovarian malignancy, we observed training AUCs close to 1. Although this suggests overfitting, performance was competitive on test data. We aimed to understand the behavior of random forests for probability estimation by (1) visualizing data space in three real-world case studies and (2) a simulation study.</p><p><strong>Methods: </strong>For the case studies, multinomial risk estimates were visualized using heatmaps in a 2-dimensional subspace. The simulation study included 48 logistic data-generating mechanisms (DGM), varying the predictor distribution, the number of predictors, the correlation between predictors, the true AUC, and the strength of true predictors. For each DGM, 1000 training datasets of size 200 or 4000 with binary outcomes were simulated, and random forest models were trained with minimum node size 2 or 20 using the ranger R package, resulting in 192 scenarios in total. Model performance was evaluated on large test datasets (N = 100,000).</p><p><strong>Results: </strong>The visualizations suggested that the model learned \"spikes of probability\" around events in the training set. A cluster of events created a bigger peak or plateau (signal), isolated events local peaks (noise). In the simulation study, median training AUCs were between 0.97 and 1 unless there were 4 binary predictors or 16 binary predictors with a minimum node size of 20. The median discrimination loss, i.e., the difference between the median test AUC and the true AUC, was 0.025 (range 0.00 to 0.13). Median training AUCs had Spearman correlations of around 0.70 with discrimination loss. Median test AUCs were higher with higher events per variable, higher minimum node size, and binary predictors. Median training calibration slopes were always above 1 and were not correlated with median test slopes across scenarios (Spearman correlation - 0.11). Median test slopes were higher with higher true AUC, higher minimum node size, and higher sample size.</p><p><strong>Conclusions: </strong>Random forests learn local probability peaks that often yield near perfect training AUCs without strongly affecting AUCs on test data. When the aim is probability estimation, the simulation results go against the common recommendation to use fully grown trees in random forest models.</p>","PeriodicalId":72800,"journal":{"name":"Diagnostic and prognostic research","volume":"8 1","pages":"14"},"PeriodicalIF":0.0,"publicationDate":"2024-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11437774/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142333691","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Thomas R Fanshawe, Brian D Nicholson, Rafael Perera, Jason L Oke
{"title":"A review of methods for the analysis of diagnostic tests performed in sequence.","authors":"Thomas R Fanshawe, Brian D Nicholson, Rafael Perera, Jason L Oke","doi":"10.1186/s41512-024-00175-3","DOIUrl":"10.1186/s41512-024-00175-3","url":null,"abstract":"<p><strong>Background: </strong>Many clinical pathways for the diagnosis of disease are based on diagnostic tests that are performed in sequence. The performance of the full diagnostic sequence is dictated by the diagnostic performance of each test in the sequence as well as the conditional dependence between them, given true disease status. Resulting estimates of performance, such as the sensitivity and specificity of the test sequence, are key parameters in health-economic evaluations. We conducted a methodological review of statistical methods for assessing the performance of diagnostic tests performed in sequence, with the aim of guiding data analysts towards classes of methods that may be suitable given the design and objectives of the testing sequence.</p><p><strong>Methods: </strong>We searched PubMed, Scopus and Web of Science for relevant papers describing methodology for analysing sequences of diagnostic tests. Papers were classified by the characteristics of the method used, and these were used to group methods into themes. We illustrate some of the methods using data from a cohort study of repeat faecal immunochemical testing for colorectal cancer in symptomatic patients, to highlight the importance of allowing for conditional dependence in test sequences and adjustment for an imperfect reference standard.</p><p><strong>Results: </strong>Five overall themes were identified, detailing methods for combining multiple tests in sequence, estimating conditional dependence, analysing sequences of diagnostic tests used for risk assessment, analysing test sequences in conjunction with an imperfect or incomplete reference standard, and meta-analysis of test sequences.</p><p><strong>Conclusions: </strong>This methodological review can be used to help researchers identify suitable analytic methods for studies that use diagnostic tests performed in sequence.</p>","PeriodicalId":72800,"journal":{"name":"Diagnostic and prognostic research","volume":"8 1","pages":"8"},"PeriodicalIF":0.0,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11370044/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142121261","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}