Making the best use of quantitative fecal immunochemical test results in colorectal cancer screening
Hermann Brenner, Michael Hoffmeister
Journal of Internal Medicine, volume 296, issue 2, pages 118-120. Published 2024-06-18.
DOI: 10.1111/joim.13812
https://onlinelibrary.wiley.com/doi/10.1111/joim.13812
Abstract
Fecal immunochemical tests (FITs) have become the most widely used tests for colorectal cancer (CRC) screening [1]. They detect the vast majority of CRCs and some proportion of advanced precancerous neoplasms [2], and modeling studies suggest that annual or biennial FIT-based screening programs have the potential to substantially lower the burden of CRC incidence and mortality [3]. Yet uncertainty prevails about the optimal choice of a number of key parameters of FIT-based screening programs, such as the starting age of screening, screening intervals, and FIT positivity thresholds. In this issue, Westerberg et al. report highly valuable results from the baseline exam of the large Swedish SCREESCO screening trial that may inform the design and planning of screening programs [4]. In particular, the study allows thorough evaluation of the tradeoffs that arise when the FIT positivity threshold is raised from 10 μg hemoglobin (Hb)/g feces to higher levels: the positive predictive value (PPV) increases and the number needed to undergo colonoscopy (“number needed to scope”, NNS) decreases on the one hand, whereas sensitivity decreases on the other. These data may be valuable for multiple purposes, including providing key background information for more comprehensive modeling of the effectiveness and cost-effectiveness of various screening strategies.
However, a number of additional factors require careful consideration when interpreting the results. In the SCREESCO trial, two FITs were applied per screening round, and the overall test result was rated as positive if at least one of the two FITs showed an Hb concentration >10 μg/g feces in the baseline scenario, or >20, 40, 60, 80, 120, or 160 μg/g in the alternative scenarios. By contrast, most screening programs employ just one FIT per screening round. Defining the result as positive if at least one of two tests is positive increases sensitivity and decreases specificity compared with a single test. This implies that comparable positivity rates and sensitivity would be expected at somewhat lower cutoffs in one-sample than in two-sample testing, which should be kept in mind when interpreting the presented data. It remains an open question whether two-sample testing is worth the extra effort and cost. Possibly, results (almost) equivalent to those reported by Westerberg et al. could be obtained with one-sample testing by lowering the FIT cutoff [5]. Further analyses of the dataset by Westerberg et al. may offer unique opportunities to answer this question.
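The effect of the "at least one of two samples positive" rule can be sketched numerically. The following is a minimal illustration and not an analysis of the SCREESCO data: it assumes that the two samples are conditionally independent given disease status (which probably overstates the effect, because repeated samples from the same person are correlated), and the per-sample sensitivity and specificity values are hypothetical.

```python
def either_positive(sens_single: float, spec_single: float) -> tuple[float, float]:
    """Sensitivity and specificity of a round rated positive if at least
    one of two samples is positive.

    Assumption (not from the source): the two samples are conditionally
    independent given true disease status.
    """
    # A diseased person is missed only if both samples are negative.
    sens_two = 1.0 - (1.0 - sens_single) ** 2
    # A disease-free person is correctly negative only if both samples are negative.
    spec_two = spec_single ** 2
    return sens_two, spec_two


# Hypothetical per-sample values at some fixed cutoff.
sens_two, spec_two = either_positive(sens_single=0.30, spec_single=0.95)
print(f"two-sample sensitivity: {sens_two:.4f}")
print(f"two-sample specificity: {spec_two:.4f}")
```

Under these assumptions, sensitivity rises and specificity falls relative to a single sample, which is the direction of change described above. How large the shift is in practice depends on the correlation between samples, which this sketch ignores.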
Another important aspect to keep in mind is that all the results reported by Westerberg et al. refer to first-round FIT screening. With annual or biennial FIT-based screening, as recommended and practiced in many countries, the prevalence of advanced neoplasms will decrease in subsequent screening rounds. Although this may have only a limited impact on sensitivity and specificity, PPVs would be expected to be lower and NNS higher in subsequent screening rounds. Moreover, the starting age of screening in the SCREESCO trial was 60 years, whereas most screening programs start much earlier, for example, at 50 or even 45 years of age [6]. As the prevalence of colorectal neoplasms is lower at younger ages, PPVs would again be expected to be lower and NNS higher than those reported from the SCREESCO trial.
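The dependence of PPV and NNS on prevalence follows from Bayes' theorem and can be illustrated with a short sketch. The sensitivity, specificity, and prevalence values below are hypothetical and chosen only to show the direction of the effect. NNS is taken here as the reciprocal of PPV, that is, the number of FIT-positive participants who undergo colonoscopy per participant found to have the target neoplasm.

```python
def ppv(prevalence: float, sensitivity: float, specificity: float) -> float:
    """Positive predictive value from prevalence, sensitivity, and specificity."""
    true_pos = prevalence * sensitivity
    false_pos = (1.0 - prevalence) * (1.0 - specificity)
    return true_pos / (true_pos + false_pos)


def nns(prevalence: float, sensitivity: float, specificity: float) -> float:
    """Number needed to scope, taken here as 1 / PPV."""
    return 1.0 / ppv(prevalence, sensitivity, specificity)


# Hypothetical test characteristics, held constant across scenarios.
SENSITIVITY = 0.40
SPECIFICITY = 0.95

# Hypothetical prevalences, e.g. a first round at older age versus
# later rounds or a younger starting age.
for prevalence in (0.10, 0.07, 0.04):
    print(
        f"prevalence {prevalence:.2f}: "
        f"PPV {ppv(prevalence, SENSITIVITY, SPECIFICITY):.3f}, "
        f"NNS {nns(prevalence, SENSITIVITY, SPECIFICITY):.2f}"
    )
```

With sensitivity and specificity held fixed, PPV falls and NNS rises as prevalence drops, which is why first-round results at a starting age of 60 years cannot be transferred directly to later rounds or younger populations.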
Another very relevant question addressed by Westerberg et al. is whether and how sex differences in CRC epidemiology or screening test performance should be reflected in the interpretation of FIT results. It is well established that men have a higher age-specific and age-standardized incidence and prevalence of colorectal neoplasms than women [7]. This implies that, at any given age and FIT cutoff, positivity rates and PPVs are expected to be higher and NNS lower for men than for women, an observation that was also made in the study by Westerberg et al. These sex differences could be accounted for by using a lower cutoff for men than for women, which could achieve equal PPVs and NNS and make the most efficient use of limited colonoscopy capacity. A potential drawback of this approach, though, would be that women would have a lower chance than men of having their neoplasms detected early or prevented. In an attempt to achieve “gender fairness” in terms of equal positivity rates for women and men, the Swedish screening program of Stockholm–Gotland even set a higher positivity threshold for men (80 μg/g) than for women (40 μg/g) [8]. However, such an approach increases rather than decreases sex discrepancies in PPVs and NNS. The resulting high NNS for women may be a rather inefficient use of limited colonoscopy capacity and appears to require careful reconsideration.
A limitation of the study by Westerberg et al., which is carefully discussed by the authors, is the lack of colonoscopy results for participants with fecal Hb concentrations below 10 μg/g feces. As a result, only relative rather than absolute sensitivities could be derived, and it is unclear how the prevalence of neoplasms among participants in the various categories of FIT positivity compares with that among the vast majority of screening participants, who are FIT negative. Such information can be derived from other studies conducted in the setting of screening colonoscopy [9]. These data show that even people in the “low positive range”, with fecal Hb concentrations between 10 and 25 μg/g feces, have a 3.5-fold risk of carrying any advanced neoplasm (AN) compared with the vast majority (>80%) of screening participants, whose Hb concentrations are below 8 μg/g feces. This strongly increased risk already in the “low positive range” suggests that the higher PPVs and lower NNS that Westerberg et al. observed with higher FIT cutoffs, relative to the 10 μg Hb/g cutoff, should not be interpreted as support for higher FIT cutoffs. Although higher cutoffs may, in some instances, be inevitable because of limited colonoscopy resources, colonoscopic follow-up appears to be warranted when FIT indicates a ≥3.5-fold increased risk of AN compared with the vast majority of the screening population.
The ultimate goal of CRC screening should be to lower CRC incidence and mortality as much and as efficiently as possible. Randomized controlled trials (RCTs) require very large sample sizes and a long time from conception until long-term incidence and mortality outcomes become available, even if only two screening strategies are compared. This strongly limits the use of RCTs for evaluating innovative screening approaches. Well-designed modeling studies may be a promising and rational complementary approach for the timely evaluation of novel screening strategies [10]. The detailed results on the diagnostic performance of FIT reported by Westerberg et al., along with data from screening colonoscopy cohorts, may be highly valuable in informing such modeling studies for FIT-based and alternative screening approaches.
Hermann Brenner: Writing—original draft; writing—review and editing; conceptualization. Michael Hoffmeister: Writing—review and editing.
The authors have no conflicts of interest to declare.
About the journal:
JIM – The Journal of Internal Medicine, in continuous publication since 1863, is an international, peer-reviewed scientific journal. It publishes original work in clinical science, spanning from bench to bedside, encompassing a wide range of internal medicine and its subspecialties. JIM showcases original articles, reviews, brief reports, and research letters in the field of internal medicine.