Easy ensemble classifier-group and intersectional fairness and threshold (EEC-GIFT): a fairness-aware machine learning framework for lung cancer screening eligibility using real-world data.
Piyawan Conahan, Lary A Robinson, Trung Le, Gilmer Valdes, Matthew B Schabath, Margaret M Byrne, Lee Green, Issam El Naqa, Yi Luo
{"title":"Easy ensemble classifier-group and intersectional fairness and threshold (EEC-GIFT): a fairness-aware machine learning framework for lung cancer screening eligibility using real-world data.","authors":"Piyawan Conahan, Lary A Robinson, Trung Le, Gilmer Valdes, Matthew B Schabath, Margaret M Byrne, Lee Green, Issam El Naqa, Yi Luo","doi":"10.1093/jncics/pkaf030","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>We use real-world data to develop a lung cancer screening (LCS) eligibility mechanism that is both accurate and free from racial bias.</p><p><strong>Methods: </strong>Our data came from the Prostate, Lung, Colorectal, and Ovarian (PLCO) cancer screening trial. We built a systematic fairness-aware machine learning framework by integrating a Group and Intersectional Fairness and Threshold (GIFT) strategy with an easy ensemble classifier-(EEC-) or logistic regression-(LR-) based model. The best LCS eligibility mechanism EEC-GIFT* and LR-GIFT* were applied to the testing dataset and their performances were compared to the 2021 US Preventive Services Task Force (USPSTF) criteria and PLCOM2012 model. The equal opportunity difference (EOD) of developing lung cancer between Black and White smokers was used to evaluate mechanism fairness.</p><p><strong>Results: </strong>The fairness of LR-GIFT* or EEC-GIFT* during training was notably greater than that of the LR or EEC models without greatly reducing their accuracy. During testing, the EEC-GIFT* (85.16% vs 78.08%, P < .001) and LR-GIFT* (85.98% vs 78.08%, P < .001) models significantly improved sensitivity without sacrificing specificity compared to the 2021 USPSTF criteria. The EEC-GIFT* (0.785 vs 0.788, P = .28) and LR-GIFT* (0.785 vs 0.788, P = .30) showed similar area under receiver operating characteristic curve values compared to the PLCOM2012 model. While the average EODs between Blacks and Whites were significant for the 2021 USPSTF criteria (0.0673, P < .001), PLCOM2012 (0.0566, P < .001), and LR-GIFT* (0.0081, P < .001), the EEC-GIFT* model was unbiased (0.0034, P = .07).</p><p><strong>Conclusion: </strong>Our EEC-GIFT* LCS eligibility mechanism can significantly mitigate racial biases in eligibility determination without compromising its predictive performance.</p>","PeriodicalId":14681,"journal":{"name":"JNCI Cancer Spectrum","volume":" ","pages":""},"PeriodicalIF":3.4000,"publicationDate":"2025-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11986816/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JNCI Cancer Spectrum","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/jncics/pkaf030","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: We use real-world data to develop a lung cancer screening (LCS) eligibility mechanism that is both accurate and free from racial bias.
Methods: Our data came from the Prostate, Lung, Colorectal, and Ovarian (PLCO) cancer screening trial. We built a systematic fairness-aware machine learning framework by integrating a Group and Intersectional Fairness and Threshold (GIFT) strategy with an easy ensemble classifier-(EEC-) or logistic regression-(LR-) based model. The best LCS eligibility mechanism EEC-GIFT* and LR-GIFT* were applied to the testing dataset and their performances were compared to the 2021 US Preventive Services Task Force (USPSTF) criteria and PLCOM2012 model. The equal opportunity difference (EOD) of developing lung cancer between Black and White smokers was used to evaluate mechanism fairness.
Results: The fairness of LR-GIFT* or EEC-GIFT* during training was notably greater than that of the LR or EEC models without greatly reducing their accuracy. During testing, the EEC-GIFT* (85.16% vs 78.08%, P < .001) and LR-GIFT* (85.98% vs 78.08%, P < .001) models significantly improved sensitivity without sacrificing specificity compared to the 2021 USPSTF criteria. The EEC-GIFT* (0.785 vs 0.788, P = .28) and LR-GIFT* (0.785 vs 0.788, P = .30) showed similar area under receiver operating characteristic curve values compared to the PLCOM2012 model. While the average EODs between Blacks and Whites were significant for the 2021 USPSTF criteria (0.0673, P < .001), PLCOM2012 (0.0566, P < .001), and LR-GIFT* (0.0081, P < .001), the EEC-GIFT* model was unbiased (0.0034, P = .07).
Conclusion: Our EEC-GIFT* LCS eligibility mechanism can significantly mitigate racial biases in eligibility determination without compromising its predictive performance.
目的:我们使用真实世界的数据来开发一种既准确又不存在种族偏见的肺癌筛查(LCS)资格机制。方法:我们的数据来自前列腺、肺、结直肠和卵巢(PLCO)癌症筛查试验。我们通过将组和交叉公平和阈值(GIFT)策略与简单的集成分类器(EEC-)或基于逻辑回归(LR-)的模型集成,构建了一个系统的公平感知机器学习框架。将最佳LCS资格机制EEC-GIFT*和LR-GIFT*应用于测试数据集,并将其性能与2021年美国预防服务工作组(USPSTF)标准和PLCOM2012模型进行比较。采用黑人和白人吸烟者患肺癌的机会均等差异(EOD)来评价机制的公平性。结果:LR- gift *或EEC- gift *在训练过程中的公平性显著高于LR或EEC模型,但未显著降低其准确性。结论:我们的EEC-GIFT* LCS资格判定机制在不影响其预测性能的前提下,显著减轻了资格判定中的种族偏见。