Asim Dhungana, Augustin Vannier, Fangyuan Zhao, Jincong Q Freeman, Poornima Saha, Megan Sullivan, Katharine Yao, Elbio M Flores, Olufunmilayo I Olopade, Alexander T Pearson, Dezheng Huo, Frederick M Howard
{"title":"开发并验证用于准确预测复发的临床乳腺癌工具。","authors":"Asim Dhungana, Augustin Vannier, Fangyuan Zhao, Jincong Q Freeman, Poornima Saha, Megan Sullivan, Katharine Yao, Elbio M Flores, Olufunmilayo I Olopade, Alexander T Pearson, Dezheng Huo, Frederick M Howard","doi":"10.1038/s41523-024-00651-5","DOIUrl":null,"url":null,"abstract":"<p><p>Given high costs of Oncotype DX (ODX) testing, widely used in recurrence risk assessment for early-stage breast cancer, studies have predicted ODX using quantitative clinicopathologic variables. However, such models have incorporated only small cohorts. Using a cohort of patients from the National Cancer Database (NCDB, n = 53,346), we trained machine learning models to predict low-risk (0-25) or high-risk (26-100) ODX using quantitative estrogen receptor (ER)/progesterone receptor (PR)/Ki-67 status, quantitative ER/PR status alone, and no quantitative features. Models were externally validated on a diverse cohort of 970 patients (median follow-up 55 months) for accuracy in ODX prediction and recurrence. Comparing the area under the receiver operating characteristic curve (AUROC) in a held-out set from NCDB, models incorporating quantitative ER/PR (AUROC 0.78, 95% CI 0.77-0.80) and ER/PR/Ki-67 (AUROC 0.81, 95% CI 0.80-0.83) outperformed the non-quantitative model (AUROC 0.70, 95% CI 0.68-0.72). These results were preserved in the validation cohort, where the ER/PR/Ki-67 model (AUROC 0.87, 95% CI 0.81-0.93, p = 0.009) and the ER/PR model (AUROC 0.86, 95% CI 0.80-0.92, p = 0.031) significantly outperformed the non-quantitative model (AUROC 0.80, 95% CI 0.73-0.87). Using a high-sensitivity rule-out threshold, the non-quantitative, quantitative ER/PR and ER/PR/Ki-67 models identified 35%, 30% and 43% of patients as low-risk in the validation cohort. Of these low-risk patients, fewer than 3% had a recurrence at 5 years. These models may help identify patients who can forgo genomic testing and initiate endocrine therapy alone. An online calculator is provided for further study.</p>","PeriodicalId":19247,"journal":{"name":"NPJ Breast Cancer","volume":"10 1","pages":"46"},"PeriodicalIF":6.5000,"publicationDate":"2024-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11180107/pdf/","citationCount":"0","resultStr":"{\"title\":\"Development and validation of a clinical breast cancer tool for accurate prediction of recurrence.\",\"authors\":\"Asim Dhungana, Augustin Vannier, Fangyuan Zhao, Jincong Q Freeman, Poornima Saha, Megan Sullivan, Katharine Yao, Elbio M Flores, Olufunmilayo I Olopade, Alexander T Pearson, Dezheng Huo, Frederick M Howard\",\"doi\":\"10.1038/s41523-024-00651-5\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Given high costs of Oncotype DX (ODX) testing, widely used in recurrence risk assessment for early-stage breast cancer, studies have predicted ODX using quantitative clinicopathologic variables. However, such models have incorporated only small cohorts. Using a cohort of patients from the National Cancer Database (NCDB, n = 53,346), we trained machine learning models to predict low-risk (0-25) or high-risk (26-100) ODX using quantitative estrogen receptor (ER)/progesterone receptor (PR)/Ki-67 status, quantitative ER/PR status alone, and no quantitative features. Models were externally validated on a diverse cohort of 970 patients (median follow-up 55 months) for accuracy in ODX prediction and recurrence. Comparing the area under the receiver operating characteristic curve (AUROC) in a held-out set from NCDB, models incorporating quantitative ER/PR (AUROC 0.78, 95% CI 0.77-0.80) and ER/PR/Ki-67 (AUROC 0.81, 95% CI 0.80-0.83) outperformed the non-quantitative model (AUROC 0.70, 95% CI 0.68-0.72). These results were preserved in the validation cohort, where the ER/PR/Ki-67 model (AUROC 0.87, 95% CI 0.81-0.93, p = 0.009) and the ER/PR model (AUROC 0.86, 95% CI 0.80-0.92, p = 0.031) significantly outperformed the non-quantitative model (AUROC 0.80, 95% CI 0.73-0.87). Using a high-sensitivity rule-out threshold, the non-quantitative, quantitative ER/PR and ER/PR/Ki-67 models identified 35%, 30% and 43% of patients as low-risk in the validation cohort. Of these low-risk patients, fewer than 3% had a recurrence at 5 years. These models may help identify patients who can forgo genomic testing and initiate endocrine therapy alone. An online calculator is provided for further study.</p>\",\"PeriodicalId\":19247,\"journal\":{\"name\":\"NPJ Breast Cancer\",\"volume\":\"10 1\",\"pages\":\"46\"},\"PeriodicalIF\":6.5000,\"publicationDate\":\"2024-06-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11180107/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"NPJ Breast Cancer\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1038/s41523-024-00651-5\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ONCOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"NPJ Breast Cancer","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1038/s41523-024-00651-5","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0
摘要
鉴于广泛用于早期乳腺癌复发风险评估的 Oncotype DX(ODX)检测成本高昂,有研究利用定量临床病理变量来预测 ODX。然而,这些模型只纳入了小规模的队列。我们利用全国癌症数据库(NCDB,n = 53,346)中的患者队列,训练了机器学习模型,利用定量雌激素受体(ER)/孕激素受体(PR)/Ki-67 状态、单独定量ER/PR 状态和无定量特征预测低风险(0-25)或高风险(26-100)ODX。在 970 例患者(中位随访 55 个月)的不同队列中对模型进行了外部验证,以确定 ODX 预测和复发的准确性。比较 NCDB 保留集的接收者操作特征曲线下面积(AUROC),包含定量 ER/PR(AUROC 0.78,95% CI 0.77-0.80)和 ER/PR/Ki-67(AUROC 0.81,95% CI 0.80-0.83)的模型优于非定量模型(AUROC 0.70,95% CI 0.68-0.72)。这些结果在验证队列中得以保留,ER/PR/Ki-67 模型(AUROC 0.87,95% CI 0.81-0.93,p = 0.009)和 ER/PR 模型(AUROC 0.86,95% CI 0.80-0.92,p = 0.031)的表现明显优于非定量模型(AUROC 0.80,95% CI 0.73-0.87)。使用高灵敏度排除阈值,非定量、定量 ER/PR 和 ER/PR/Ki-67 模型分别将验证队列中 35%、30% 和 43% 的患者识别为低风险患者。在这些低风险患者中,5 年后复发的患者不到 3%。这些模型可能有助于确定哪些患者可以放弃基因组检测,仅开始内分泌治疗。我们还提供了一个在线计算器供进一步研究。
Development and validation of a clinical breast cancer tool for accurate prediction of recurrence.
Given high costs of Oncotype DX (ODX) testing, widely used in recurrence risk assessment for early-stage breast cancer, studies have predicted ODX using quantitative clinicopathologic variables. However, such models have incorporated only small cohorts. Using a cohort of patients from the National Cancer Database (NCDB, n = 53,346), we trained machine learning models to predict low-risk (0-25) or high-risk (26-100) ODX using quantitative estrogen receptor (ER)/progesterone receptor (PR)/Ki-67 status, quantitative ER/PR status alone, and no quantitative features. Models were externally validated on a diverse cohort of 970 patients (median follow-up 55 months) for accuracy in ODX prediction and recurrence. Comparing the area under the receiver operating characteristic curve (AUROC) in a held-out set from NCDB, models incorporating quantitative ER/PR (AUROC 0.78, 95% CI 0.77-0.80) and ER/PR/Ki-67 (AUROC 0.81, 95% CI 0.80-0.83) outperformed the non-quantitative model (AUROC 0.70, 95% CI 0.68-0.72). These results were preserved in the validation cohort, where the ER/PR/Ki-67 model (AUROC 0.87, 95% CI 0.81-0.93, p = 0.009) and the ER/PR model (AUROC 0.86, 95% CI 0.80-0.92, p = 0.031) significantly outperformed the non-quantitative model (AUROC 0.80, 95% CI 0.73-0.87). Using a high-sensitivity rule-out threshold, the non-quantitative, quantitative ER/PR and ER/PR/Ki-67 models identified 35%, 30% and 43% of patients as low-risk in the validation cohort. Of these low-risk patients, fewer than 3% had a recurrence at 5 years. These models may help identify patients who can forgo genomic testing and initiate endocrine therapy alone. An online calculator is provided for further study.
期刊介绍:
npj Breast Cancer publishes original research articles, reviews, brief correspondence, meeting reports, editorial summaries and hypothesis generating observations which could be unexplained or preliminary findings from experiments, novel ideas, or the framing of new questions that need to be solved. Featured topics of the journal include imaging, immunotherapy, molecular classification of disease, mechanism-based therapies largely targeting signal transduction pathways, carcinogenesis including hereditary susceptibility and molecular epidemiology, survivorship issues including long-term toxicities of treatment and secondary neoplasm occurrence, the biophysics of cancer, mechanisms of metastasis and their perturbation, and studies of the tumor microenvironment.