Development and performance of female breast cancer incidence risk prediction models: a systematic review and meta-analysis.

IF 4.3
Annals of medicine Pub Date : 2025-12-01 Epub Date: 2025-07-20 DOI:10.1080/07853890.2025.2534522
Liyuan Liu, Peng Zhou, Lijuan Hou, Chunyu Kao, Ziyu Zhang, Di Wang, Lixiang Yu, Fei Wang, Yongjiu Wang, Zhigang Yu
{"title":"Development and performance of female breast cancer incidence risk prediction models: a systematic review and meta-analysis.","authors":"Liyuan Liu, Peng Zhou, Lijuan Hou, Chunyu Kao, Ziyu Zhang, Di Wang, Lixiang Yu, Fei Wang, Yongjiu Wang, Zhigang Yu","doi":"10.1080/07853890.2025.2534522","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Accurate breast cancer risk prediction is essential for early detection and personalized prevention strategies. While traditional models, such as Gail and Tyrer-Cuzick, are widely utilized, machine learning-based approaches may offer enhanced predictive performance. This systematic review and meta-analysis compare the accuracy of traditional statistical models and machine learning models in breast cancer risk prediction.</p><p><strong>Methods: </strong>A total of 144 studies from 27 countries were systematically reviewed, incorporating genetic, clinical, and imaging data. Pooled C-statistics were calculated to assess model discrimination, while observed-to-expected (O/E) ratios were used to evaluate calibration. Subgroup and sensitivity analyses were conducted to examine heterogeneity and assess the influence of study bias across various populations.</p><p><strong>Results: </strong>Machine learning-based models demonstrated superior performance, with a pooled C-statistic of 0.74, compared to 0.67 for traditional models. Models that integrated genetic and imaging data showed the highest levels of accuracy, although performance varied by population. Sensitivity analyses excluding high-bias studies showed improved discrimination in models incorporating genetic factors, with the pooled C-statistic increasing to 0.72. Traditional models, such as Gail, exhibited notably poor predictive accuracy in non-Western populations, as evidenced by a C-statistic of 0.543 in Chinese cohorts.</p><p><strong>Conclusion: </strong>Machine learning models provide significantly greater predictive accuracy for breast cancer risk, particularly when incorporating multidimensional data. However, issues related to model generalizability and interpretability remain, particularly in diverse populations. Future research should focus on developing more interpretable models and expanding global validation efforts to improve model applicability across different demographic groups.</p>","PeriodicalId":93874,"journal":{"name":"Annals of medicine","volume":"57 1","pages":"2534522"},"PeriodicalIF":4.3000,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12278464/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annals of medicine","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/07853890.2025.2534522","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/7/20 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Introduction: Accurate breast cancer risk prediction is essential for early detection and personalized prevention strategies. While traditional models, such as Gail and Tyrer-Cuzick, are widely utilized, machine learning-based approaches may offer enhanced predictive performance. This systematic review and meta-analysis compare the accuracy of traditional statistical models and machine learning models in breast cancer risk prediction.

Methods: A total of 144 studies from 27 countries were systematically reviewed, incorporating genetic, clinical, and imaging data. Pooled C-statistics were calculated to assess model discrimination, while observed-to-expected (O/E) ratios were used to evaluate calibration. Subgroup and sensitivity analyses were conducted to examine heterogeneity and assess the influence of study bias across various populations.

Results: Machine learning-based models demonstrated superior performance, with a pooled C-statistic of 0.74, compared to 0.67 for traditional models. Models that integrated genetic and imaging data showed the highest levels of accuracy, although performance varied by population. Sensitivity analyses excluding high-bias studies showed improved discrimination in models incorporating genetic factors, with the pooled C-statistic increasing to 0.72. Traditional models, such as Gail, exhibited notably poor predictive accuracy in non-Western populations, as evidenced by a C-statistic of 0.543 in Chinese cohorts.

Conclusion: Machine learning models provide significantly greater predictive accuracy for breast cancer risk, particularly when incorporating multidimensional data. However, issues related to model generalizability and interpretability remain, particularly in diverse populations. Future research should focus on developing more interpretable models and expanding global validation efforts to improve model applicability across different demographic groups.

女性乳腺癌发病风险预测模型的发展与性能:系统回顾与荟萃分析。
准确的乳腺癌风险预测对于早期发现和个性化预防策略至关重要。虽然Gail和Tyrer-Cuzick等传统模型被广泛使用,但基于机器学习的方法可能会提高预测性能。本系统综述和荟萃分析比较了传统统计模型和机器学习模型在乳腺癌风险预测中的准确性。方法:系统回顾了来自27个国家的144项研究,包括遗传学、临床和影像学资料。计算合并c统计量来评估模型判别,而使用观察到的期望(O/E)比率来评估校准。进行亚组和敏感性分析以检验异质性并评估研究偏倚对不同人群的影响。结果:基于机器学习的模型表现出优越的性能,其汇集c统计量为0.74,而传统模型为0.67。整合了遗传和成像数据的模型显示出最高水平的准确性,尽管表现因人群而异。排除高偏倚研究的敏感性分析显示,在纳入遗传因素的模型中,鉴别性得到改善,合并c统计量增加到0.72。传统模型,如Gail,在非西方人群中表现出明显较差的预测准确性,中国队列的c统计量为0.543。结论:机器学习模型对乳腺癌风险的预测精度显著提高,特别是在结合多维数据时。然而,与模型的普遍性和可解释性有关的问题仍然存在,特别是在不同的人群中。未来的研究应侧重于开发更多可解释的模型,并扩大全球验证工作,以提高模型在不同人口群体中的适用性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信