人工智能驱动的结直肠癌化学毒性预测:种族、SDOH和生物衰老的影响

IF 3.4 2区 医学 Q2 ONCOLOGY
Claire Han, Christin Burd, Jesse Plascak, Fode Tounkara, Ashley Rosko, Anne Noonan, Alai Tan, Diane Von Ah, Xia Ning
{"title":"人工智能驱动的结直肠癌化学毒性预测:种族、SDOH和生物衰老的影响","authors":"Claire Han, Christin Burd, Jesse Plascak, Fode Tounkara, Ashley Rosko, Anne Noonan, Alai Tan, Diane Von Ah, Xia Ning","doi":"10.1186/s12885-025-14831-4","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Patients with colorectal cancer (CRC) often experience chemotoxicity that impacts treatment adherence, survival, and quality of life. Early screening for chemotoxicity risk is vital, yet comprehensive predictive models are lacking. The objective of this study was to develop effective artificial intelligence (AI)/machine learning (ML) models, integrating racialized group, social determinants of health (SDOH) (Area Deprivation Index [ADI], employment status), and biological aging (Levine Phenotypic Age) to predict overall, gastrointestinal (GI), and hematological chemotoxicity.</p><p><strong>Methods: </strong>We used electronic health records data from 1,735 adult patients with CRC. Sociodemographic/clinical variables, Levine Phenotypic Age (biological aging), and SDOH (including geospatial variations measured by ADI) were analyzed using descriptive statistics. Associations with chemotoxicity (overall, GI, hematological) were evaluated via univariate tests. Significant predictors from univariate tests were selected for AI/ML modeling. Six supervised ML models were trained on 80% of cases (n = 1,388), with 20% (n = 347) reserved for testing. Performance was assessed via accuracy, area under the curve (AUC), and F1-score. Permutation feature importance ranked predictors to define the most significant predictors of chemotoxicity.</p><p><strong>Results: </strong>Support Vector Machine and XGBoost models demonstrated high accuracy in both the training and test datasets. Notably, the AUC (0.988) was highest for the Support Vector Machine model in predicting overall chemotoxicity within the training dataset. Key predictors of overall and GI toxicities included higher Levine Phenotypic Age, elevated inflammatory markers (e.g., C-reactive protein), and poor SDOH (e.g., higher ADI, unemployment). Hematological toxicity was linked to lower inflammatory markers, higher Levine Phenotypic Age, and younger chronological age. Race (non-Hispanic Black), body mass index, and lifestyle also influenced overall and GI toxicities.</p><p><strong>Conclusions: </strong>ML-based chemotoxicity prediction models incorporating racialized group, SDOH, and biological aging had high accuracy. Greater biological aging, poor SDOH including ADI, and higher inflammation markers were common risk factors for overall and GI chemotoxicity. In contrast, chronological and biological ages and immune/inflammation markers were only linked to hematological chemotoxicity. Integrating these factors into predictive models can help clinicians identify at-risk patients and tailor interventions (e.g., anti-inflammatory, anti-aging strategies) to reduce chemotoxicity and improve survivorship outcomes.</p>","PeriodicalId":9131,"journal":{"name":"BMC Cancer","volume":"25 1","pages":"1513"},"PeriodicalIF":3.4000,"publicationDate":"2025-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"AI-driven chemotoxicity prediction in colorectal cancer: impact of race, SDOH, and biological aging.\",\"authors\":\"Claire Han, Christin Burd, Jesse Plascak, Fode Tounkara, Ashley Rosko, Anne Noonan, Alai Tan, Diane Von Ah, Xia Ning\",\"doi\":\"10.1186/s12885-025-14831-4\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Patients with colorectal cancer (CRC) often experience chemotoxicity that impacts treatment adherence, survival, and quality of life. Early screening for chemotoxicity risk is vital, yet comprehensive predictive models are lacking. The objective of this study was to develop effective artificial intelligence (AI)/machine learning (ML) models, integrating racialized group, social determinants of health (SDOH) (Area Deprivation Index [ADI], employment status), and biological aging (Levine Phenotypic Age) to predict overall, gastrointestinal (GI), and hematological chemotoxicity.</p><p><strong>Methods: </strong>We used electronic health records data from 1,735 adult patients with CRC. Sociodemographic/clinical variables, Levine Phenotypic Age (biological aging), and SDOH (including geospatial variations measured by ADI) were analyzed using descriptive statistics. Associations with chemotoxicity (overall, GI, hematological) were evaluated via univariate tests. Significant predictors from univariate tests were selected for AI/ML modeling. Six supervised ML models were trained on 80% of cases (n = 1,388), with 20% (n = 347) reserved for testing. Performance was assessed via accuracy, area under the curve (AUC), and F1-score. Permutation feature importance ranked predictors to define the most significant predictors of chemotoxicity.</p><p><strong>Results: </strong>Support Vector Machine and XGBoost models demonstrated high accuracy in both the training and test datasets. Notably, the AUC (0.988) was highest for the Support Vector Machine model in predicting overall chemotoxicity within the training dataset. Key predictors of overall and GI toxicities included higher Levine Phenotypic Age, elevated inflammatory markers (e.g., C-reactive protein), and poor SDOH (e.g., higher ADI, unemployment). Hematological toxicity was linked to lower inflammatory markers, higher Levine Phenotypic Age, and younger chronological age. Race (non-Hispanic Black), body mass index, and lifestyle also influenced overall and GI toxicities.</p><p><strong>Conclusions: </strong>ML-based chemotoxicity prediction models incorporating racialized group, SDOH, and biological aging had high accuracy. Greater biological aging, poor SDOH including ADI, and higher inflammation markers were common risk factors for overall and GI chemotoxicity. In contrast, chronological and biological ages and immune/inflammation markers were only linked to hematological chemotoxicity. Integrating these factors into predictive models can help clinicians identify at-risk patients and tailor interventions (e.g., anti-inflammatory, anti-aging strategies) to reduce chemotoxicity and improve survivorship outcomes.</p>\",\"PeriodicalId\":9131,\"journal\":{\"name\":\"BMC Cancer\",\"volume\":\"25 1\",\"pages\":\"1513\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2025-10-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Cancer\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s12885-025-14831-4\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ONCOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Cancer","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12885-025-14831-4","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

背景:结直肠癌(CRC)患者经常经历影响治疗依从性、生存和生活质量的化学毒性。化学毒性风险的早期筛查至关重要,但缺乏全面的预测模型。本研究的目的是开发有效的人工智能(AI)/机器学习(ML)模型,整合种族化群体、健康社会决定因素(SDOH)(区域剥夺指数[ADI]、就业状况)和生物衰老(莱文表型年龄),以预测整体、胃肠道(GI)和血液化学毒性。方法:我们使用了1735名成年结直肠癌患者的电子健康记录数据。采用描述性统计分析社会人口学/临床变量、Levine表型年龄(生物老化)和SDOH(包括ADI测量的地理空间变化)。通过单变量试验评估与化学毒性(总体、GI、血液学)的关系。从单变量检验中选择显著的预测因子进行AI/ML建模。在80%的案例(n = 1388)上训练了6个有监督的ML模型,其中20% (n = 347)保留用于测试。通过准确性、曲线下面积(AUC)和f1评分来评估疗效。排列特征重要性排序预测因子定义最显著的化学毒性预测因子。结果:支持向量机和XGBoost模型在训练和测试数据集上都显示出较高的准确性。值得注意的是,在预测训练数据集中的总体化学毒性时,支持向量机模型的AUC(0.988)最高。总体和胃肠道毒性的关键预测因素包括较高的Levine表型年龄、升高的炎症标志物(如c反应蛋白)和较差的SDOH(如较高的ADI、失业率)。血液学毒性与较低的炎症标志物、较高的莱文表型年龄和较年轻的实足年龄有关。种族(非西班牙裔黑人)、体重指数和生活方式也影响总体和胃肠道毒性。结论:结合种族化基团、SDOH和生物老化的基于ml的化学毒性预测模型具有较高的准确性。较大的生物老化、较差的SDOH(包括ADI)和较高的炎症标志物是总体和胃肠道化学毒性的常见危险因素。相反,时间和生物年龄以及免疫/炎症标志物仅与血液学化学毒性有关。将这些因素整合到预测模型中可以帮助临床医生识别高危患者并定制干预措施(例如,抗炎、抗衰老策略),以减少化学毒性并改善生存结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
AI-driven chemotoxicity prediction in colorectal cancer: impact of race, SDOH, and biological aging.

Background: Patients with colorectal cancer (CRC) often experience chemotoxicity that impacts treatment adherence, survival, and quality of life. Early screening for chemotoxicity risk is vital, yet comprehensive predictive models are lacking. The objective of this study was to develop effective artificial intelligence (AI)/machine learning (ML) models, integrating racialized group, social determinants of health (SDOH) (Area Deprivation Index [ADI], employment status), and biological aging (Levine Phenotypic Age) to predict overall, gastrointestinal (GI), and hematological chemotoxicity.

Methods: We used electronic health records data from 1,735 adult patients with CRC. Sociodemographic/clinical variables, Levine Phenotypic Age (biological aging), and SDOH (including geospatial variations measured by ADI) were analyzed using descriptive statistics. Associations with chemotoxicity (overall, GI, hematological) were evaluated via univariate tests. Significant predictors from univariate tests were selected for AI/ML modeling. Six supervised ML models were trained on 80% of cases (n = 1,388), with 20% (n = 347) reserved for testing. Performance was assessed via accuracy, area under the curve (AUC), and F1-score. Permutation feature importance ranked predictors to define the most significant predictors of chemotoxicity.

Results: Support Vector Machine and XGBoost models demonstrated high accuracy in both the training and test datasets. Notably, the AUC (0.988) was highest for the Support Vector Machine model in predicting overall chemotoxicity within the training dataset. Key predictors of overall and GI toxicities included higher Levine Phenotypic Age, elevated inflammatory markers (e.g., C-reactive protein), and poor SDOH (e.g., higher ADI, unemployment). Hematological toxicity was linked to lower inflammatory markers, higher Levine Phenotypic Age, and younger chronological age. Race (non-Hispanic Black), body mass index, and lifestyle also influenced overall and GI toxicities.

Conclusions: ML-based chemotoxicity prediction models incorporating racialized group, SDOH, and biological aging had high accuracy. Greater biological aging, poor SDOH including ADI, and higher inflammation markers were common risk factors for overall and GI chemotoxicity. In contrast, chronological and biological ages and immune/inflammation markers were only linked to hematological chemotoxicity. Integrating these factors into predictive models can help clinicians identify at-risk patients and tailor interventions (e.g., anti-inflammatory, anti-aging strategies) to reduce chemotoxicity and improve survivorship outcomes.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
BMC Cancer
BMC Cancer 医学-肿瘤学
CiteScore
6.00
自引率
2.60%
发文量
1204
审稿时长
6.8 months
期刊介绍: BMC Cancer is an open access, peer-reviewed journal that considers articles on all aspects of cancer research, including the pathophysiology, prevention, diagnosis and treatment of cancers. The journal welcomes submissions concerning molecular and cellular biology, genetics, epidemiology, and clinical trials.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信