人工智能生存模型用于识别阿扎尔队列人群中发生糖尿病的相关危险因素。

IF 2.8 Q2 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH
Health Promotion Perspectives Pub Date : 2025-05-06 eCollection Date: 2025-05-01 DOI:10.34172/hpp.025.43105
Neda Gilani, Mohammadhossein Somi, Farzaneh Hamidi, Pasqualina Santaguida, Elnaz Faramarzi, Reza Arabi Belaghi
{"title":"人工智能生存模型用于识别阿扎尔队列人群中发生糖尿病的相关危险因素。","authors":"Neda Gilani, Mohammadhossein Somi, Farzaneh Hamidi, Pasqualina Santaguida, Elnaz Faramarzi, Reza Arabi Belaghi","doi":"10.34172/hpp.025.43105","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>This study aimed to identify some risk factors associated with time to diabetes type II events using artificial intelligence (AI) survival models (SM) in a population cohort from East Azerbaijan, Iran.</p><p><strong>Methods: </strong>Data from Azar-Cohort spanning from 2014 to 2020 was analyzed using the random forest (RF) variable selection method along with Cox regression to identify the most relevant risk factors associated with diabetes. We then developed prediction models using RF survival analysis. Lasso-variable selection and RF variable selection were used to select the most important variables. The concordance index (C-index) was used to evaluate the concordance of the prediction models.</p><p><strong>Results: </strong>Our LASSO-Cox regression identified six factors to be significantly associated with diabetes: age, mean corpuscular hemoglobin concentration (MCHC), waist circumference (WC), body mass index (BMI), use of sleep medication, and hypertension stage 1 and stage 2. The model included all variables with a C-index of 76.3%. In contrast, the RF analysis identified 21 important variables predicting a higher probability of having diabetes. Of those, WC, MCHC, triglyceride, and age were the most important predictors of diabetes. The RF model converged after 500 trees with an out-of-bag (OOB) of 0.28 and a C-index of 79.5%.</p><p><strong>Conclusion: </strong>RF machine learning algorithms and LASSO-Cox regression analyses consistently identified WC, hypertension, and MCHC as the main risk factors for developing diabetes. The RF approach demonstrated slightly better accuracy in predicting the likelihood of diabetes at different time points.</p>","PeriodicalId":46588,"journal":{"name":"Health Promotion Perspectives","volume":"15 1","pages":"82-92"},"PeriodicalIF":2.8000,"publicationDate":"2025-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12125507/pdf/","citationCount":"0","resultStr":"{\"title\":\"Artificial intelligence survival models for identifying relevant risk factors for incident diabetes in Azar cohort population.\",\"authors\":\"Neda Gilani, Mohammadhossein Somi, Farzaneh Hamidi, Pasqualina Santaguida, Elnaz Faramarzi, Reza Arabi Belaghi\",\"doi\":\"10.34172/hpp.025.43105\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>This study aimed to identify some risk factors associated with time to diabetes type II events using artificial intelligence (AI) survival models (SM) in a population cohort from East Azerbaijan, Iran.</p><p><strong>Methods: </strong>Data from Azar-Cohort spanning from 2014 to 2020 was analyzed using the random forest (RF) variable selection method along with Cox regression to identify the most relevant risk factors associated with diabetes. We then developed prediction models using RF survival analysis. Lasso-variable selection and RF variable selection were used to select the most important variables. The concordance index (C-index) was used to evaluate the concordance of the prediction models.</p><p><strong>Results: </strong>Our LASSO-Cox regression identified six factors to be significantly associated with diabetes: age, mean corpuscular hemoglobin concentration (MCHC), waist circumference (WC), body mass index (BMI), use of sleep medication, and hypertension stage 1 and stage 2. The model included all variables with a C-index of 76.3%. In contrast, the RF analysis identified 21 important variables predicting a higher probability of having diabetes. Of those, WC, MCHC, triglyceride, and age were the most important predictors of diabetes. The RF model converged after 500 trees with an out-of-bag (OOB) of 0.28 and a C-index of 79.5%.</p><p><strong>Conclusion: </strong>RF machine learning algorithms and LASSO-Cox regression analyses consistently identified WC, hypertension, and MCHC as the main risk factors for developing diabetes. The RF approach demonstrated slightly better accuracy in predicting the likelihood of diabetes at different time points.</p>\",\"PeriodicalId\":46588,\"journal\":{\"name\":\"Health Promotion Perspectives\",\"volume\":\"15 1\",\"pages\":\"82-92\"},\"PeriodicalIF\":2.8000,\"publicationDate\":\"2025-05-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12125507/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Health Promotion Perspectives\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.34172/hpp.025.43105\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/5/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Health Promotion Perspectives","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.34172/hpp.025.43105","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/5/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
引用次数: 0

摘要

背景:本研究旨在利用人工智能(AI)生存模型(SM)在伊朗东阿塞拜疆的人群队列中确定与2型糖尿病事件发生时间相关的一些危险因素。方法:采用随机森林(RF)变量选择法和Cox回归分析2014 - 2020年azar队列数据,确定与糖尿病相关的最相关危险因素。然后,我们利用射频生存分析建立了预测模型。采用套索变量选择法和射频变量选择法选择最重要的变量。采用一致性指数(C-index)评价预测模型的一致性。结果:我们的LASSO-Cox回归确定了6个与糖尿病显著相关的因素:年龄、平均红细胞血红蛋白浓度(MCHC)、腰围(WC)、体重指数(BMI)、睡眠药物的使用以及高血压1期和2期。该模型包含了所有变量,c指数为76.3%。相比之下,射频分析确定了21个重要变量,预测患糖尿病的可能性更高。其中,WC、MCHC、甘油三酯和年龄是糖尿病最重要的预测因子。该模型在500棵树后收敛,出袋率(OOB)为0.28,c指数为79.5%。结论:射频机器学习算法和LASSO-Cox回归分析一致认为WC、高血压和MCHC是发生糖尿病的主要危险因素。RF方法在不同时间点预测糖尿病可能性的准确性稍好。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Artificial intelligence survival models for identifying relevant risk factors for incident diabetes in Azar cohort population.

Artificial intelligence survival models for identifying relevant risk factors for incident diabetes in Azar cohort population.

Artificial intelligence survival models for identifying relevant risk factors for incident diabetes in Azar cohort population.

Artificial intelligence survival models for identifying relevant risk factors for incident diabetes in Azar cohort population.

Background: This study aimed to identify some risk factors associated with time to diabetes type II events using artificial intelligence (AI) survival models (SM) in a population cohort from East Azerbaijan, Iran.

Methods: Data from Azar-Cohort spanning from 2014 to 2020 was analyzed using the random forest (RF) variable selection method along with Cox regression to identify the most relevant risk factors associated with diabetes. We then developed prediction models using RF survival analysis. Lasso-variable selection and RF variable selection were used to select the most important variables. The concordance index (C-index) was used to evaluate the concordance of the prediction models.

Results: Our LASSO-Cox regression identified six factors to be significantly associated with diabetes: age, mean corpuscular hemoglobin concentration (MCHC), waist circumference (WC), body mass index (BMI), use of sleep medication, and hypertension stage 1 and stage 2. The model included all variables with a C-index of 76.3%. In contrast, the RF analysis identified 21 important variables predicting a higher probability of having diabetes. Of those, WC, MCHC, triglyceride, and age were the most important predictors of diabetes. The RF model converged after 500 trees with an out-of-bag (OOB) of 0.28 and a C-index of 79.5%.

Conclusion: RF machine learning algorithms and LASSO-Cox regression analyses consistently identified WC, hypertension, and MCHC as the main risk factors for developing diabetes. The RF approach demonstrated slightly better accuracy in predicting the likelihood of diabetes at different time points.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Health Promotion Perspectives
Health Promotion Perspectives PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH-
CiteScore
7.10
自引率
2.30%
发文量
27
审稿时长
13 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信