Salomón Martín Pérez, Flora Sanchez Jimenez, Sandra Fuentes Cantero, Marta Jímenez Barragan, Catalina Sanchez Mora, Juan M Borreguero Leon, Arrobas Velilla Teresa, Agustín Valido Morales, Juan A Delgado Torralbo, Antonio León Justel
{"title":"利用人工智能评估COVID-19关键疾病的遗传易感性:机器学习模型的比较研究","authors":"Salomón Martín Pérez, Flora Sanchez Jimenez, Sandra Fuentes Cantero, Marta Jímenez Barragan, Catalina Sanchez Mora, Juan M Borreguero Leon, Arrobas Velilla Teresa, Agustín Valido Morales, Juan A Delgado Torralbo, Antonio León Justel","doi":"10.1515/almed-2025-0073","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>Early prediction of critical COVID-19 disease is crucial for an optimal clinical management. The objective of this study was to optimize predictive models for critical COVID-19 disease. Clinical data, laboratory data and genetic polymorphisms were integrated into AI models to compare the performance of different machine learning algorithms.</p><p><strong>Methods: </strong>Data from 155 inpatients were analyzed, 23 of whom developed critical disease. A univariate analysis was performed to assess potential correlations between seven SNPs, nine clinical variables and 10 laboratory parameters at admission.</p><p><strong>Results: </strong>Of the 7 SNPs, only three SNPs demonstrated a significant association with critical disase, namely: rs77534576, rs10774671 and rs10490770. The ensemble models exhibited the best performance: Random Forest (AUC=0.989), XGBoost (AUC=0.954) and AdaBoost (AUC=0.927). Variable importance varied across models, with age, C-reactive protein, heart diseases and the three SNPs being the most influential features. The predictive power of models improved with the integration of the three SNPs, as compared to previous studies where genetic data were not included. Internal validation confirmed the superiority and stability of the ensemble models.</p><p><strong>Conclusions: </strong>Machine learning models may help predict progression into critical COVID-19-disease. The predictive power of models improves when SNPs associated with COVID-19 severity are integrated with laboratory and clinical data. Prior to implementation in clinical practice, larger studies in different populations are needed to validate and support the generalization of these results.</p>","PeriodicalId":72097,"journal":{"name":"Advances in laboratory medicine","volume":"6 2","pages":"181-189"},"PeriodicalIF":1.1000,"publicationDate":"2025-05-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12107411/pdf/","citationCount":"0","resultStr":"{\"title\":\"Use of artificial intelligence to assess genetic predisposition to develop critical COVID-19 disease: a comparative study of machine learning models.\",\"authors\":\"Salomón Martín Pérez, Flora Sanchez Jimenez, Sandra Fuentes Cantero, Marta Jímenez Barragan, Catalina Sanchez Mora, Juan M Borreguero Leon, Arrobas Velilla Teresa, Agustín Valido Morales, Juan A Delgado Torralbo, Antonio León Justel\",\"doi\":\"10.1515/almed-2025-0073\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objectives: </strong>Early prediction of critical COVID-19 disease is crucial for an optimal clinical management. The objective of this study was to optimize predictive models for critical COVID-19 disease. Clinical data, laboratory data and genetic polymorphisms were integrated into AI models to compare the performance of different machine learning algorithms.</p><p><strong>Methods: </strong>Data from 155 inpatients were analyzed, 23 of whom developed critical disease. A univariate analysis was performed to assess potential correlations between seven SNPs, nine clinical variables and 10 laboratory parameters at admission.</p><p><strong>Results: </strong>Of the 7 SNPs, only three SNPs demonstrated a significant association with critical disase, namely: rs77534576, rs10774671 and rs10490770. The ensemble models exhibited the best performance: Random Forest (AUC=0.989), XGBoost (AUC=0.954) and AdaBoost (AUC=0.927). Variable importance varied across models, with age, C-reactive protein, heart diseases and the three SNPs being the most influential features. The predictive power of models improved with the integration of the three SNPs, as compared to previous studies where genetic data were not included. Internal validation confirmed the superiority and stability of the ensemble models.</p><p><strong>Conclusions: </strong>Machine learning models may help predict progression into critical COVID-19-disease. The predictive power of models improves when SNPs associated with COVID-19 severity are integrated with laboratory and clinical data. Prior to implementation in clinical practice, larger studies in different populations are needed to validate and support the generalization of these results.</p>\",\"PeriodicalId\":72097,\"journal\":{\"name\":\"Advances in laboratory medicine\",\"volume\":\"6 2\",\"pages\":\"181-189\"},\"PeriodicalIF\":1.1000,\"publicationDate\":\"2025-05-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12107411/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Advances in laboratory medicine\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1515/almed-2025-0073\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/6/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q4\",\"JCRName\":\"MEDICAL LABORATORY TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advances in laboratory medicine","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1515/almed-2025-0073","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/6/1 0:00:00","PubModel":"eCollection","JCR":"Q4","JCRName":"MEDICAL LABORATORY TECHNOLOGY","Score":null,"Total":0}
Use of artificial intelligence to assess genetic predisposition to develop critical COVID-19 disease: a comparative study of machine learning models.
Objectives: Early prediction of critical COVID-19 disease is crucial for an optimal clinical management. The objective of this study was to optimize predictive models for critical COVID-19 disease. Clinical data, laboratory data and genetic polymorphisms were integrated into AI models to compare the performance of different machine learning algorithms.
Methods: Data from 155 inpatients were analyzed, 23 of whom developed critical disease. A univariate analysis was performed to assess potential correlations between seven SNPs, nine clinical variables and 10 laboratory parameters at admission.
Results: Of the 7 SNPs, only three SNPs demonstrated a significant association with critical disase, namely: rs77534576, rs10774671 and rs10490770. The ensemble models exhibited the best performance: Random Forest (AUC=0.989), XGBoost (AUC=0.954) and AdaBoost (AUC=0.927). Variable importance varied across models, with age, C-reactive protein, heart diseases and the three SNPs being the most influential features. The predictive power of models improved with the integration of the three SNPs, as compared to previous studies where genetic data were not included. Internal validation confirmed the superiority and stability of the ensemble models.
Conclusions: Machine learning models may help predict progression into critical COVID-19-disease. The predictive power of models improves when SNPs associated with COVID-19 severity are integrated with laboratory and clinical data. Prior to implementation in clinical practice, larger studies in different populations are needed to validate and support the generalization of these results.